1. Home
  2. Jobs
  3. Distributed Inference

Distributed Inference Jobs

Browse 665 Distributed Inference jobs on Inference Jobs.

141-160 of 665 jobs

1wNV

Senior Deep Learning Engineer - Model Evaluation & AI Systems

NVIDIA

Santa Clara, California, United States (On-site)$224k – $431.3k Yearly
3dNV

Senior Deep Learning Compiler Engineer - XLA

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wBA

Senior Product Engineer - Training Platform

Baseten

San Francisco, California, United States (On-site)$200k – $275k Yearly
1wCE

Performance Engineer

Cerebras

Toronto, Ontario, Canada (On-site)
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
2wCO

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)
3dNV

Lead Principal Engineer, Enterprise Agentic AI Platform

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
2wMO

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)$150k – $270k Yearly
5dBA

Senior Software Engineer - New Products

Baseten

San Francisco, California, United States (On-site)$185k – $285k Yearly
2wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
1wMA

AI Scientist - Warsaw

Mistral AI

Warszawa, Masovian Voivodeship, Poland (Hybrid)
6dNV

Senior Compiler Engineer, AI Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
1wAN

Engineering Manager, ML Acceleration

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
3wGR

Senior Staff Engineer

Graphcore

Bristol, England, United Kingdom (On-site)
2wOP

Software Engineer, Platform Systems

OpenAI

San Francisco, California, United States (On-site)$310k – $460k Yearly
3dNV

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site)C$225k – C$275k Yearly
1wAN

Performance Engineer, GPU

Anthropic

San Francisco, California, United States (Hybrid)$315k – $560k Yearly
2wCO

Member of Technical Staff, Modeling

Cohere

London, England, United Kingdom or Remote (Worldwide)