Model Inference Jobs

Browse 825 Model Inference jobs on Inference Jobs.

101-120 of 825 jobs

2w

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site) $220k – $285k Yearly

6d

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid) $280k – $560k Yearly

2w

Product Marketing Manager, CoreWeave Inference

CoreWeave

Livingston, New Jersey, United States (Hybrid) $143k – $210k Yearly

4d

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site) $152k – $241.5k Yearly

6d

Principal Engineer, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid) $206k – $303k Yearly

3w

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)

2w

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid) $184k – $356.5k Yearly

2w

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site) $325k – $490k Yearly

2w

Senior ASIC Design Verification Engineer

NVIDIA

California, United States (Hybrid) $168k – $310.5k Yearly

2w

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)

6d

Principal ML Investigator

Cerebras

Sunnyvale, California, United States (On-site)

3d

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid) $139k – $204k Yearly

6d

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid) €295k – €355k Yearly

6d

Senior Software Engineer II, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid) $165k – $242k Yearly

2w

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid) $152k – $287.5k Yearly

5d

Senior Software Engineer, AI Inference Systems

NVIDIA

Toronto, Ontario, Canada (Hybrid) C$170k – C$275k Yearly

3w

Software Engineer, Inference Deployment

Anthropic

San Francisco, California, United States (Hybrid) $320k – $485k Yearly

6d

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid) $300k – $485k Yearly

2w

Software Engineer, Technical Lead, Inference

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)

4d

Machine Learning, Platform Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $250k Yearly