AI Model Inference Jobs

Browse 1,278 AI Model Inference jobs on Inference Jobs.

1w

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site) | $184k – $287.5k Yearly

6d

Senior Deep Learning Engineer - Model Evaluation & AI Systems

NVIDIA

Santa Clara, California, United States (On-site) | $224k – $431.3k Yearly

2w

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid) | $380k – $380k Yearly

2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) | $210k – $385k Yearly

2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

3w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)

3w

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site) | $184k – $287.5k Yearly

2w

AI Scientist, BioMedical AI

Xaira Therapeutics

South San Francisco, California, United States (On-site) | $150k – $240k Yearly

5d

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid) | $425k – $560k Yearly

4w

Agentic AI Solution Engineering Intern - Summer 2026

NVIDIA

Austin, Texas, United States (On-site) | $20 – $71 Hourly

4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States) | $30 – $59 Hourly

4w

Technical Marketing Engineer, World Models - AV Physical AI

NVIDIA

Santa Clara, California, United States (On-site) | $148k – $287.5k Yearly

2w

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more) | $204k – $247k Yearly

2w

Fullstack Engineer, Applied AI

LangChain

San Francisco, California, United States (On-site) | $170k – $195k Yearly

1w

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site) | $325k – $490k Yearly

6d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) | $455k – $555k Yearly

2w

Research Engineer, Privacy

OpenAI

San Francisco, California, United States (On-site) | $380k – $460k Yearly

6d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) | $160k – $230k Yearly