
Inference Optimization Jobs in the United States

Discover Inference Optimization roles in the United States on Inference Jobs and apply today.

4w ago

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) $160K – $230K Yearly
3mo ago

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid) $380K – $380K Yearly
4w ago

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site) $380K – $555K Yearly
3mo ago

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210K – $385K Yearly
2mo ago

Senior DL Algorithms Engineer - Inference Performance

NVIDIA

Santa Clara, California, United States (On-site) $184K – $356.5K Yearly
3mo ago

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
3mo ago

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site) $325K – $490K Yearly
3w ago

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site) $350K – $475K Yearly
2mo ago

Sr. MTS - Inference ML Eng

Cerebras

Sunnyvale, California, United States (On-site)
2mo ago

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site) $152K – $241.5K Yearly
3mo ago

Inference Engineer

Cartesia

San Francisco, California, United States (On-site) $180K – $250K Yearly
2mo ago

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site) $272K – $431.3K Yearly
3mo ago

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site) $325K – $490K Yearly