Inference Optimization Jobs in the United States

Discover Inference Optimization roles in the United States on Inference Jobs and apply today.

2mo ago

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152K – $287.5K Yearly
2mo ago

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152K – $287.5K Yearly
2mo ago

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) $200K – $280K Yearly
2mo ago

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site) $152K – $241.5K Yearly
5d ago

Engineering Lead, Inference Platform

Cerebras

Sunnyvale, California, United States (On-site)
4w ago

Inference Technical Lead, On-Device Transformers

OpenAI

San Francisco, California, United States (Hybrid) $445K – $445K Yearly
4w ago

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site) $160K – $230K Yearly
2w ago

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid) $425K – $560K Yearly
3d ago

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA

Santa Clara, California, United States (On-site) $184K – $356.5K Yearly
3mo ago

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site) $175K – $280K Yearly
3mo ago

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
3mo ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site) $150K – $250K Yearly
2mo ago

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid) $152K – $287.5K Yearly