1. Home
  2. Jobs
  3. United States
  4. California
  5. Inference Optimization

Inference Optimization Jobs in California, United States

Discover Inference Optimization roles in California, United States on Inference Jobs and apply today.

4w agoTE

Sr. Engineer, Software - AI Compiler

Tenstorrent

Santa Clara, California, United States (Hybrid)$100K – $500K Yearly
3w agoTM

Research, Pre-Training Science

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly
2d agoNV

CPU Performance Architect

NVIDIA

Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
1mo agoNE
3w agoTM

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly
1mo agoNV
2w agoAN

Engineering Manager, ML Acceleration

Anthropic

San Francisco, California, United States (Hybrid)$500K – $850K Yearly
2w agoAN

Performance Engineer, GPU

Anthropic

San Francisco, California, United States (Hybrid)$280K – $850K Yearly
3mo agoOP
3w agoAN

Software Engineer, Cloud Inference Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$405K – $485K Yearly
3w agoTM

Research Engineer, Infrastructure, Training Systems

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly
2mo agoNV
3mo agoOP

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site)$255K – $405K Yearly
4w agoTA

Systems Research Engineer, GPU Programming

Together AI

San Francisco, California, United States (Hybrid)$160K – $230K Yearly
2mo agoBA

Software Engineer — GPU Networking & Distributed Systems

Baseten

San Francisco, California, United States (On-site)$150K – $250K Yearly
2w agoNV

NVIDIA 2026 Internships: PhD Autonomous Vehicles Research - US

NVIDIA

Santa Clara, California, United States (On-site)$30 – $94 Hourly