Inference-Time Compute Jobs
Explore Inference-Time Compute roles on Inference Jobs and apply today.
3w ago
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)
$160K – $230K Yearly
3mo ago
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, California, United States or Remote (California, United States)
$30 – $59 Hourly
2mo ago
Senior Compiler Engineer, AI Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)
$152K – $241.5K Yearly
2mo ago
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)
$200K – $280K Yearly
3w ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160K – $230K Yearly
1mo ago
Member of Technical Staff, Inference & RL Systems
Magic
San Francisco, California, United States (On-site)
$225K – $550K Yearly
3mo ago
Site Reliability Engineer, Inference Infrastructure
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
3mo ago
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)
$325K – $490K Yearly
2mo ago
Software Engineer - Applied Inference
xAI
Palo Alto, California, United States (On-site)
$180K – $440K Yearly
1mo ago
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124K – $241.5K Yearly
3mo ago
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)
$210K – $385K Yearly
2mo ago
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)
$152K – $241.5K Yearly
3w ago
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350K – $475K Yearly