Inference-Time Compute Jobs
Browse 498 Inference-Time Compute jobs on Inference Jobs.
141-160 of 498 jobs
2wNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
7dAI
ML Runtime Optimization Engineer
Applied Intuition
Mountain View, California, United States (On-site)$159.1k – $199.3k Yearly
2wBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150k – $250k Yearly
3wAN
[P] Compute Efficiency Engineer
Anthropic
San Francisco, California, United States (Hybrid)$1 – $2 Yearly
2wAI
Machine Learning Engineer - Defense
Applied Intuition
Sunnyvale, California, United States (On-site)$150k – $225k Yearly
2wNV
Compiler Verification Engineer, Compute Performance – GPU
NVIDIA
Austin, Texas, United States (On-site)$140k – $224.3k Yearly
2wD-
AI / ML System Software Engineer, Senior Staff
d-Matrix
Santa Clara, California, United States (Hybrid)$180k – $280k Yearly
2wOP
Strategic Finance, Compute
OpenAI
San Francisco, California, United States (Hybrid)$210k – $265k Yearly
7dNV
Senior Compiler Engineer - Compute Front-End
NVIDIA
Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
7dTM
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
1wTA
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)$190k – $270k Yearly
2wOP
Software Engineer, Hardware
OpenAI
San Francisco, California, United States (Hybrid)$310k – $460k Yearly