Inference-Time Compute Jobs
Browse 542 Inference-Time Compute jobs on Inference Jobs.
21-40 of 542 jobs
1wPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
3wXA
Software Engineer - Applied Inference
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
1wOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325k – $490k Yearly
1wTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200k – $280k Yearly
6dCA
2wPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210k – $385k Yearly
6dTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
6dNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
6dVA
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site)$160k – $320k Yearly
3dNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly