Inference-Time Compute Jobs
Explore Inference-Time Compute roles on Inference Jobs and apply today.
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
1mo agoD-
Principal Architect, Performance Analysis and Modeling
d-Matrix
Santa Clara, California, United States (Hybrid)$190K – $280K Yearly
3w agoTA
Senior Backend Engineer, Inference Platform
Together AI
San Francisco, California, United States (On-site)$160K – $250K Yearly
2mo agoNV
Senior Systems Software Engineer - Deep Learning Solutions
NVIDIA
Toronto, Ontario, Canada (On-site)C$225K – C$275K Yearly
4w agoVA
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site)$160K – $320K Yearly
2mo agoAN
Staff + Senior Software Engineer, Cloud Inference
Anthropic
San Francisco, California, United States (Hybrid)$300K – $485K Yearly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
2w agoNV
2mo agoNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
2mo agoNV
Senior Compiler Engineer - Compute Front-End
NVIDIA
Santa Clara, California, United States (On-site)$152K – $287.5K Yearly
2mo agoNV
3w agoTM
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly