Low-latency Retrieval Jobs
Explore Low-latency Retrieval roles on Inference Jobs and apply today.
3w agoSC
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
3w agoAN
Engineering Manager, Inference Routing and Performance
Anthropic
San Francisco, California, United States (Hybrid)$405K – $485K Yearly
2w agoAN
Research Engineer, Performance RL
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2w agoTM
Research Engineer, Infrastructure, RL Systems
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
1mo agoCE
3w agoSC
Machine Learning Research Scientist / Research Engineer, Post-Training
Scale
San Francisco, California, United States (On-site)$252K – $315K Yearly
3w agoSC
ML Research Engineer, ML Systems
Scale
San Francisco, California, United States (On-site)$218.4K – $273K Yearly
1mo agoNV
Senior Deep Learning Scientist, Multimodal Conversational AI
NVIDIA
Santa Clara, California, United States (On-site)$184K – $287.5K Yearly
6d agoAN
Research Engineer, Machine Learning (Horizons)
Anthropic
San Francisco, California, United States (Hybrid)$500K – $850K Yearly
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
1mo agoNV
Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$168K – $264.5K Yearly
3w agoNV
Deep Learning Engineer - LLM and VLM Model Compression
NVIDIA
Warszawa, Masovian Voivodeship, Poland (On-site)zł 292.5K – zł 650K Yearly
1mo agoNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
1mo agoAA
Senior AI Researcher- Reinforcement learning (f/m/d)
Aleph Alpha
Heidelberg, Baden-Württemberg, Germany (Hybrid)
1w agoTA
Senior Machine Learning Engineer, Voice AI
Together AI
San Francisco, California, United States (On-site)$200K – $260K Yearly
3w agoVA
3w agoSC
Machine Learning Research Scientist/ Engineer, Agents
Scale
San Francisco, California, United States (On-site)$275K – $350K Yearly
19h agoNV
Senior Product Manager, AI Inference - Dynamo
NVIDIA
Santa Clara, California, United States (On-site)$208K – $327.8K Yearly
6d agoAN
Research Engineer, Pretraining Scaling (London)
Anthropic
London, England, United Kingdom (On-site)£260K – £630K Yearly