1. Home
  2. Jobs
  3. Low-latency Retrieval

Low-latency Retrieval Jobs

Explore Low-latency Retrieval roles on Inference Jobs and apply today.

3w agoSC

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
2w agoAN

Research Engineer, Performance RL

Anthropic

San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2w agoTM

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly
3w agoSC
3w agoSC

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site)$218.4K – $273K Yearly
6d agoAN

Research Engineer, Machine Learning (Horizons)

Anthropic

San Francisco, California, United States (Hybrid)$500K – $850K Yearly
1mo agoNV
3w agoNV
1mo agoNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
1w agoTA

Senior Machine Learning Engineer, Voice AI

Together AI

San Francisco, California, United States (On-site)$200K – $260K Yearly
3w agoVA

AI Agent Researcher

Vast.ai

San Francisco, California, United States (On-site)$160K – $320K Yearly
3w agoSC

Machine Learning Research Scientist/ Engineer, Agents

Scale

San Francisco, California, United States (On-site)$275K – $350K Yearly
19h agoNV
6d agoAN

Research Engineer, Pretraining Scaling (London)

Anthropic

London, England, United Kingdom (On-site)£260K – £630K Yearly