Low-latency RPC Jobs

Browse 65 Low-latency RPC jobs on Inference Jobs.

41-60 of 65 jobs

6d

Software Engineer, Sandboxing (Systems)

Anthropic

San Francisco, California, United States (Hybrid)
$300k – $405k Yearly

2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

2w

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)
$150k – $230k Yearly

4d

Software Engineer — GPU Networking & Distributed Systems

Baseten

San Francisco, California, United States (On-site)
$150k – $250k Yearly

2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)
$210k – $385k Yearly

6d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)
$160k – $230k Yearly

6d

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)
$350k – $475k Yearly

2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

5d

Senior Software Developer, AI Networking

NVIDIA

Texas, United States (Remote)
$184k – $356.5k Yearly

6d

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)
$350k – $475k Yearly

2w

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)

2w

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)
$152k – $287.5k Yearly

2w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

3w

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)
$180k – $440k Yearly