Latency Optimization Jobs

Browse 317 Latency Optimization jobs on Inference Jobs.

81-100 of 317 jobs

6d

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly
1d

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152k – $287.5k Yearly
2w

Software Engineer - Controls

Applied Intuition

Sunnyvale, California, United States (On-site) $170k – $250k Yearly
2w

Embedded ML Engineer – Gesture Recognition

Sesame

San Francisco, California, United States (On-site) $175k – $280k Yearly
3d

Senior Compiler Engineer - AI

NVIDIA

Santa Clara, California, United States (On-site) $184k – $287.5k Yearly
6d

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site) $315k – $560k Yearly
4w

Deep Learning Algorithm Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site) $124k – $241.5k Yearly
6d

Research Engineer, Pretraining Scaling (London)

Anthropic

London, England, United Kingdom (On-site) £250k – £435k Yearly
1w

Senior Software R&D Engineer, VLSI Physical Design

NVIDIA

Santa Clara, California, United States (Hybrid) $168k – $264.5k Yearly
2w

Field CTO - Media & Entertainment

Nebius

United States (Remote) $295k – $365k Yearly
4w

Software Engineer, CPU and SoC Profiling Tools

NVIDIA

Ustroń, Silesian Voivodeship, Poland or Remote (Poland) zł 183.8k – zł 318.5k Yearly
4w

B2B Performance Marketer - Media Buyer

ElevenLabs

United Kingdom or Remote (North America + 1 more)
3w

[P] Compute Efficiency Engineer

Anthropic

San Francisco, California, United States (Hybrid) $1 – $2 Yearly
2w

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid) $380k – $380k Yearly