Low Latency Optimization Jobs in the United States

Discover Low Latency Optimization roles in the United States on Inference Jobs and apply today.

2mo ago

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site)

$192K – $356.5K Yearly

3w ago

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350K – $475K Yearly

3d ago

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA

Santa Clara, California, United States (On-site)

$184K – $356.5K Yearly

3w ago

Front-End Power Engineer

Etched

Cupertino, California, United States (On-site)

3mo ago

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)

5d ago

Engineering Lead, Inference Platform

Cerebras

Sunnyvale, California, United States (On-site)

2w ago

Senior Solutions Engineer (Bay Area)

Qdrant

San Francisco, California, United States (Hybrid)

$180K – $225K Yearly

3w ago

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350K – $475K Yearly

3d ago

AI Subsystems Physical Design Lead

Tenstorrent

Austin, Texas, United States (Hybrid)

$100K – $500K Yearly

3mo ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)

$150K – $250K Yearly

3mo ago

Engineering Manager - Model Performance

Baseten

San Francisco, California, United States (On-site)

$230K – $300K Yearly

1mo ago

Sr. RTL Design Engineer - Tensix

Tenstorrent

Austin, Texas, United States (Hybrid)

$100K – $500K Yearly

2mo ago

Senior Performance Engineer - Deep Learning

NVIDIA

Santa Clara, California, United States (On-site)

$152K – $241.5K Yearly

3mo ago

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380K – $380K Yearly