Inference Engineer Jobs

Browse 934 Inference Engineer jobs on Inference Jobs.

81-100 of 934 jobs

Posted 2w ago

Research Engineer

Magic

San Francisco, California, United States (On-site) | $225k – $550k Yearly
Posted 5d ago

ML/AI Engineer

Nebius

Amsterdam, North Holland, Netherlands (On-site)
Posted 5d ago

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site) | $160k – $220k Yearly
Posted 1w ago

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site) | $255k – $405k Yearly
Posted 1w ago

Software Engineer, Productivity

OpenAI

San Francisco, California, United States (On-site) | $255k – $405k Yearly
Posted 2w ago

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site) | $175k – $280k Yearly
Posted 4w ago

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States) | $30 – $59 Hourly
Posted 5d ago

Solutions Engineer

Cartesia

San Francisco, California, United States (On-site) | $160k – $220k Yearly
Posted 2w ago

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Santa Clara, California, United States (Hybrid) | $152k – $287.5k Yearly
Posted 1w ago

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) | $200k – $280k Yearly
Posted 5d ago

Infrastructure Engineer, ML Systems

Applied Compute

San Francisco, California, United States (On-site)
Posted 1w ago

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid) | $460k – $555k Yearly
Posted 3d ago

Performance Reliability Engineer

Cerebras

Sunnyvale, California, United States (On-site)
Posted 3w ago

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site) | $180k – $440k Yearly
Posted 5d ago

Research Engineer

Applied Compute

San Francisco, California, United States (On-site)
Posted 4w ago

Deep Learning Algorithm Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site) | $124k – $241.5k Yearly
Posted 2w ago

Software Engineer - Full Stack, Defense

Applied Intuition

Sunnyvale, California, United States (On-site) | $153k – $222k Yearly
Posted 4w ago

Deep Learning Software Engineer, FlashInfer - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site) | $108k – $195.5k Yearly