1. Home
  2. Jobs
  3. United States
  4. Inference Engineering

Inference Engineering Jobs in United States

Browse 20 Inference Engineering jobs in United States on Inference Jobs.

20 jobs

2wPE

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)$300k – $385k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
4wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
3wNV

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$248k – $391k Yearly
2wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
3wNV

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$200k – $391k Yearly
4wXA

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wPO

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)
2wD-

Analog Design Engineer, Senior Staff

d-Matrix

Santa Clara, California, United States (Hybrid)$196k – $300k Yearly
3wCE

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)
4wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
3wCE
4wXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wBA

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)$220k – $285k Yearly
3wCE

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)