Inference Jobs in United States

Browse 20 Inference jobs in United States on Inference Jobs.

20 jobs

4wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
2wPE

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)$300k – $385k Yearly
3wNV

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$200k – $391k Yearly
3wNV

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$248k – $391k Yearly
2wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wPO

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
4wXA

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
4wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
3wNV

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
2wOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
2wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
3wCE
4wXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
3wCE

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)
2wCO

Full-Stack Software Engineer, Inference

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)
2wBA

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)$220k – $285k Yearly