1. Home
  2. Jobs
  3. Model Inference

Model Inference Jobs

Browse 864 Model Inference jobs on Inference Jobs.

864 jobs

1wOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
5dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wCE

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
5dOP

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)$455k – $555k Yearly
5dNV

Senior Deep Learning Engineer - Model Evaluation & AI Systems

NVIDIA

Santa Clara, California, United States (On-site)$224k – $431.3k Yearly
1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
5dCO

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
1wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
4hNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
3dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
3wXA

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wSE

ML Engineer

Sesame

New York, New York, United States (On-site)$190k – $320k Yearly
1wCO

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)
1wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
5dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)