1. Home
  2. Jobs
  3. Language Model Inference

Language Model Inference Jobs

Browse 569 Language Model Inference jobs on Inference Jobs.

41-60 of 569 jobs

2wCR

Engineering Manager, Managed AI

Crusoe

San Francisco, California, United States (On-site)$237.6k – $288k Yearly
5dAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280k – $560k Yearly
4wNV

Software Product Manager - Nemotron

NVIDIA

Santa Clara, California, United States (On-site)$240k – $379.5k Yearly
1wOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
2wMA

Applied AI, Forward Deployed Machine Learning Engineer - Morocco

Mistral AI

Casablanca, Casablanca-Settat, Morocco (On-site)
1wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
5dOP

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)$455k – $555k Yearly
9hNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
3dOP

Data Scientist, Marketing Innovation

OpenAI

San Francisco, California, United States (On-site)$293k – $325k Yearly
3wXA

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dRU

Applied Research Lead, Language

Runway

North America + 1 more (Remote)$280k – $380k Yearly
3dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
5dCO

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
5dCE

Principal ML Investigator

Cerebras

Sunnyvale, California, United States (On-site)
1wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
3wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
1wOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
1wCA

Software Engineer, India

Cartesia

Bengaluru, Karnataka, India (On-site)₹7M – ₹9M Yearly