AI Inference Jobs

Browse 593 AI Inference jobs on Inference Jobs.

101-120 of 593 jobs

2w

Software Engineer, Metropolis Vision AI

NVIDIA

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-site)

3w

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly

2w

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)

4d

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152k – $287.5k Yearly

6d

Senior Forward Deployed Data Scientist/Engineer

Scale

San Francisco, California, United States (Hybrid) $198k – $247.5k Yearly

2w

Senior Staff Machine Learning Engineer - Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid) $155k – $250k Yearly

6d

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly

3w

Staff Software Engineer, Model LifeCycle

Crusoe

San Francisco, California, United States (On-site) $204k – $247k Yearly

2w

Senior Staff ML Researcher - LLM Algorithmic Optimization

d-Matrix

Bengaluru, Karnataka, India (Hybrid) ₹4M – ₹6M Yearly

4d

Software Engineer, TensorRT Specialized Platforms - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site) $124k – $195.5k Yearly

2w

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid) zł 292.5k – zł 507k Yearly

2w

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Santa Clara, California, United States (Hybrid) $152k – $287.5k Yearly

2w

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site) $175k – $280k Yearly

2w

Staff Software Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

2w

Research Engineer

Magic

San Francisco, California, United States (On-site) $225k – $550k Yearly

6d

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $220k Yearly
4wGR
1w

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) $200k – $280k Yearly