Inference Engineering Jobs

Browse 923 Inference Engineering jobs on Inference Jobs.

61-80 of 923 jobs

3w

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site) $184k – $287.5k Yearly

2w

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid) zł 292.5k – zł 507k Yearly

1w

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site) $184k – $287.5k Yearly

6d

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site) $160k – $320k Yearly

2w

Software Engineering Intern, Developer and Qualification Tools

d-Matrix

Santa Clara, California, United States (Hybrid) $30 – $59 Hourly

2w

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site) $325k – $490k Yearly

2w

Machine Learning Engineer - Defense

Applied Intuition

Ann Arbor, Michigan, United States (On-site) $130k – $200k Yearly

5d

Senior Software Engineer, AI Inference Systems

NVIDIA

Toronto, Ontario, Canada (Hybrid) C$170k – C$275k Yearly

3d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) $300k – $430k Yearly

4d

Forward Deployed ML Engineer

Modal

New York, New York, United States (On-site) $180k – $250k Yearly

2w

Senior ASIC Design Verification Engineer

NVIDIA

California, United States (Hybrid) $168k – $310.5k Yearly

2w

Senior Staff Machine Learning Engineer - Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid) $155k – $250k Yearly

2w

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)

2w

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)

4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States) $30 – $59 Hourly

2w

Research Engineer

Magic

San Francisco, California, United States (On-site) $225k – $550k Yearly

4w

Technical Marketing Engineer, World Models - AV Physical AI

NVIDIA

Santa Clara, California, United States (On-site) $148k – $287.5k Yearly

2w

AI Infrastructure Engineer - Autonomy

Applied Intuition

Sunnyvale, California, United States (On-site) $153k – $222k Yearly

6d

ML Infrastructure Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid) $320k – $405k Yearly