
Quantization Jobs

Browse 30 Quantization jobs on Inference Jobs.

2d

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152k – $287.5k Yearly

5m

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152k – $287.5k Yearly

3w

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly

2w

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid) zł 292.5k – zł 507k Yearly

5d

Performance Engineer, GPU

Anthropic

San Francisco, California, United States (Hybrid) $315k – $560k Yearly

5d

Software Engineer

Tenstorrent

Tokyo, Tokyo Prefecture, Japan (On-site)

5d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly

5d

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)

5d

ML Runtime Optimization Engineer

Applied Intuition

Mountain View, California, United States (On-site) $159.1k – $199.3k Yearly

2w

ML Runtime Optimization Engineer - Lead

Applied Intuition

Sunnyvale, California, United States (On-site) $199.3k – $264.5k Yearly

2w

Senior Staff Machine Learning Engineer - Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid) $155k – $250k Yearly

2w

Senior Research Scientist, AI Accelerator Design and VLSI

NVIDIA

Santa Clara, California, United States (On-site) $192k – $356.5k Yearly

1w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

1w

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site) $150k – $250k Yearly

2w

Engineer, ML Models

Tenstorrent

Santa Clara, California, United States (Hybrid) $100k – $500k Yearly

3d

Research Engineer

Crusoe

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)

1w

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid) $230k – $460k Yearly

4w

Research Scientist, AI Accelerator SW HW Co-Design - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site) $168k – $264.5k Yearly

2d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) $300k – $430k Yearly