1. Home
  2. Jobs
  3. Model Quantization

Model Quantization Jobs

Browse 88 Model Quantization jobs on Inference Jobs.

88 jobs

3dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2hNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
5dAI

ML Runtime Optimization Engineer

Applied Intuition

Mountain View, California, United States (On-site)$159.1k – $199.3k Yearly
2wNV

Senior Developer Relations Manager - COSMOS and Foundation Models

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
5dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wAI

ML Runtime Optimization Engineer - Lead

Applied Intuition

Sunnyvale, California, United States (On-site)$199.3k – $264.5k Yearly
1wBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150k – $230k Yearly
1wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
2wTE

Engineer, ML Models

Tenstorrent

Santa Clara, California, United States (Hybrid)$100k – $500k Yearly
1wOP

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid)$230k – $460k Yearly
2wD-

Senior Staff Machine Learning Engineer -Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
5dTE

Software Engineer

Tenstorrent

東京都, Tokyo Prefecture, Japan (On-site)
5dAN

Performance Engineer, GPU

Anthropic

San Francisco, California, United States (Hybrid)$315k – $560k Yearly
2wD-

Machine Learning Research Intern

d-Matrix

Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
5dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
2dDE

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)$300k – $430k Yearly
2wNV

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid)zł 292.5k – zł 507k Yearly
5dMA

Future AI Global leaders - Applied Science

Mistral AI

Île de Ré, Charente-Maritime, France (On-site)