Model Quantization Jobs

Browse 85 Model Quantization jobs on Inference Jobs.

61-80 of 85 jobs

1d

Senior Engineer - Deep Learning Compiler Verification and Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)

$140k – $224.3k Yearly

6d

Full Stack LLM Engineer

Cerebras

Toronto, Ontario, Canada (On-site)

5d

Principal Product Manager, AI Frameworks

NVIDIA

Santa Clara, California, United States (On-site)

$240k – $379.5k Yearly

5d

Sr. Engineer, Software - Simulation/Hardware Modeling

Tenstorrent

Austin, Texas, United States (Hybrid)

$100k – $500k Yearly

6d

Research Engineer, Production Model Post-Training - London

Anthropic

London, England, United Kingdom (Hybrid)

£270k – £340k Yearly

2w

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)

4d

Senior Software Engineer - Developer Tools for Deep Learning

NVIDIA

Westford, Massachusetts, United States (On-site)

$152k – $287.5k Yearly

6d

Senior GPU Functional Modeling Architect

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $287.5k Yearly

4w

Software Engineering Intern, Simulation and Modeling

d-Matrix

Santa Clara, California, United States (Hybrid)

$30 – $59 Hourly

2w

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)

$150k – $270k Yearly

6d

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)

$280k – $560k Yearly

6d

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

6d

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

2w

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

2w

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)