Model Compression Jobs

Explore Model Compression roles on Inference Jobs and apply today.

2mo ago

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) · $152K – $287.5K Yearly
2mo ago

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) · $152K – $287.5K Yearly
2w ago

Staff Machine Learning Engineer, Agent Skills

Anthropic

San Francisco, California, United States (Hybrid) · $500K – $850K Yearly
2mo ago

Senior DFT ATPG Engineer

NVIDIA

Yokne'am, Northern District, Israel (On-site)
3w ago

Design for Test Engineer

Tenstorrent

Santa Clara, California, United States (Hybrid) · $100K – $500K Yearly
4w ago

Staff, Design for Test Engineer (DFT)

Tenstorrent

Bengaluru, Karnataka, India (Hybrid)
3mo ago

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site) · $150K – $230K Yearly
3mo ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site) · $150K – $250K Yearly
2w ago

Senior DFT Engineer

NVIDIA

Tel Aviv District, Israel (On-site)
3mo ago

AI Models, Product Manager

Cerebras

Sunnyvale, California, United States or Remote (United States)
2mo ago

DFT ATPG Engineer

NVIDIA

Yokne'am, Northern District, Israel (On-site)
2mo ago

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site) · $190K – $270K Yearly
4w ago

ML Runtime Optimization Engineer

Applied Intuition

Sunnyvale, California, United States (On-site) · $159.1K – $199.3K Yearly