Model Compression Jobs
Explore Model Compression roles on Inference Jobs and apply today.
4w agoNV
Deep Learning Engineer - LLM and VLM Model Compression
NVIDIA
Warszawa, Masovian Voivodeship, Poland (On-site)zł 292.5K – zł 650K Yearly
2mo agoNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
2mo agoNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
4w agoNE
2w agoAN
Staff Machine Learning Engineer, Agent Skills
Anthropic
San Francisco, California, United States (Hybrid)$500K – $850K Yearly
3w agoTE
Design for Test Engineer
Tenstorrent
Santa Clara, California, United States (Hybrid)$100K – $500K Yearly
3mo agoBA
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)$150K – $230K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
3mo agoCE
2mo agoTA
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)$190K – $270K Yearly
3mo agoBA
Software Engineer, Model Performance Tooling
Baseten
Canada or Remote (Canada + 1 more)C$130K – C$200K Yearly
4w agoAI
ML Runtime Optimization Engineer
Applied Intuition
Sunnyvale, California, United States (On-site)$159.1K – $199.3K Yearly