Model Quantization Jobs
Browse 88 Model Quantization jobs on Inference Jobs.
88 jobs
3dNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2hNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
5dAI
ML Runtime Optimization Engineer
Applied Intuition
Mountain View, California, United States (On-site)$159.1k – $199.3k Yearly
2wNV
Senior Developer Relations Manager - COSMOS and Foundation Models
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
5dTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
2wAI
ML Runtime Optimization Engineer - Lead
Applied Intuition
Sunnyvale, California, United States (On-site)$199.3k – $264.5k Yearly
1wBA
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)$150k – $230k Yearly
1wBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150k – $250k Yearly
2wTE
1wOP
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)$230k – $460k Yearly
2wD-
Senior Staff Machine Learning Engineer -Frameworks
d-Matrix
Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
5dAN
Performance Engineer, GPU
Anthropic
San Francisco, California, United States (Hybrid)$315k – $560k Yearly
2wD-
Machine Learning Research Intern
d-Matrix
Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
2dDE
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site)$300k – $430k Yearly
2wNV
Senior Deep Learning Engineer
NVIDIA
Warszawa, Masovian Voivodeship, Poland (Hybrid)zł 292.5k – zł 507k Yearly
5dMA