Low-Precision Inference Jobs
Browse 291 Low-Precision Inference jobs on Inference Jobs.
21-40 of 291 jobs
1w
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site) $325k – $490k Yearly
2w
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid) $380k – $380k Yearly
3d
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site) $152k – $241.5k Yearly
6d
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site) $160k – $320k Yearly
1w
Analog Design Engineer, Senior Staff
d-Matrix
Santa Clara, California, United States (Hybrid) $196k – $300k Yearly
2w
Software Engineering Intern, Developer and Qualification Tools
d-Matrix
Santa Clara, California, United States (Hybrid) $30 – $59 Hourly
6d
Senior GPU Low Power Architect
NVIDIA
Santa Clara, California, United States (On-site) $136k – $264.5k Yearly
6d
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site) $350k – $475k Yearly
4w
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States) $30 – $59 Hourly
2w
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more) $204k – $247k Yearly
5d
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid) $425k – $560k Yearly
1w
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
3d
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site) $152k – $241.5k Yearly
4d