Inference Optimization Jobs
Browse 482 Inference Optimization jobs on Inference Jobs.
21-40 of 482 jobs
14hNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
3wXA
Member of Technical Staff - Multimodal Interactions Post-training
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
1wD-
Senior Staff ML Researcher - LLM Algorithmic Optimization
d-Matrix
Bengaluru, Karnataka, India (Hybrid)₹4M – ₹6M Yearly
2wNV
Senior Developer Relations Manager - COSMOS and Foundation Models
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
5dAN
ML Infrastructure Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wAI
Machine Learning Engineer - Defense
Applied Intuition
Ann Arbor, Michigan, United States (On-site)$130k – $200k Yearly
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
5dAN
Staff Software Engineer, Inference
Anthropic
Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
3wNV
Platform Architecture Engineer, GeForce NOW
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
3wXA
Member of Technical Staff, Grok Imagine
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wMA
1wOP
Software Engineer, Productivity
OpenAI
San Francisco, California, United States (On-site)$255k – $405k Yearly
2wRA
Member of Technical Staff - Post-Training
Reflection AI
San Francisco, California, United States (On-site)
2wNV
Senior Machine Learning Applications and Compiler Engineer
NVIDIA
Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
1wOP
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)$325k – $490k Yearly
4dAN
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)$425k – $560k Yearly
3dCR
Principal Product Manager, General Compute (SF, Sunnyvale, New York)
Crusoe
San Francisco, California, United States (Hybrid)$260.8k – $326k Yearly
5dXA
Member of Technical Staff, RL Training Framework
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly