Compute Efficiency Jobs
Browse 954 Compute Efficiency jobs on Inference Jobs.
3w
[P] Compute Efficiency Engineer
Anthropic
San Francisco, California, United States (Hybrid)
$1 – $2 Yearly
2w
Senior AI Performance and Efficiency Engineer
NVIDIA
Santa Clara, California, United States (On-site)
$184k – $356.5k Yearly
2w
Software Engineer, Platform Systems
OpenAI
San Francisco, California, United States (On-site)
$310k – $460k Yearly
5d
Member of Technical Staff - Reasoning Efficiency
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
4w
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)
$30 – $59 Hourly
5d
Data Science Engineer, Capacity and Efficiency
Anthropic
New York, New York, United States (Hybrid)
$275k – $370k Yearly
1w
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)
3w
Senior Applied Deep Learning Research Scientist, Efficiency
NVIDIA
Santa Clara, California, United States (On-site)
$192k – $356.5k Yearly
3w
Finance Manager - Optimization and Efficiency
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$115k – $168k Yearly
9h
Principal GPU Memory Architect
NVIDIA
Santa Clara, California, United States (On-site)
$272k – $431.3k Yearly
1w
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)
$380k – $380k Yearly
1w
Content Integrity Analyst
OpenAI
San Francisco, California, United States (Hybrid)
$280k – $280k Yearly
1w
Strategic Finance, Hardware R&D Finance Manager
OpenAI
San Francisco, California, United States (Hybrid)
$265k – $265k Yearly
1w
Research Compute Operations
Anthropic
San Francisco, California, United States (Hybrid)
$270k – $290k Yearly
2w
GPU Power Architect - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$100k – $189.8k Yearly