Inference Accelerators Jobs
Browse 278 Inference Accelerators jobs on Inference Jobs.
Posted 2w ago
Analog Design Engineer, Senior Staff
d-Matrix
Santa Clara, California, United States (Hybrid) $196k – $300k Yearly
Posted 2w ago
Senior ASIC Design Verification Engineer
NVIDIA
California, United States (Hybrid) $168k – $310.5k Yearly
Posted 6d ago
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid) $425k – $560k Yearly
Posted 2w ago
AI Security Architect, Principal
d-Matrix
Santa Clara, California, United States or Remote (United States) $220k – $300k Yearly
Posted 2w ago
Senior Staff ML Researcher - LLM Algorithmic Optimization
d-Matrix
Bengaluru, Karnataka, India (Hybrid) ₹4M – ₹6M Yearly
Posted 3w ago
Inference Compiler and Frontend Engineer – Dubai
Cerebras
Dubai, Dubai, United Arab Emirates (On-site)
Posted 2w ago
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site) $210k – $385k Yearly
Posted 4w ago
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
Posted 6d ago
Engineering Manager, ML Acceleration
Anthropic
San Francisco, California, United States (Hybrid) $425k – $560k Yearly
Posted 4w ago
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States) $30 – $59 Hourly
Posted 7d ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site) $160k – $230k Yearly
Posted 2w ago
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site) $300k – $385k Yearly
Posted 7d ago
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site) $455k – $555k Yearly
Posted 2w ago
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid) $380k – $380k Yearly