AI Inference Accelerator Jobs
Browse 638 AI Inference Accelerator jobs on Inference Jobs.
638 jobs
1wD-
Analog Design Engineer, Senior Staff
d-Matrix
Santa Clara, California, United States (Hybrid)$196k – $300k Yearly
5dAN
4dAN
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)$425k – $560k Yearly
1wD-
AI Security Architect, Principal
d-Matrix
Santa Clara, California, United States or Remote (United States)$220k – $300k Yearly
2wPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210k – $385k Yearly
4dAN
Engineering Manager, ML Acceleration
Anthropic
San Francisco, California, United States (Hybrid)$425k – $560k Yearly
1wD-
Senior Staff ML Researcher - LLM Algorithmic Optimization
d-Matrix
Bengaluru, Karnataka, India (Hybrid)₹4M – ₹6M Yearly
3dNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
3dNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wNV
Senior Software Research Architect, AI Networking
NVIDIA
Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
2wCE
Inference Compiler and Frontend Engineer – Dubai
Cerebras
Dubai, Dubai, United Arab Emirates (On-site)
3wCO
Software Engineer, Inference AI/ML
CoreWeave
Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
1wOP
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380k – $380k Yearly
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
5dXA
1wNV
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly