Inference Accelerators Jobs
Browse 276 Inference Accelerators jobs on Inference Jobs.
41-60 of 276 jobs
2wNV
Senior Software Engineer - Inference as a Service
NVIDIA
Santa Clara, California, United States (On-site)$200k – $391k Yearly
2wCO
2wPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325k – $490k Yearly
3wCE
1wNE
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
Netherlands + 4 more (Remote)
2wCR
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)$204k – $247k Yearly
5dNV
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wNV
Senior Software Engineer, Deep Learning Inference - TensorRT
NVIDIA
Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
4wXA
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
3wAN
Software Engineer, Inference Deployment
Anthropic
San Francisco, California, United States (Hybrid)$320k – $485k Yearly
5dNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
1wAN
Staff Software Engineer, Inference
Anthropic
Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
1wNV
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
1wNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wMA
2wBA
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site)$220k – $285k Yearly