Inference Capacity Jobs
Browse 285 Inference Capacity jobs on Inference Jobs.
41-60 of 285 jobs
2wRA
Member of Technical Staff - GPU Infrastructure
Reflection AI
San Francisco, California, United States (On-site)
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
5dCO
Director of Engineering, Inference Services
CoreWeave
Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
1wPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wD-
Senior Staff Machine Learning Engineer -Frameworks
d-Matrix
Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
3dNV
Senior Compiler Engineer, AI Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
1wBA
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site)$220k – $285k Yearly
1wNV
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
3dNV
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wNV
Senior Deep Learning Performance Architect
NVIDIA
California, United States (Hybrid)$152k – $287.5k Yearly
5dXA
Member of Technical Staff, RL Training Framework
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
3wCO
Software Engineer, Inference AI/ML
CoreWeave
Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
2wNV
Senior Software Engineer, Deep Learning Inference - TensorRT
NVIDIA
Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
3wXA
Member of Technical Staff, Model Evaluation
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
11hNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wNV
Senior Systems Engineer – High-Performance AI and Networking Applications
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly