Model Parallelization jobs
Explore Model Parallelization roles on Inference Jobs and apply today.
481-500 of 592 jobs
Researcher, Health AI
OpenAI
San Francisco, California, United States (Hybrid)
$310k – $460k Yearly
Principal GPU Memory Architect
NVIDIA
Santa Clara, California, United States (On-site)
$272k – $431.3k Yearly
Deep Learning Algorithm Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124k – $241.5k Yearly
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)
$152k – $241.5k Yearly
Forward Deployed Engineer (FDE) - NYC
OpenAI
New York, New York, United States (Hybrid)
$180k – $280k Yearly
Research Scientist, Condensed Matter Theory
Periodic Labs
Franklin, Louisiana, United States or Remote (United States)
Director, Engineering – Software Engineering and AI Inferencing Platforms
NVIDIA
Hanoi, Hanoi, Vietnam (On-site)
GPU Power Architect - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$100k – $189.8k Yearly
Member of Technical Staff - Evaluations
Reflection AI
San Francisco, California, United States (On-site)
Technical Program Manager, Science Operations - Paris
Mistral AI
Île de Ré, Charente-Maritime, France (Hybrid)
Member of Technical Staff - Reasoning Post-training
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
Senior Deep Learning Research Engineer
NVIDIA
Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
Member of Technical Staff, RL Training Framework
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly