Model Parallelization jobs
Explore Model Parallelization roles on Inference Jobs and apply today.
21-40 of 578 jobs
AI Research Intern, TAO Multi-Modal Model Development - 2026
NVIDIA
Hanoi, Hanoi, Vietnam (On-site)
Research Engineering Manager - Model Training
Perplexity
San Francisco, California, United States (On-site)
$300k – $470k Yearly
Software Engineer, Kernel Development and Optimization
Tenstorrent
Gdańsk, Pomeranian Voivodeship, Poland (Hybrid)
Senior Staff Machine Learning Engineer -Frameworks
d-Matrix
Santa Clara, California, United States (Hybrid)
$155k – $250k Yearly
Research Engineer, Infrastructure, Numerics
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Performance Engineer
Anthropic
San Francisco, California, United States (Hybrid)
$315k – $560k Yearly
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)
$230k – $460k Yearly
Senior NIC System Modeling and Simulation Engineer
NVIDIA
Yokneam Ilit, Northern District, Israel (On-site)
GPU Architecture Engineer - New College Grad 2025
NVIDIA
Santa Clara, California, United States (On-site)
$124k – $241.5k Yearly
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)
$175k – $280k Yearly
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site)
$160k – $320k Yearly
Senior Developer Technology Engineer, CPU Performance
NVIDIA
Santa Clara, California, United States (Hybrid)
$152k – $287.5k Yearly
ML Engineer, Large Language Models (LLM Training & Inference Optimization)
Nebius
Europe (Remote)
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
[Omni] Member of Technical Staff, World Model
xAI
Bay Area, California, United States (On-site)
$180k – $440k Yearly
Model Behavior Architect
Perplexity
San Francisco, California, United States (On-site)
$180k – $260k Yearly