Model Parallelization Jobs
Browse 971 Model Parallelization jobs on Inference Jobs.
5d
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $356.5k Yearly
6d
Performance Engineer, GPU
Anthropic
San Francisco, California, United States (Hybrid)
$315k – $560k Yearly
1w
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site)
$184k – $287.5k Yearly
5d
Machine Learning Systems Engineer, Research Tools
Anthropic
San Francisco, California, United States (Hybrid)
$320k – $405k Yearly
3w
Senior Research Scientist, Multi-Modal Language Models
NVIDIA
Santa Clara, California, United States (On-site)
$192k – $356.5k Yearly
2w
Senior AI Software Engineer, GenAI Framework
NVIDIA
Santa Clara, California, United States (On-site)
$152k – $287.5k Yearly
6d
ML Runtime Optimization Engineer
Applied Intuition
Mountain View, California, United States (On-site)
$159.1k – $199.3k Yearly
2w
ML Runtime Optimization Engineer - Lead
Applied Intuition
Sunnyvale, California, United States (On-site)
$199.3k – $264.5k Yearly
2w
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)
$150k – $230k Yearly
2w
Senior Solutions Architect, Global Partner Team
NVIDIA
Singapore, Central Singapore Community Development Council, Singapore (On-site)
2w
Research Intern, Model Shaping (Summer 2026)
Together AI
San Francisco, California, United States (On-site)
2w
Senior Developer Relations Manager - COSMOS and Foundation Models
NVIDIA
Santa Clara, California, United States (On-site)
$184k – $356.5k Yearly
2w
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)
$150k – $250k Yearly