Model Parallelism Jobs
Explore Model Parallelism roles on Inference Jobs and apply today.
3mo agoCA
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
3mo agoBA
Software Engineer, Model Performance Tooling
Baseten
Canada or Remote (Canada + 1 more)C$130K – C$200K Yearly
3mo agoOP
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
3w agoTA
Systems Research Engineer, GPU Programming
Together AI
San Francisco, California, United States (Hybrid)$160K – $230K Yearly
3mo agoTE
C++ Machine Learning Engineer, Models Training
Tenstorrent
Warszawa, Masovian Voivodeship, Poland (Hybrid)
3mo agoBA
Senior Software Engineer - Model Training
Baseten
San Francisco, California, United States (On-site)$200K – $275K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
2mo agoNV
Senior AI Developer Technology Engineer, Financial Sector
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $241.5K Yearly
2mo agoGR
2026 Graduate Software Engineer - Neuro Engine Modelling
Graphcore
Bristol, England, United Kingdom (On-site)
2mo agoNV
Senior GPU Functional Modeling Architect
NVIDIA
Santa Clara, California, United States (On-site)$152K – $287.5K Yearly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)