Model Parallelism Jobs
Explore Model Parallelism roles on Inference Jobs and apply today.
2mo agoTA
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)$190K – $270K Yearly
2mo agoCE
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
1mo agoD-
Principal Architect, Performance Analysis and Modeling
d-Matrix
Santa Clara, California, United States (Hybrid)$190K – $280K Yearly
2mo agoTE
Software Engineer, TT-Distributed
Tenstorrent
Santa Clara, California, United States (Hybrid)$100K – $500K Yearly
3mo agoD-
Software Engineering Intern, Simulation and Modeling
d-Matrix
Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
3mo agoOP
Distributed Training Engineer, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380K – $555K Yearly
4w agoNV
2mo agoNV
Developer Technology Intern, High-Performance Databases - Summer 2026
NVIDIA
Santa Clara, California, United States (On-site)$20 – $71 Hourly
2mo agoTE
Software Engineer, Kernel Development and Optimization
Tenstorrent
Gdańsk, Pomeranian Voivodeship, Poland (Hybrid)
3mo agoNV
Manager, Compiler Engineering – Backend GPU
NVIDIA
Austin, Texas, United States (On-site)$224K – $356.5K Yearly