Model Parallelization Jobs
Browse 522 Model Parallelization jobs on Inference Jobs.
41-60 of 522 jobs
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
1wTA
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)$190k – $270k Yearly
3wNV
Senior Scientific Machine Learning Engineer – Earth-2
NVIDIA
Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
7dTM
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
4wCE
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
7dRU
Applied Research Lead, Model Scaling
Runway
New York, New York, United States or Remote (North America + 1 more)$280k – $380k Yearly
4wNV
Senior Applied Researcher, Foundational AI Models for Biology
NVIDIA
Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
2wXA
Member of Technical Staff, World Model
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wBA
Software Engineer, Model Performance Tooling
Baseten
Canada or Remote (Canada + 1 more)C$130k – C$200k Yearly
4dDE
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site)$300k – $430k Yearly
2wPE
3wTE
C++ Machine Learning Engineer, Models Training
Tenstorrent
Austin, Texas, United States (Hybrid)$100k – $500k Yearly
7dXA
Member of Technical Staff - Enterprise Model Evaluation
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wCO
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)
7dAN
Research Engineer, Production Model Post Training
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly