Model Parallelization jobs

Explore Model Parallelization roles on Inference Jobs and apply today.

21-40 of 578 jobs

1w

AI Research Intern, TAO Multi-Modal Model Development - 2026

NVIDIA

Hanoi, Hanoi, Vietnam (On-site)

3w

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site)

$300k – $470k Yearly

1w

Software Engineer, Kernel Development and Optimization

Tenstorrent

Gdańsk, Pomeranian Voivodeship, Poland (Hybrid)

2w

Computer Architecture Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

2w

Software Engineering Intern, Test Development - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

2w

Senior Staff Machine Learning Engineer - Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)

$155k – $250k Yearly

1w

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

1w

Performance Engineer

Anthropic

San Francisco, California, United States (Hybrid)

$315k – $560k Yearly

2w

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid)

$230k – $460k Yearly

2w

ML Compiler Architect, Senior Principal

d-Matrix

Toronto, Ontario, Canada (Hybrid)

5d

Senior NIC System Modeling and Simulation Engineer

NVIDIA

Yokneam Ilit, Northern District, Israel (On-site)

1w

GPU Architecture Engineer - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $241.5k Yearly

2w

Researcher: Model Architecture, UK

Cartesia

London, England, United Kingdom (On-site)

2w

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)

$175k – $280k Yearly

1w

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)

$160k – $320k Yearly

1w

Senior Developer Technology Engineer, CPU Performance

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

1w

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160k – $230k Yearly

1w

[Omni] Member of Technical Staff, World Model

xAI

Bay Area, California, United States (On-site)

$180k – $440k Yearly

2w

Model Behavior Architect

Perplexity

San Francisco, California, United States (On-site)

$180k – $260k Yearly