1. Home
  2. Jobs
  3. Model Pruning

Model Pruning Jobs

Browse 925 Model Pruning jobs on Inference Jobs.

925 jobs

5dAI

ML Runtime Optimization Engineer

Applied Intuition

Mountain View, California, United States (On-site)$159.1k – $199.3k Yearly
2wAI

ML Runtime Optimization Engineer - Lead

Applied Intuition

Sunnyvale, California, United States (On-site)$199.3k – $264.5k Yearly
5dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
2wTA

Research Intern, Model Shaping (Summer 2026)

Together AI

San Francisco, California, United States (On-site)
2wNV

Senior 3D Modeler and Asset Prep Artist

NVIDIA

Santa Clara, California, United States (On-site)$156k – $287.5k Yearly
3wXA

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
1wXA

Member of Technical Staff, World Model

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dXA

[Omni] Member of Technical Staff, World Model

xAI

Bay Area, California, United States (On-site)$180k – $440k Yearly
3wNV

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
1wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
2hNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
4wOP

Model Policy Manager

OpenAI

San Francisco, California, United States (Hybrid)$255k – $325k Yearly
1wBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150k – $230k Yearly
1wCO
2wOP

Model Policy Manager, Chemical & Biological Risk

OpenAI

San Francisco, California, United States (Hybrid)$207k – $295k Yearly
2wPE

Model Behavior Architect

Perplexity

San Francisco, California, United States (On-site)$180k – $260k Yearly