1. Home
  2. Jobs
  3. Distributed Serving

Distributed Serving Jobs

Browse 599 Distributed Serving jobs on Inference Jobs.

81-100 of 599 jobs

5dAN

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)$300k – $485k Yearly
2wRA

Member of Technical Staff - GPU Infrastructure

Reflection AI

San Francisco, California, United States (On-site)
5dNE

Technical Product Manager (Cluster Experience)

Nebius

Amsterdam, North Holland, Netherlands or Remote (Europe)
5dTM

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wMA

Research Engineer, Machine Learning

Mistral AI

Palo Alto, California, United States (Hybrid)
5dTM

Research Engineer, Infrastructure, Tinker

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
5dAN

Research Engineer, Pretraining Scaling (London)

Anthropic

London, England, United Kingdom (On-site)£250k – £435k Yearly
3dTA

Machine Learning, Platform Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $250k Yearly
2wPW

Member of Technical Staff, Back-end & API

Parallel Web Systems

San Francisco, California, United States (On-site)
5dCO

Solutions Architect - HPC/AI/ML

CoreWeave

London, England, United Kingdom (Hybrid)£116k – £155k Yearly
5dTM

Software Engineer, Data Infrastructure

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wPW

Member of Technical Staff, Infrastructure & Scaling

Parallel Web Systems

San Francisco, California, United States (On-site)
1wXA

Member of Technical Staff, World Model

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2dCO

Software Engineer II - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)$109k – $160k Yearly
5dAN

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid)€235k – €295k Yearly
2wD-

Senior Staff Machine Learning Engineer -Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
1wOP

Software Engineer, Monetization Delivery

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
2wPE

UK Internship Program

Perplexity

London, England, United Kingdom (Hybrid)