Inference Pipelines jobs

Explore Inference Pipelines roles on Inference Jobs and apply today.

41-60 of 624 jobs

Posted 1w ago

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)

$206k – $303k Yearly

Posted 2w ago

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

Posted 3w ago

Engineering Manager, Inference Platform

Cerebras

Sunnyvale, California, United States (On-site)

Posted 2w ago

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380k Yearly

Posted 1w ago

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)

€295k – €355k Yearly

Posted 1w ago

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)

$300k – $485k Yearly

Posted 2w ago

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more)

$204k – $247k Yearly

Posted 4w ago

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)

$30 – $59 Hourly

Posted 1w ago

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

Posted 1w ago

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)

$160k – $230k Yearly

Posted 2w ago

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)

Posted 1w ago

Senior Manager, Pipeline and BDR Operations

Nebius

Europe + 1 more (Remote)

$180k – $275k Yearly

Posted 2w ago

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)

Posted 1w ago

Senior Technical Product Manager Token Factory - Inference

Nebius

United States (Remote)

$204k – $255k Yearly

Posted 3w ago

Principal Engineer, AI Inference Reliability

Cerebras

United States + 1 more (Remote)

Posted 2w ago

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

Posted 2w ago

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)

$220k – $285k Yearly

Posted 3w ago

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)

$92k – $135k Yearly

Posted 1w ago

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

Posted 1w ago

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)