1. Home
  2. Jobs
  3. Inference Workloads

Inference Workloads Jobs

Browse 237 Inference Workloads jobs on Inference Jobs.

21-40 of 237 jobs

3wCE

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
6dTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
2wCR

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more)$204k – $247k Yearly
3wCE

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)
6dCO

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wCO

Full-Stack Software Engineer, Inference

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)
3wCE
3wXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wOP

Software Engineer, Productivity

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
6dNV

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
2wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wNV

Senior Software Engineer, Blueprints - NIM Integrations

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
2wBA

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)$220k – $285k Yearly