1. Home
  2. Jobs
  3. Inference Frameworks

Inference Frameworks Jobs

Browse 361 Inference Frameworks jobs on Inference Jobs.

41-60 of 361 jobs

3wAN

Software Engineer, Inference Deployment

Anthropic

San Francisco, California, United States (Hybrid)$320k – $485k Yearly
5dTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wNV

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$200k – $391k Yearly
1wOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
5dAN

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid)€235k – €295k Yearly
5dCO

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
1wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
5dAN

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)$300k – $485k Yearly
1wPO

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)
5dAN

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
3wXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dTM

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
5dNV

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
5dAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280k – $560k Yearly
2wNV

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
2wCE

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)