1. Home
  2. Jobs
  3. Inference Frameworks

Inference Frameworks Jobs

Browse 361 Inference Frameworks jobs on Inference Jobs.

361 jobs

5dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
3wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
2wCE

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)
1wOP

Software Engineer, Productivity

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
2wD-

Senior Staff Machine Learning Engineer -Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
1wXA

Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dXA

Member of Technical Staff, RL Training Framework

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
3wXA

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dCO

Solutions Architect - Security

CoreWeave

Livingston, New Jersey, United States (Hybrid)$165k – $220k Yearly
4dAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
2wNV

Director, Software Architecture

NVIDIA

Yokne'am, Northern District, Israel (On-site)
2wNV

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
2wCO

Solutions Architect - Kubernetes

CoreWeave

London, England, United Kingdom (Hybrid)£98k – £130k Yearly
2wCR

Engineering Manager, Managed AI

Crusoe

San Francisco, California, United States (On-site)$237.6k – $288k Yearly
2wCR

Senior Software Engineer, Managed AI - AI Platform

Crusoe

San Francisco, California, United States (On-site)$172.4k – $209k Yearly
4wNV

Agentic AI Solution Engineering Intern - Summer 2026

NVIDIA

Austin, Texas, United States (On-site)$20 – $71 Hourly
1wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
2wPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)