1. Home
  2. Jobs
  3. AI Inference Frameworks

AI Inference Frameworks Jobs

Browse 659 AI Inference Frameworks jobs on Inference Jobs.

659 jobs

6dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
3wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
2wNV

Director, Software Architecture

NVIDIA

Yokne'am, Northern District, Israel (On-site)
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
5dAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
6dCO

Solutions Architect - Security

CoreWeave

Livingston, New Jersey, United States (Hybrid)$165k – $220k Yearly
2wNV

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
3wCE

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)
4wNV

Agentic AI Solution Engineering Intern - Summer 2026

NVIDIA

Austin, Texas, United States (On-site)$20 – $71 Hourly
2wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
3wCR

Senior Software Engineer, Managed AI - AI Platform

Crusoe

San Francisco, California, United States (On-site)$172.4k – $209k Yearly
3wCR

Engineering Manager, Managed AI

Crusoe

San Francisco, California, United States (On-site)$237.6k – $288k Yearly
3wXA

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
4dNV

Senior Compiler Engineer - AI

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
4dCE

Performance Reliability Engineer

Cerebras

Sunnyvale, California, United States (On-site)
5dAN

Engineering Manager, ML Acceleration

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
6dCO

Solutions Architect - Kubernetes

CoreWeave

Livingston, New Jersey, United States (Hybrid)$165k – $220k Yearly