AI Inference Jobs in California, United States

Browse 255 AI Inference jobs in California, United States on Inference Jobs.

21-40 of 255 jobs

2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
3wNV

Senior Deep Learning Performance Architect

NVIDIA

California, United States (Hybrid)$152k – $287.5k Yearly
2wLA

Fullstack Engineer, Applied AI

LangChain

San Francisco, California, United States (On-site)$170k – $195k Yearly
2wD-

AI Security Architect, Principal

d-Matrix

Santa Clara, California, United States or Remote (United States)$220k – $300k Yearly
3wCE
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
2wPO

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)
3wCE

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wOP

Research Engineer, Privacy

OpenAI

San Francisco, California, United States (On-site)$380k – $460k Yearly
3wCR

Senior Software Engineer, Managed AI - AI Platform

Crusoe

San Francisco, California, United States (On-site)$172.4k – $209k Yearly
2wNV

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
3wCR

Engineering Manager, Managed AI

Crusoe

San Francisco, California, United States (On-site)$237.6k – $288k Yearly
2wNV

Senior Software Engineer, Blueprints - NIM Integrations

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
2wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wOP

Security Researcher, Trusted Computing and Cryptography

OpenAI

United States or Remote (United States)$324k – $490k Yearly
5dCR

Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$204k – $247k Yearly
6dNE

Chief ML Researcher, Product

Nebius

United States (Remote)$200k – $300k Yearly
2wOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
4wXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly