AI Inference jobs in Santa Clara, California, United States

Discover AI Inference roles in Santa Clara, California, United States on Inference Jobs and apply today.

21-40 of 142 jobs

NV4w

Technical Marketing Engineer, World Models - AV Physical AI

NVIDIA

Santa Clara, California, United States (On-site)

$148k – $287.5k Yearly

PO2w

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)

D-4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)

$30 – $59 Hourly

NV2w

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $287.5k Yearly

NV1w

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

NV2d

Lead Principal Engineer, Enterprise Agentic AI Platform

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

NV2w

Senior Software Engineer, Blueprints - NIM Integrations

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

PO2w

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)

OP2w

Security Researcher, Trusted Computing and Cryptography

OpenAI

United States or Remote (United States)

$324k – $490k Yearly

NE5d

Chief ML Researcher, Product

Nebius

United States (Remote)

$200k – $300k Yearly

NV6d

Senior Manager, Engineering - Enterprise AI and Automation

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

NV2w

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

OP2w

Software Engineer, Gov

OpenAI

Washington, District of Columbia, United States or Remote (District of Columbia, United States + 2 more)

$255k – $405k Yearly

NE2w

Field CTO - Media & Entertainment

Nebius

United States (Remote)

$295k – $365k Yearly

TE10h

Software Engineer, TT-Distributed

Tenstorrent

Santa Clara, California, United States (Hybrid)

$100k – $500k Yearly

NV2d

Senior Scientist, Synthetic Data and Privacy

NVIDIA

Santa Clara, California, United States (On-site)

$192k – $356.5k Yearly

NV4w

Senior Software Test Development Engineer - Deep Learning

NVIDIA

Santa Clara, California, United States (On-site)

$140k – $270.3k Yearly

CO2w

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)

NV2w

Senior Developer Relations Manager - COSMOS and Foundation Models

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

NE1w

GPU Cluster Architect

Nebius

United States (Remote)

$150k – $180k Yearly