GPU Architecture Jobs in San Francisco, California, United States

Browse 284 GPU Architecture jobs in San Francisco, California, United States on Inference Jobs.

Posted 1w ago

GPU Cluster Architect

Nebius

United States (Remote) · $150k – $180k Yearly

Posted 1w ago

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site) · $160k – $320k Yearly

Posted 3w ago

Senior Formal Verification Engineer

NVIDIA

Myrtle Point, Oregon, United States or Remote (California, United States) · $196k – $310.5k Yearly

Posted 1w ago

Solutions Architect

Together AI

San Francisco, California, United States (Hybrid) · $180k – $260k Yearly

Posted 4w ago

Key Customers Solutions Architect

Nebius

United States + 1 more (Remote) · $215k – $275k Yearly

Posted 2w ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site) · $150k – $250k Yearly

Posted 2w ago

Field CTO - Media & Entertainment

Nebius

United States (Remote) · $295k – $365k Yearly

Posted 2w ago

ASIC Firmware Engineer, Modeling

OpenAI

San Francisco, California, United States (On-site) · $360k – $530k Yearly

Posted 1w ago

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) · $455k – $555k Yearly

Posted 2w ago

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) · $210k – $385k Yearly

Posted 1w ago

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid) · $280k – $560k Yearly

Posted 2w ago

Partner Solutions Architect

Nebius

United States + 1 more (Remote) · $225k – $315k Yearly

Posted 5d ago

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) · $300k – $430k Yearly

Posted 2w ago

Member of Engineering (Scalability)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)

Posted 1w ago

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site) · $160k – $320k Yearly

Posted 1w ago

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) · $160k – $230k Yearly

Posted 2w ago

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) · $300k – $385k Yearly

Posted 4w ago

Field CTO - Physical AI & Robotics

Nebius

United States (Remote) · $265k – $365k Yearly

Posted 2w ago

Member of Technical Staff - GPU Infrastructure

Reflection AI

San Francisco, California, United States (On-site)