Inference Engineering Jobs in San Francisco, California, United States

Discover Inference Engineering roles in San Francisco, California, United States on Inference Jobs and apply today.

3mo agoCA

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)$180K – $250K Yearly
3mo agoPE

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)$300K – $385K Yearly
3mo agoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210K – $385K Yearly
3w agoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380K – $380K Yearly
3w agoOP

Inference Technical Lead, On-Device Transformers

OpenAI

San Francisco, California, United States (Hybrid)$445K – $445K Yearly
3mo agoOP
3w agoTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoCO

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
3w agoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)$380K – $555K Yearly
3mo agoPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
1w agoAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425K – $560K Yearly
3w agoTA

Senior Backend Engineer, Inference Platform

Together AI

San Francisco, California, United States (On-site)$160K – $250K Yearly
3mo agoCO

Full-Stack Software Engineer, Inference

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)
2mo agoAN

Software Engineer, Inference Deployment

Anthropic

San Francisco, California, United States (Hybrid)$320K – $485K Yearly
3mo agoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325K – $490K Yearly