- Home
- Jobs
- United States
- California
- San Francisco
- GPU Computing
GPU Computing Jobs in San Francisco, California, United States
Discover GPU Computing roles in San Francisco, California, United States on Inference Jobs and apply today.
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3mo agoOP
Training Performance Engineer
OpenAI
San Francisco, California, United States (Hybrid)$250K – $460K Yearly
2mo agoTA
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)$190K – $270K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
1w agoCR
Senior Infrastructure Engineer, Lab
Crusoe
San Francisco, California, United States (On-site)$172K – $209K Yearly
3mo agoOP
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
3mo agoD-
Software Engineer, Staff - SIMD Kernels
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (United States)$190K – $300K Yearly
3mo agoOP
Software Engineer, Hardware
OpenAI
San Francisco, California, United States (Hybrid)$310K – $460K Yearly
1mo agoCR
Senior Engineering Manager, Compute
Crusoe
San Francisco, California, United States (On-site)$237K – $288K Yearly
3mo agoOP
Software Engineer, Accelerators
OpenAI
San Francisco, California, United States (On-site)$310K – $380K Yearly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
2d agoTE
AI Performance Simulation Architect
Tenstorrent
California, United States + 5 more (Remote)$100K – $500K Yearly
3mo agoOP
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)$230K – $460K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
4w agoTA
Senior Backend Engineer, Inference Platform
Together AI
San Francisco, California, United States (On-site)$160K – $250K Yearly
4w agoTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
4w agoTA
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly