- Home
- Jobs
- United States
- California
- TensorRT-LLM
TensorRT-LLM Jobs in California, United States
Discover TensorRT-LLM roles in California, United States on Inference Jobs and apply today.
2mo agoDE
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site)$300K – $430K Yearly
4w agoLA
Applied Research Engineer, Agents
Labelbox
San Francisco, California, United States (Hybrid)$250K – $300K Yearly
3mo agoPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210K – $385K Yearly
3w agoTM
Research Engineer, Infrastructure, Numerics
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
3mo agoCO
Member of Technical Staff, MLE
Cohere
San Francisco, California, United States or Remote (California, United States + 3 more)
7d agoAI
Embedded AI Engineer – Android Automotive (On-Device Intelligence)
Applied Intuition
Sunnyvale, California, United States (On-site)$150K – $250K Yearly
3mo agoD-
Principal AI/ML System Software Engineer
d-Matrix
Santa Clara, California, United States (Hybrid)$180K – $280K Yearly
4w agoAI
ML Runtime Optimization Engineer
Applied Intuition
Sunnyvale, California, United States (On-site)$159.1K – $199.3K Yearly
2mo agoTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoD-
AI / ML System Software Engineer, Senior Staff
d-Matrix
Santa Clara, California, United States (Hybrid)$180K – $280K Yearly
3w agoTM
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
2mo agoNV
Senior Deep Learning Compiler Engineer - XLA
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2mo agoNV
Deep Learning Performance Architect - New College Graduate 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
2mo agoNV
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)$224K – $356.5K Yearly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
1mo agoNV
Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$168K – $264.5K Yearly