TensorRT-LLM Jobs in California, United States

Discover TensorRT-LLM roles in California, United States on Inference Jobs and apply today.

2mo ago

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) $300K – $430K Yearly
4w ago

Applied Research Engineer, Agents

Labelbox

San Francisco, California, United States (Hybrid) $250K – $300K Yearly
3mo ago

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210K – $385K Yearly
3w ago

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site) $350K – $475K Yearly
3mo ago

Member of Technical Staff, MLE

Cohere

San Francisco, California, United States or Remote (California, United States + 3 more)
3mo ago

Principal AI/ML System Software Engineer

d-Matrix

Santa Clara, California, United States (Hybrid) $180K – $280K Yearly
4w ago

ML Runtime Optimization Engineer

Applied Intuition

Sunnyvale, California, United States (On-site) $159.1K – $199.3K Yearly
2mo ago

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) $200K – $280K Yearly
3mo ago

AI / ML System Software Engineer, Senior Staff

d-Matrix

Santa Clara, California, United States (Hybrid) $180K – $280K Yearly
3w ago

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site) $350K – $475K Yearly
2mo ago

Senior Deep Learning Compiler Engineer - XLA

NVIDIA

Santa Clara, California, United States (On-site) $152K – $241.5K Yearly
3mo ago

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site) $175K – $280K Yearly
3mo ago