TensorRT-LLM Jobs in California, United States

Discover TensorRT-LLM roles in California, United States on Inference Jobs and apply today.

2w agoAN

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site)$350K – $850K Yearly
1mo agoNV

Senior DL Algorithms Engineer - Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
3mo agoCO

Senior Research Engineer, Model Evaluation

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)
2mo agoNV

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCO

Member of Technical Staff, Senior/Staff MLE

Cohere

San Francisco, California, United States or Remote (California, United States + 3 more)
4w agoTA

Senior Backend Engineer, Inference Platform

Together AI

San Francisco, California, United States (On-site)$160K – $250K Yearly
2mo agoTA

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)$190K – $270K Yearly
4w agoCR

Senior Software Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$172.4K – $209K Yearly
2mo agoNV

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site)$192K – $356.5K Yearly
2mo agoNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
3w agoET
3w agoET
3mo agoPO

Member of Engineering (Scalability)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
1mo agoNV

Senior Performance Engineer - Deep Learning

NVIDIA

Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2w agoAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280K – $850K Yearly