TensorRT-LLM Jobs in California, United States

Discover TensorRT-LLM roles in California, United States on Inference Jobs and apply today.

2mo ago

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)

$152K – $287.5K Yearly
4w ago

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly
3mo ago

Software Engineer - Model APIs

Baseten

San Francisco, California, United States (On-site)

$150K – $230K Yearly
3mo ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)

$150K – $250K Yearly
3w ago

Staff Engineer - Perf and Benchmarking

CoreWeave

Sunnyvale, California, United States (Hybrid)

$206K – $333K Yearly
2mo ago

Sr. MTS - Inference ML Eng

Cerebras

Sunnyvale, California, United States (On-site)
2mo ago

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)

$92K – $135K Yearly
4w ago

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site)

$160K – $220K Yearly
3mo ago

Senior Software Engineer II, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)

$165K – $242K Yearly
1d ago

Principal Deep Learning Communication Architect

NVIDIA

Santa Clara, California, United States (On-site)

$272K – $431.3K Yearly
3mo ago

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)

$139K – $204K Yearly
4w ago

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)

$179.4K – $224.3K Yearly