- Home
- Jobs
- United States
- California
- Santa Clara
- TensorRT-LLM
TensorRT-LLM Jobs in Santa Clara, California, United States
Discover TensorRT-LLM roles in Santa Clara, California, United States on Inference Jobs and apply today.
3mo agoCO
Member of Technical Staff, MLE
Cohere
San Francisco, California, United States or Remote (California, United States + 3 more)
3mo agoD-
Principal AI/ML System Software Engineer
d-Matrix
Santa Clara, California, United States (Hybrid)$180K – $280K Yearly
1mo agoNV
Senior Deep Learning and Computer Vision Engineer - Autonomous Vehicles
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
1mo agoD-
3mo agoD-
AI / ML System Software Engineer, Senior Staff
d-Matrix
Santa Clara, California, United States (Hybrid)$180K – $280K Yearly
3mo agoD-
Software Engineer, Staff - SIMD Kernels
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (United States)$190K – $300K Yearly
3mo agoD-
Software Engineer, Senior Staff - Kernels
d-Matrix
Santa Clara, California, United States (Hybrid)$180K – $300K Yearly
2mo agoNV
Senior Account Manager – RTX Raytheon
NVIDIA
United States or Remote (United States)$224K – $356.5K Yearly
2mo agoNV
Senior Software Engineer, Metropolis Vision AI
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2mo agoNV
Senior Deep Learning Compiler Engineer - XLA
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2mo agoNV
Deep Learning Performance Architect - New College Graduate 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
2mo agoNV
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)$224K – $356.5K Yearly
1w agoD-
Principal Software ML Test Engineer
d-Matrix
Santa Clara, California, United States (Hybrid)$180K – $300K Yearly
3mo agoPO
Member of Engineering (Pre-training / Data)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
3mo agoLA
Deployed Engineer (Central)
LangChain
Chicago, Illinois, United States or Remote (Illinois, United States + 1 more)$150K – $270K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
1mo agoNV
Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$168K – $264.5K Yearly