TensorRT-LLM Jobs in Santa Clara, California, United States

Discover TensorRT-LLM roles in Santa Clara, California, United States on Inference Jobs and apply today.

1mo agoLA
4w agoLA

Deployed Engineer (Raleigh)

LangChain

North Carolina, United States (Remote)$150K – $250K Yearly
3mo agoCO

Member of Technical Staff, Modeling

Cohere

London, England, United Kingdom or Remote (Worldwide)
3mo agoPO

Member of Engineering (Pre-training / Data Engineering)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
1mo agoNV
2mo agoNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoNV

Solutions Architect, AI Cloud Partner Performance

NVIDIA

Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCO

Senior Member of Technical Staff, Multimodal AI

Cohere

San Francisco, California, United States or Remote (Worldwide)
4w agoLA

Deployed Engineer (Charlotte)

LangChain

North Carolina, United States (Remote)$150K – $250K Yearly
4w agoBA
3mo agoPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2mo agoNV
1mo agoNV

Verification Engineer - Compilers C++

NVIDIA

Santa Clara, California, United States (On-site)$140K – $270.3K Yearly
3mo agoCO

Applied AI Engineer – Agentic Workflows

Cohere

San Francisco, California, United States or Remote (California, United States + 4 more)