TensorRT-LLM Jobs
Explore TensorRT-LLM roles on Inference Jobs and apply today.
3mo agoMO
Member of Technical Staff - ML Performance
Modal
New York, United States (On-site)$150K – $270K Yearly
2mo agoNV
Senior Systems Software Engineer - Deep Learning Solutions
NVIDIA
Toronto, Ontario, Canada (On-site)C$225K – C$275K Yearly
3mo agoPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210K – $385K Yearly
3mo agoCE
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
2w agoNV
2mo agoAI
ML Runtime Optimization Engineer - Lead
Applied Intuition
Sunnyvale, California, US$199.3K – $264.5K Yearly
4w agoAI
ML Runtime Optimization Engineer
Applied Intuition
Sunnyvale, California, United States (On-site)$159.1K – $199.3K Yearly
2mo agoTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
3w agoTM
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
2mo agoNV
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)$224K – $356.5K Yearly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly