TensorRT-LLM Jobs
Explore TensorRT-LLM roles on Inference Jobs and apply today.
3mo agoOP
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2mo agoNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2w agoNV
2mo agoNV
Senior Software Engineer - VLM Microservices for Neural Reconstruction
NVIDIA
Santa Clara, California, US$152K – $287.5K Yearly
2mo agoNV
3mo agoCO
1d agoNV
4w agoNV
Senior Manager, System Software Engineering - Metropolis Accelerated and Inferencing Software
NVIDIA
Pune, Maharashtra, India (On-site)
3mo agoPE
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site)$300K – $385K Yearly
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo agoCE
3mo agoBA
Engineering Manager - Model Performance
Baseten
San Francisco, California, United States (On-site)$230K – $300K Yearly
3mo agoNV
Senior Developer Technology Engineer - Windows AI Platform
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
4w agoTA
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
2mo agoNV
Senior Software Engineer, Deep Learning Inference - TensorRT
NVIDIA
Santa Clara, California, US$152K – $287.5K Yearly
4w agoNV
Deep Learning Engineer - LLM and VLM Model Compression
NVIDIA
Warszawa, Masovian Voivodeship, Poland (On-site)zł 292.5K – zł 650K Yearly