LLM Serving Frameworks Jobs
Browse 415 LLM Serving Frameworks jobs on Inference Jobs.
101-120 of 415 jobs
4dNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
3wLA
Deployed Engineer (Central)
LangChain
Chicago, Illinois, United States or Remote (Illinois, United States + 1 more)$150k – $270k Yearly
2wPE
Full Stack Software Engineer - Applied AI
Perplexity
San Francisco, California, United States (On-site)$210k – $385k Yearly
3wCR
Staff Software Engineer, Model LifeCycle
Crusoe
San Francisco, California, United States (On-site)$204k – $247k Yearly
2wBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150k – $250k Yearly
2wLA
FullStack Engineer, Observability & Evals Platform (LangSmith)
LangChain
San Francisco, California, United States (On-site)$145k – $180k Yearly
1wCO
Forward Deployed Engineer
CoreWeave
Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly
7dTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
2wPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210k – $385k Yearly
4wSC
Senior Software Engineer, Connectivity
Scale
San Francisco, California, United States (On-site)$216.2k – $270.3k Yearly
1wCO
Senior Manager Forward Deployed Engineers
CoreWeave
Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly