1. Home
  2. Jobs
  3. LLM Serving Frameworks

LLM Serving Frameworks Jobs

Browse 415 LLM Serving Frameworks jobs on Inference Jobs.

101-120 of 415 jobs

3wCE
4dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
3wLA

Deployed Engineer (Central)

LangChain

Chicago, Illinois, United States or Remote (Illinois, United States + 1 more)$150k – $270k Yearly
2wSE

ML Engineer

Sesame

New York, New York, United States (On-site)$190k – $320k Yearly
2wPE

Full Stack Software Engineer - Applied AI

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
3wCR

Staff Software Engineer, Model LifeCycle

Crusoe

San Francisco, California, United States (On-site)$204k – $247k Yearly
2wNV

AI Safety Scientist, Deep Learning

NVIDIA

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-site)
2wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
7dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
2wLA

FullStack Engineer, Observability & Evals Platform (LangSmith)

LangChain

San Francisco, California, United States (On-site)$145k – $180k Yearly
1wCO

Forward Deployed Engineer

CoreWeave

Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly
7dTM

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wRA

Forward Deployed Engineer Lead

Reflection AI

New York, New York, United States (On-site)
4wSC

Senior Software Engineer, Connectivity

Scale

San Francisco, California, United States (On-site)$216.2k – $270.3k Yearly
1wCO

Senior Manager Forward Deployed Engineers

CoreWeave

Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly
7dCE

Principal ML Investigator

Cerebras

Sunnyvale, California, United States (On-site)