1. Home
  2. Jobs
  3. LLM Serving Frameworks

LLM Serving Frameworks Jobs

Browse 395 LLM Serving Frameworks jobs on Inference Jobs.

181-200 of 395 jobs

2wCA

Software Engineer

Cartesia

San Francisco, California, United States (On-site)$180k – $250k Yearly
1wLA

Forward Deployed Engineer

Labelbox

San Francisco, California, United States (Hybrid)$140k – $200k Yearly
2wLA

Senior Full Stack Engineer, Observability & Evals Platform

LangChain

San Francisco, California, United States (On-site)$175k – $225k Yearly
2wPE

AI Research Lead

Perplexity

San Francisco, California, United States (On-site)$300k – $470k Yearly
1wMA

AI Scientist - Zurich

Mistral AI

Zürich, Zürich, Switzerland (Hybrid)
2wNV

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid)zł 292.5k – zł 507k Yearly
2wOP

AI & Provider Operations Engineer

OpenRouter

United States or Remote (United States)
1wOP

Research Engineer / Machine Learning Engineer - B2B Applications

OpenAI

San Francisco, California, United States (On-site)$295k – $445k Yearly
2wMA

Research Engineer, Machine Learning

Mistral AI

Palo Alto, California, United States (Hybrid)
3wDE

Senior Research Engineer

Decagon

San Francisco, California, United States (On-site)£200k – £300k Yearly
2wMA

Research Engineer

Magic

San Francisco, California, United States (On-site)$225k – $550k Yearly
1wTM

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
3wHA

Senior Forward Deployed Engineer

Harvey

New York, New York, United States (On-site)$200k – $260k Yearly
2wPO

Member of Engineering (Scalability)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wSC

ML Systems Engineer, Robotics

Scale

San Francisco, California, United States (On-site)$218.4k – $273k Yearly
2wOP

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)$460k – $555k Yearly
1wAN

Performance Engineer

Anthropic

San Francisco, California, United States (Hybrid)$315k – $560k Yearly
3wNV

Principal GenAI Engagement Lead, Partner Platforms

NVIDIA

Santa Clara, California, United States (Hybrid)$272k – $431.3k Yearly
2wTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200k – $280k Yearly