LLM Serving Frameworks Jobs

Browse 417 LLM Serving Frameworks jobs on Inference Jobs.

81-100 of 417 jobs

4w

Applied AI Engineer – Agentic Workflows (Korea)

Cohere

Seoul, Seoul, South Korea or Remote (South Korea)

7d

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

2w

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)

$255k – $405k Yearly

2w

Solutions Architect

Cohere

Toronto, Ontario, Canada or Remote (Canada + 3 more)

4d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)

$300k – $430k Yearly

2w

Distributed Training Engineer

Periodic Labs

Menlo Park, California, United States (Hybrid)

2w

Senior Capability Development Engineer

NVIDIA

Shenzhen Shi, Guangdong, China (On-site)

3w

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site)

$280k – $380k Yearly

3w

Senior Research Scientist, Multi-Modal Language Models

NVIDIA

Santa Clara, California, United States (On-site)

$192k – $356.5k Yearly

3w

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

3w

Principal Engineer, AI Model LifeCycle

Crusoe

San Francisco, California, United States (On-site)

$256k – $320k Yearly

6d

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)

$300k – $485k Yearly

2w

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)

2w

Training: ML Framework Engineer

OpenAI

San Francisco, California, United States (Hybrid)

$245k – $385k Yearly

2w

AI Research Engineer, Enterprise Evaluations

Scale

San Francisco, California, United States (On-site)

$179.4k – $224.3k Yearly

2w

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)

C$130k – C$200k Yearly

2d

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

1w

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $287.5k Yearly

2w

Senior AI Engineer - Agent Team

FurtherAI

San Francisco, California, United States (On-site)

$225k – $300k Yearly

2w

Deployed Engineer (East)

LangChain

New York, New York, United States (On-site)

$150k – $270k Yearly