Home
Jobs
LLM Serving Frameworks

LLM Serving Frameworks Jobs

Browse 417 LLM Serving Frameworks jobs on Inference Jobs.

81-100 of 417 jobs

4wCO

Applied AI Engineer – Agentic Workflows (Korea)

Cohere

Seoul, Seoul, South Korea or Remote (South Korea)

AI Engineering

Applied AI

7dTM

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly

AI Infrastructure

Infrastructure Engineering

2wOP

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly

AI Infrastructure

Applied AI

2wCO

Solutions Architect

Cohere

Toronto, Ontario, Canada or Remote (Canada + 3 more)

AI Architect

Cloud Architect

4dDE

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)$300k – $430k Yearly

Engineering

Infrastructure Engineering

2wPL

Distributed Training Engineer

Periodic Labs

Menlo Park, California, United States (Hybrid)

AI Infrastructure Engineer

LLM Engineering

2wNV

Senior Capability Development Engineer

NVIDIA

Shenzhen Shi, Guangdong, China (On-site)

AI Engineering

LLM Engineering

3wSC

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly

AI Research Scientist

Applied Scientist

3wNV

Senior Research Scientist, Multi-Modal Language Models

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly

AI Research

Computer Vision Research

3wNV

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly

Cloud Infrastructure

DevOps Engineer

3wCR

Principal Engineer, AI Model LifeCycle

Crusoe

San Francisco, California, United States (On-site)$256k – $320k Yearly

AI Infrastructure Engineer

Cloud Engineer

6dAN

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)$300k – $485k Yearly

Cloud Infrastructure

Distributed Systems

2wRA

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)

AI Research

Applied Scientist

2wOP

Training: ML Framework Engineer

OpenAI

San Francisco, California, United States (Hybrid)$245k – $385k Yearly

Distributed Systems

Machine Learning Engineer

2wSC

AI Research Engineer, Enterprise Evaluations

Scale

San Francisco, California, United States (On-site)$179.4k – $224.3k Yearly

AI Evaluation

AI Research Engineer

2wBA

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)C$130k – C$200k Yearly

AI/ML

DevOps

2dNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly

AI Infrastructure

Deep Learning

1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly

AI Research

HPC Engineer

2wFU

Senior AI Engineer - Agent Team

FurtherAI

San Francisco, California, United States (On-site)$225k – $300k Yearly

AI Engineer

Applied AI

2wLA

Deployed Engineer (East)

LangChain

New York, New York, United States (On-site)$150k – $270k Yearly

Customer Engineering

Deployed Engineering

Inference Jobs

Applied AI Engineer – Agentic Workflows (Korea)

Research Engineer, Infrastructure, RL Systems

Backend Software Engineer (Evals) – Support Automation Engineering

Solutions Architect

Staff Software Engineer, ML Infrastructure

Distributed Training Engineer

Senior Capability Development Engineer

Staff Machine Learning Research Scientist, LLM Evals

Senior Research Scientist, Multi-Modal Language Models

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

Principal Engineer, AI Model LifeCycle

Senior/Staff Software Engineer, Inference

Member of Technical Staff - Post-Training

Training: ML Framework Engineer

AI Research Engineer, Enterprise Evaluations

Software Engineer, Model Performance Tooling

Senior Software Engineer, Quantized Inference

Senior ML Framework Performance Engineer - AI for Science at Scale

Senior AI Engineer - Agent Team

Deployed Engineer (East)

Related searches