1. Home
  2. Jobs
  3. LLM Inference

LLM Inference Jobs

Browse 406 LLM Inference jobs on Inference Jobs.

181-200 of 406 jobs

6dCR

Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$204k – $247k Yearly
1wTM

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
1wAN

[Expression of Interest] Research Scientist/Engineer, Alignment Finetuning

Anthropic

San Francisco, California, United States (Hybrid)$315k – $340k Yearly
2wHE

Senior LLMOps Engineer

Heidi

Sydney, New South Wales, Australia (Hybrid)
3wNV

Senior Capability Development Engineer

NVIDIA

Shenzhen Shi, Guangdong, China (On-site)
1wVA

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site)$160k – $320k Yearly
1wAN

Research Engineer, Discovery

Anthropic

San Francisco, California, United States (Hybrid)$340k – $425k Yearly
1wAN

Machine Learning Systems Engineer, Research Tools

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wMO

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)$150k – $270k Yearly
1wAN

ML Infrastructure Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
1wNV

Senior Manager, Engineering - Enterprise AI and Automation

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
4dNV

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site)C$225k – C$275k Yearly
2wRA
1wXA

Member of Technical Staff, RL Training Framework

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wLA

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)
2wBA

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)C$130k – C$200k Yearly
1wAC

Infrastructure Engineer, ML Systems

Applied Compute

San Francisco, California, United States (On-site)
2wLA

Deployed Engineer (East)

LangChain

New York, New York, United States (On-site)$150k – $270k Yearly
1dTA

Product Marketing Intern (Summer 2026)

Together AI

San Francisco, California, United States (On-site)From $43 Hourly