1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization Jobs

Browse 445 LLM Inference Optimization jobs on Inference Jobs.

361-380 of 445 jobs

2wLA

JavaScript Engineer (Open Source Team)

LangChain

San Francisco, California, United States (On-site)$150k – $225k Yearly
6dAN

Research Engineer, Machine Learning (Horizons)

Anthropic

San Francisco, California, United States (Hybrid)$280k – $425k Yearly
1wSC

Machine Learning Engineer - Model Evaluations, Public Sector

Scale

San Francisco, California, United States (On-site)$216.3k – $300.3k Yearly
3wCR

Engineering Manager, Managed AI

Crusoe

San Francisco, California, United States (On-site)$237.6k – $288k Yearly
3dNV

Senior Software Engineer - Developer Tools for Deep Learning

NVIDIA

Westford, Massachusetts, United States (On-site)$152k – $287.5k Yearly
3wHA

Senior Forward Deployed Engineer

Harvey

New York, New York, United States (On-site)$200k – $260k Yearly
3dD-
2wBR

Open Source Engineer - Go

Braintrust

San Francisco, California, United States or Remote (United States)
6dTM

Research, Post-Training Data

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
6dNE

HPC System Engineer

Nebius

Amsterdam, North Holland, Netherlands (On-site)
2wFU

Senior AI Engineer - Agent Team

FurtherAI

San Francisco, California, United States (On-site)$225k – $300k Yearly
2wLA

Senior Technical Support Engineer

LangChain

United States (On-site)$130k – $170k Yearly
4dMO

Forward Deployed ML Engineer

Modal

New York, New York, United States (On-site)$180k – $250k Yearly
3wNV

Senior Technical Program Manager, Deep Learning Libraries

NVIDIA

Santa Clara, California, United States (On-site)$168k – $322k Yearly
5dCE

Compiler Engineer

Cerebras

Sunnyvale, California, United States (On-site)
3dFU

AI Engineer - Agent Team

FurtherAI

San Francisco, California, United States (On-site)$150k – $250k Yearly
23hNV

Devtech Compute Engineer

NVIDIA

Beijing, Beijing, China (On-site)