1. Home
  2. Jobs
  3. LLM Infrastructure

LLM Infrastructure Jobs

Browse 1,012 LLM Infrastructure jobs on Inference Jobs.

81-100 of 1,012 jobs

2wOP

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
6dBA

Software Engineer — GPU Networking & Distributed Systems

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
1wCE

Full Stack LLM Engineer

Cerebras

Toronto, Ontario, Canada (On-site)
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wD-

Senior Staff ML Researcher - LLM Algorithmic Optimization

d-Matrix

Bengaluru, Karnataka, India (Hybrid)₹4M – ₹6M Yearly
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
2wCE

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)
2wSC

Tech Lead Manager, Machine Learning Research Scientist- LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
1wNV

Senior LLM Agents Architect

NVIDIA

Yokneam Ilit, Northern District, Israel (Hybrid)
5dNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
7dNV
2wSE

ML Engineer

Sesame

New York, New York, United States (On-site)$190k – $320k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wNV

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
3wSC

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
2wCO

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)
2wSC

Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI

Scale

San Francisco, California, United States (On-site)$218k – $273k Yearly
2wLA

Python OSS Engineer

LangChain

San Francisco, California, United States (On-site)$160k – $225k Yearly
2wLA

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)
1wAN

Startups Solutions Architect, Applied AI

Anthropic

San Francisco, California, United States (Hybrid)$240k – $270k Yearly