1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization Jobs

Browse 439 LLM Inference Optimization jobs on Inference Jobs.

421-439 of 439 jobs

2wPW

Member of Technical Staff, Infrastructure & Scaling

Parallel Web Systems

San Francisco, California, United States (On-site)
2wMA

Research Scientist - Model Team

Mirelo AI

Berlin, Berlin, Germany (Hybrid)
2wLA

FullStack Engineer, Observability & Evals Platform (LangSmith)

LangChain

San Francisco, California, United States (On-site)$145k – $180k Yearly
2wLA

Education Engineer, Fullstack

LangChain

San Francisco, California, United States (On-site)$150k – $185k Yearly
2wD-

Machine Learning Research Intern

d-Matrix

Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
2wAI

Senior Software Engineer - ML Infrastructure

Applied Intuition

Sunnyvale, California, United States (On-site)$153k – $222k Yearly
6dTE

Software Engineer

Tenstorrent

東京都, Tokyo Prefecture, Japan (On-site)
2wOP

Research Engineer/Research Scientist, RL/Reasoning

OpenAI

San Francisco, California, United States (Hybrid)$310k – $460k Yearly
4wOP

Power Architect

OpenAI

San Francisco, California, United States (On-site)$295k – $445k Yearly
2wNV

Performance Engineering Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)
5dRU
6dCO

AI Solutions Engineer, Post Sales- W&B

CoreWeave

Livingston, New Jersey, United States (Hybrid)$165k – $242k Yearly
2wOP

Security Researcher, Trusted Computing and Cryptography

OpenAI

United States or Remote (United States)$324k – $490k Yearly
2wNV

Senior Data Scientist – Enterprise AI Systems

NVIDIA

Santa Clara, California, United States (On-site)$168k – $322k Yearly
2wCR

Product Security Engineer - AI

Crusoe

San Francisco, California, United States (On-site)$135k – $150k Yearly
2wD-

Analog Design Engineer, Senior Staff

d-Matrix

Santa Clara, California, United States (Hybrid)$196k – $300k Yearly