1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization Jobs

Browse 426 LLM Inference Optimization jobs on Inference Jobs.

261-280 of 426 jobs

1wNV

Senior Software Developer, AI Networking

NVIDIA

Texas, United States (Remote)$184k – $356.5k Yearly
2wAN

Research Compute Operations

Anthropic

San Francisco, California, United States (Hybrid)$270k – $290k Yearly
3wAI

AI Infrastructure Engineer - Autonomy

Applied Intuition

Sunnyvale, California, United States (On-site)$153k – $222k Yearly
3wNV

Software Architect, Advanced Development

NVIDIA

Yokne'am, Northern District, Israel (On-site)
2wNV

AI Safety Scientist, Deep Learning

NVIDIA

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-site)
3wNV

Senior Research Scientist, Multi-Modal Language Models

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
2wDE

Staff Research Engineer

Decagon

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wLA

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)
6dCR

Principal Product Manager, General Compute (SF, Sunnyvale, New York)

Crusoe

San Francisco, California, United States (Hybrid)$260.8k – $326k Yearly
3dNV

Lead Principal Engineer, Enterprise Agentic AI Platform

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
1wTE

Software Engineer, Kernel Development and Optimization

Tenstorrent

Gdańsk, Pomeranian Voivodeship, Poland (Hybrid)
4wXA

Member of Technical Staff, Midtraining

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
1wAN

[Expression of Interest] Research Scientist/Engineer, Honesty

Anthropic

New York, New York, United States (Hybrid)$315k – $340k Yearly
2wLA

Deployed Engineer (East)

LangChain

New York, New York, United States (On-site)$150k – $270k Yearly
4wNV

Software Product Manager - Nemotron

NVIDIA

Santa Clara, California, United States (On-site)$240k – $379.5k Yearly
2wOP

Research Engineer, Frontier Evals & Environments

OpenAI

San Francisco, California, United States (On-site)$200k – $370k Yearly
1wTM

Research Engineer, Infrastructure, Tinker

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wLA

Deployed Engineer (West)

LangChain

San Francisco, California, United States (On-site)$150k – $270k Yearly
2wOP

Research Engineer, Privacy

OpenAI

San Francisco, California, United States (On-site)$380k – $460k Yearly