1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization Jobs

Browse 445 LLM Inference Optimization jobs on Inference Jobs.

401-420 of 445 jobs

2wCA

Researcher: Model Architecture, UK

Cartesia

London, England, United Kingdom (On-site)
2wPE

Full Stack Software Engineer - Applied AI

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
6dCO

Solutions Architect - HPC/AI/ML

CoreWeave

London, England, United Kingdom (Hybrid)£116k – £155k Yearly
2wOP

Senior Research Engineer/Scientist - Edge, Consumer Products

OpenAI

San Francisco, California, United States (Hybrid)$380k – $460k Yearly
2wLA

Python OSS Engineer

LangChain

San Francisco, California, United States (On-site)$160k – $225k Yearly
1wCO

Full-Stack Software Engineer, Inference

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)
2wVE

AI Architect

Vertiv

Westerville, Ohio, United States (On-site)
2wLA

Software Engineering Manager, Observability & Evals Platform

LangChain

San Francisco, California, United States (On-site)$200k – $250k Yearly
2wCO

Member of Technical Staff, Modeling

Cohere

London, England, United Kingdom or Remote (Worldwide)
6dAN

Research Engineer, Model Evaluations

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
3wAI

Machine Learning Engineer - Defense

Applied Intuition

Washington, District of Columbia, United States (On-site)$150k – $225k Yearly
1wSC

Machine Learning Research Engineer, GenAI Applied ML

Scale

San Francisco, California, United States (On-site)$176k – $220k Yearly
2wCA

Senior Applied Researcher, Audio Understanding

Cartesia

San Francisco, California, United States (On-site)$200k – $350k Yearly
2wNV

Senior Performance Architect - Heterogeneous Workload Optimization

NVIDIA

Santa Clara, California, United States (Hybrid)$184k – $356.5k Yearly
2wAN

Software Engineer, AI Reliability

Anthropic

San Francisco, California, United States (Hybrid)$325k – $485k Yearly
2wOP

Security Researcher, Trusted Computing and Cryptography

OpenAI

United States or Remote (United States)$324k – $490k Yearly
2wRA

Member of Technical Staff - Data Ingestion Engineer

Reflection AI

San Francisco, California, United States (On-site)