1. Home
  2. Jobs
  3. Inference Optimization

Inference Optimization Jobs

Browse 482 Inference Optimization jobs on Inference Jobs.

41-60 of 482 jobs

5dVA

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site)$160k – $320k Yearly
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
1wCO

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)
9hNV

Senior ML Compiler Engineer

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wPE

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)$300k – $385k Yearly
1wOP

Software Engineer, Monetization Delivery

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
2wMA

Research Engineer

Magic

San Francisco, California, United States (On-site)$225k – $550k Yearly
5dVA

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)$160k – $320k Yearly
3dNV

Senior Compiler Engineer - AI

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
2wSE

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)$175k – $280k Yearly
2wHF

Senior Open-Source Machine Learning Engineer, Computer Vision - EMEA Remote

Hugging Face

Île de Ré, Charente-Maritime, France or Remote (Europe, Middle East, and Africa)
5dAN

Research Engineer, Discovery

Anthropic

San Francisco, California, United States (Hybrid)$340k – $425k Yearly
5dAN

Staff Research Engineer, Discovery Team

Anthropic

San Francisco, California, United States (Hybrid)$340k – $425k Yearly
1wOP

Research Engineer, Codex

OpenAI

San Francisco, California, United States (Hybrid)$380k – $460k Yearly
1wOP

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)$460k – $555k Yearly
5dSC

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site)$218.4k – $273k Yearly