1. Home
  2. Jobs
  3. Inference Optimization

Inference Optimization Jobs

Browse 482 Inference Optimization jobs on Inference Jobs.

21-40 of 482 jobs

14hNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
3wXA

Member of Technical Staff - Multimodal Interactions Post-training

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
1wD-

Senior Staff ML Researcher - LLM Algorithmic Optimization

d-Matrix

Bengaluru, Karnataka, India (Hybrid)₹4M – ₹6M Yearly
2wNV

Senior Developer Relations Manager - COSMOS and Foundation Models

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
5dAN

ML Infrastructure Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wAI

Machine Learning Engineer - Defense

Applied Intuition

Ann Arbor, Michigan, United States (On-site)$130k – $200k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
5dAN

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
3wNV

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
3wXA

Member of Technical Staff, Grok Imagine

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wMA

Software Engineer, Technical Lead, Inference

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)
1wOP

Software Engineer, Productivity

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
2wRA

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)
2wNV

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
1wOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
2wNE

Field CTO - Media & Entertainment

Nebius

United States (Remote)$295k – $365k Yearly
4dAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
3dCR

Principal Product Manager, General Compute (SF, Sunnyvale, New York)

Crusoe

San Francisco, California, United States (Hybrid)$260.8k – $326k Yearly
5dXA

Member of Technical Staff, RL Training Framework

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly