1. Home
  2. Jobs
  3. Model Optimization

Model Optimization jobs

Explore Model Optimization roles on Inference Jobs and apply today.

21-40 of 1,098 jobs

NV4w

Senior Applied Researcher, Foundational AI Models for Biology

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)

OP2w

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

D-2w

ML Compiler Architect, Senior Principal

d-Matrix

Toronto, Ontario, Canada (Hybrid)

HF2w

Senior Open-Source Machine Learning Engineer, Computer Vision - EMEA Remote

Hugging Face

Île de Ré, Charente-Maritime, France or Remote (Europe, Middle East, and Africa)

CO2w

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)

AI4d

Senior Autonomy Engineer

Applied Intuition

London, England, United Kingdom (On-site)

OP2w

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380k – $380k Yearly

SC1w

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

OP1w

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)

$455k – $555k Yearly

VE3w

Product Manager

Vertiv

United States (On-site)

CE3w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)

CO7d

Research Engineer

Cohere

Canada + 3 more (Remote)

CE2w

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)

D-4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)

$30 – $59 Hourly

HA2w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

LA7d

Applied Research Engineer

Labelbox

San Francisco, California, United States (Hybrid)

$250k – $300k Yearly

MA2w

Training Infrastructure Engineer

Mirelo AI

Berlin, Berlin, Germany (Hybrid)

NV3w

Deeplearning Software Engineer -- Neural 3D reconstruction

NVIDIA

Shanghai, Shanghai, China (On-site)

NV3d

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

PL2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)