Direct Preference Optimization (DPO) Jobs

Browse 10 Direct Preference Optimization (DPO) jobs on Inference Jobs.

10 jobs

4d

Applied Research Engineer

Labelbox

San Francisco, California, United States (Hybrid)

$250k – $300k Yearly

2w

AI Researcher

Perplexity

San Francisco, California, United States (On-site)

$210k – $470k Yearly

3w

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site)

$300k – $470k Yearly

1w

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)

$200k – $280k Yearly

5d

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

2w

Member of Technical Staff - Alignment Lead

Reflection AI

San Francisco, California, United States (On-site)

5d

Research, Post-Training Data

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

1w

Product Policy - Data Scientist

OpenAI

San Francisco, California, United States (Hybrid)

$325k – $405k Yearly