Direct Preference Optimization (DPO) Jobs

Explore Direct Preference Optimization (DPO) roles on Inference Jobs and apply today.

3mo ago

AI Researcher

Perplexity

San Francisco, California, United States (On-site) $210K – $470K Yearly
2mo ago

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site) $300K – $470K Yearly
3mo ago

Researcher, Post Training

Cartesia

San Francisco, California, United States (On-site) $180K – $350K Yearly
2mo ago

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) $200K – $280K Yearly
3mo ago

AI Research Lead

Perplexity

San Francisco, California, United States (On-site) $300K – $470K Yearly
3w ago

Research, Post-Training

Thinking Machines Lab

San Francisco, California, United States (On-site) $350K – $475K Yearly
3w ago

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site) $350K – $475K Yearly
3w ago

Research Scientist, Agent Robustness

Scale

San Francisco, California, United States (On-site) $197.4K – $246.8K Yearly