Direct Preference Optimization (DPO) Jobs
Browse 10 Direct Preference Optimization (DPO) jobs on Inference Jobs.
10 jobs
4dLA
Applied Research Engineer
Labelbox
San Francisco, California, United States (Hybrid)$250k – $300k Yearly
3wPE
Research Engineering Manager - Model Training
Perplexity
San Francisco, California, United States (On-site)$300k – $470k Yearly
1wTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200k – $280k Yearly
5dSC
Machine Learning Research Scientist / Research Engineer, Post-Training
Scale
San Francisco, California, United States (On-site)$252k – $315k Yearly
2wRA
Member of Technical Staff - Alignment Lead
Reflection AI
San Francisco, California, United States (On-site)
5dTM
Research, Post-Training Data
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
1wOP
Product Policy - Data Scientist
OpenAI
San Francisco, California, United States (Hybrid)$325k – $405k Yearly