DPO (Direct Preference Optimization) Jobs
Browse 8 DPO (Direct Preference Optimization) jobs on Inference Jobs.
2w ago
Applied Research Engineer
Labelbox
San Francisco, California, United States (Hybrid)
$250K – $300K Yearly
2d ago
Member of Technical Staff - Post-Training and RL
xAI
Palo Alto, California, United States (On-site)
$180K – $600K Yearly
2d ago
Forward Deployed Engineer - LLM Post-training
Reflection AI
San Francisco, California, United States (On-site)
2w ago
BA
3w ago
Forward Deployed Engineer (Inference & Post-Training)
Together AI
San Francisco, California, United States (On-site)
$270K – $300K Yearly
2w ago
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)
$200K – $280K Yearly
4d ago
Compute Optimization Researcher/Engineer
OpenAI
San Francisco, California, United States (Hybrid)
$293K – $455K Yearly