Reward Learning Jobs
Browse 943 Reward Learning jobs on Inference Jobs.
6d
Research, Post-Training
Thinking Machines Lab
San Francisco, California, United States (On-site) $350k – $475k Yearly
6d
Staff Machine Learning Engineer, Virtual Collaborator
Anthropic
San Francisco, California, United States (Hybrid) $340k – $560k Yearly
6d
Research, Post-Training Data
Thinking Machines Lab
San Francisco, California, United States (On-site) $350k – $475k Yearly
2w
Member of Technical Staff - Alignment Lead
Reflection AI
San Francisco, California, United States (On-site)
6d
Machine Learning Research Scientist / Research Engineer, Post-Training
Scale
San Francisco, California, United States (On-site) $252k – $315k Yearly
2w
Member of Technical Staff - Post-Training
Reflection AI
San Francisco, California, United States (On-site)
1w
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site) $200k – $280k Yearly
5d
[Expression of Interest] Research Scientist/Engineer, Alignment Finetuning
Anthropic
San Francisco, California, United States (Hybrid) $315k – $340k Yearly
2w
Research Engineer - Reinforcement Learning, Self-Driving
Applied Intuition
Sunnyvale, California, United States (On-site) $126k – $423k Yearly
6d
Applied Research Lead, Reinforcement Learning
Runway
New York, New York, United States or Remote (North America + 1 more) $280k – $380k Yearly
2w
Research Intern - Reinforcement Learning, Robotics
Applied Intuition
Sunnyvale, California, United States (On-site)
4w
Full Stack Software Engineer, Reinforcement Learning
Anthropic
San Francisco, California, United States (Hybrid) $300k – $405k Yearly
3w
Software Engineer, Human Data Interface
Anthropic
San Francisco, California, United States (Hybrid) $320k – $405k Yearly
6d
Member of Technical Staff, RL Training Framework
xAI
Palo Alto, California, United States (On-site) $180k – $440k Yearly
2w
Machine Learning Fellow - Human Frontier Collective (UK)
Scale
United Kingdom (Remote) Up to $166.4k Hourly
5d
Machine Learning Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid) $315k – $425k Yearly