Reward Learning Jobs

Browse 943 Reward Learning jobs on Inference Jobs.

6d

Research, Post-Training

Thinking Machines Lab

San Francisco, California, United States (On-site) · $350k – $475k Yearly

6d

Staff Machine Learning Engineer, Virtual Collaborator

Anthropic

San Francisco, California, United States (Hybrid) · $340k – $560k Yearly

6d

Research, Post-Training Data

Thinking Machines Lab

San Francisco, California, United States (On-site) · $350k – $475k Yearly

2w

Member of Technical Staff - Alignment Lead

Reflection AI

San Francisco, California, United States (On-site)

6d

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale

San Francisco, California, United States (On-site) · $252k – $315k Yearly

2w

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)

2w

Research Engineer, Environment Scaling

Anthropic

United States (Hybrid) · $350k – $850k Yearly

1w

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) · $200k – $280k Yearly

5d

[Expression of Interest] Research Scientist/Engineer, Alignment Finetuning

Anthropic

San Francisco, California, United States (Hybrid) · $315k – $340k Yearly

2w

Research Engineer - Reinforcement Learning, Self-Driving

Applied Intuition

Sunnyvale, California, United States (On-site) · $126k – $423k Yearly

6d

Applied Research Lead, Reinforcement Learning

Runway

New York, New York, United States or Remote (North America + 1 more) · $280k – $380k Yearly

2w

Research Intern - Reinforcement Learning, Robotics

Applied Intuition

Sunnyvale, California, United States (On-site)

4w

Full Stack Software Engineer, Reinforcement Learning

Anthropic

San Francisco, California, United States (Hybrid) · $300k – $405k Yearly

3w

Software Engineer, Human Data Interface

Anthropic

San Francisco, California, United States (Hybrid) · $320k – $405k Yearly

6d

Member of Technical Staff, RL Training Framework

xAI

Palo Alto, California, United States (On-site) · $180k – $440k Yearly

5d

Machine Learning Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid) · $315k – $425k Yearly