Reward Modeling Jobs

Browse 941 Reward Modeling jobs on Inference Jobs.

3w

Software Engineer, Human Data Interface

Anthropic

San Francisco, California, United States (Hybrid) $320k – $405k Yearly
2w

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)
4d

[Expression of Interest] Research Scientist/Engineer, Alignment Finetuning

Anthropic

San Francisco, California, United States (Hybrid) $315k – $340k Yearly
7d

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site) $200k – $280k Yearly
2w

Member of Technical Staff - Alignment Lead

Reflection AI

San Francisco, California, United States (On-site)
5d

Staff Machine Learning Engineer, Virtual Collaborator

Anthropic

San Francisco, California, United States (Hybrid) $340k – $560k Yearly
5d

Research, Post-Training

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly
5d

Research, Post-Training Data

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly
2w

Compensation Analytics Manager

OpenAI

San Francisco, California, United States or Remote (United States) $187.2k – $260k Yearly
5d

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale

San Francisco, California, United States (On-site) $252k – $315k Yearly
2w

Research Engineer, Environment Scaling

Anthropic

United States (Hybrid) $350k – $850k Yearly
2w

Compensation Business Partner

OpenAI

San Francisco, California, United States (Hybrid) $234k – $260k Yearly
2w

Senior Compensation Partner

Crusoe

San Francisco, California, United States (On-site) $165k – $235k Yearly
2w

Technical Program Manager – Adversarial Model Research

OpenAI

San Francisco, California, United States (Hybrid) $230k – $285k Yearly
2w

Director, Revenue Operations - Real Estate & Infrastructure

Crusoe

San Francisco, California, United States (On-site) $230k – $260k Yearly
5d

Model Behavior Tutor - Wit & Conversation

xAI

Wyoming, United States + 1 more (Remote) $50 – $70 Hourly