Reward Hacking Jobs

Browse 7 Reward Hacking jobs on Inference Jobs.

7 jobs

2w ago

Senior Research Scientist, Reward Models

Anthropic

San Francisco, California, United States (Hybrid)
$350K – $500K Yearly

2w ago

Research Engineer, Reward Models Platform

Anthropic

San Francisco, California, United States (Hybrid)
$350K – $500K Yearly

4d ago

Researcher, Alignment Science

OpenAI

San Francisco, California, United States (Hybrid)
$250K – $445K Yearly

4d ago

Researcher, Misalignment Research

OpenAI

San Francisco, California, United States (On-site)
$295K – $445K Yearly
2w ago

Baseten
2w ago

Research Engineer – Cybersecurity RL

Anthropic

San Francisco, California, United States (Hybrid)
$300K – $405K Yearly
2d ago

xAI