Reward Hacking Jobs
Browse 7 Reward Hacking jobs on Inference Jobs.
7 jobs
2w ago
AN
Senior Research Scientist, Reward Models
Anthropic
San Francisco, California, United States (Hybrid)$350K – $500K Yearly
2w ago
AN
Research Engineer, Reward Models Platform
Anthropic
San Francisco, California, United States (Hybrid)$350K – $500K Yearly
4d ago
OP
Researcher, Alignment Science
OpenAI
San Francisco, California, United States (Hybrid)$250K – $445K Yearly
4d ago
OP
Researcher, Misalignment Research
OpenAI
San Francisco, California, United States (On-site)$295K – $445K Yearly
2w ago
BA
2w ago
AN
Research Engineer – Cybersecurity RL
Anthropic
San Francisco, California, United States (Hybrid)$300K – $405K Yearly
2d ago
XA
Member of Technical Staff - Post-Training and RL
xAI
Palo Alto, California, United States (On-site)$180K – $600K Yearly