Reinforcement Learning with Human Feedback (RLHF) Jobs
Explore Reinforcement Learning with Human Feedback (RLHF) roles on Inference Jobs and apply today.
3w ago
Applied Research Engineer, Agents
Labelbox
San Francisco, California, United States (Hybrid) $250K – $300K Yearly
2w ago
Research, Post-Training Data
Thinking Machines Lab
San Francisco, California, United States (On-site) $350K – $475K Yearly
2w ago
Research Engineer, Performance RL
Anthropic
San Francisco, California, United States (Hybrid) $350K – $850K Yearly
2mo ago
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale
San Francisco, California, United States (On-site) $252K – $315K Yearly
3mo ago
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Paris, Paris, France or Remote (United States + 3 more)
2w ago
VP of Product, Research and Training Infrastructure
CoreWeave
Livingston, New Jersey, United States (Hybrid) $233K – $341K Yearly
3w ago
Research Scientist, AI Controls and Monitoring
Scale
San Francisco, California, United States (On-site) $197.4K – $246.8K Yearly
1mo ago
Senior AI Researcher - Reinforcement learning (f/m/d)
Aleph Alpha
Heidelberg, Baden-Württemberg, Germany (Hybrid)
3mo ago
Member of Technical Staff - Safety Lead
Reflection AI
San Francisco, California, United States (On-site)
4d ago
Forward Deployed Engineer, RL Environments
Labelbox
San Francisco, California, United States (Hybrid) $140K – $200K Yearly
3w ago
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site) $200K – $280K Yearly
2d ago
Anthropic Fellows Program — Reinforcement Learning
Anthropic
United States + 2 more (Remote) $3.9K Weekly
1w ago
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid) $350K – $500K Yearly
3mo ago
Software Product Manager - Nemotron
NVIDIA
Santa Clara, California, United States (On-site) $240K – $379.5K Yearly
1w ago
Staff Machine Learning Engineer, Virtual Collaborator
Anthropic
New York, New York, United States (Hybrid) $500K – $850K Yearly
2mo ago
Member of Technical Staff - RL Infrastructure [data, evals, agent]
xAI
Palo Alto, California, United States (On-site) $180K – $440K Yearly
4d ago
Staff Reinforcement Learning Engineer – Whole Body Control
Figure
San Jose, California, United States (On-site) $150K – $250K Yearly
1w ago
Research Engineer, Reward Models Platform
Anthropic
San Francisco, California, United States (Hybrid) $350K – $500K Yearly