Reinforcement Learning from Human Feedback (RLHF) Jobs
Explore Reinforcement Learning from Human Feedback (RLHF) roles on Inference Jobs and apply today.
3w ago
Research Engineer, Performance RL
Anthropic
San Francisco, California, United States (Hybrid)
$350K – $850K Yearly
4w ago
Applied Research Engineer, Agents
Labelbox
San Francisco, California, United States (Hybrid)
$250K – $300K Yearly
3mo ago
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Paris, Paris, France or Remote (United States + 3 more)
2w ago
VP of Product, Research and Training Infrastructure
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$233K – $341K Yearly
2mo ago
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)
$252K – $315K Yearly
3w ago
Research, Post-Training Data
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350K – $475K Yearly
1mo ago
Senior AI Researcher - Reinforcement Learning (f/m/d)
Aleph Alpha
Heidelberg, Baden-Württemberg, Germany (Hybrid)
4w ago
Research Scientist, AI Controls and Monitoring
Scale
San Francisco, California, United States (On-site)
$197.4K – $246.8K Yearly
3mo ago
Member of Technical Staff - Safety Lead
Reflection AI
San Francisco, California, United States (On-site)
5d ago
Anthropic Fellows Program — Reinforcement Learning
Anthropic
United States + 2 more (Remote)
$3.9K Weekly
3mo ago
Software Product Manager - Nemotron
NVIDIA
Santa Clara, California, United States (On-site)
$240K – $379.5K Yearly
4w ago
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)
$200K – $280K Yearly
2w ago
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)
$350K – $500K Yearly
6d ago
Staff Reinforcement Learning Engineer – Whole Body Control
Figure
San Jose, California, United States (On-site)
$150K – $250K Yearly
2mo ago
Member of Technical Staff - RL Infrastructure [data, evals, agent]
xAI
Palo Alto, California, United States (On-site)
$180K – $440K Yearly
2w ago
Staff Machine Learning Engineer, Virtual Collaborator
Anthropic
New York, New York, United States (Hybrid)
$500K – $850K Yearly
2w ago
Research Engineer, Reward Models Platform
Anthropic
San Francisco, California, United States (Hybrid)
$350K – $500K Yearly
2mo ago
Research Engineer, Machine Learning (Reinforcement Learning)
Anthropic
London, England, United Kingdom (Hybrid)
£1 – £1 Yearly
3mo ago