Reinforcement Learning from Human Feedback (RLHF) Jobs
Browse 271 Reinforcement Learning from Human Feedback (RLHF) jobs on Inference Jobs.
21-40 of 271 jobs
4w
Software Product Manager - Nemotron
NVIDIA
Santa Clara, California, United States (On-site) $240k – $379.5k Yearly
2w
Research Engineer/Research Scientist, RL/Reasoning
OpenAI
San Francisco, California, United States (Hybrid) $310k – $460k Yearly
4w
Member of Technical Staff - RL Infrastructure [data, evals, agent]
xAI
Palo Alto, California, United States (On-site) $180k – $440k Yearly
2w
Member of Technical Staff - Data Quality Engineer (Post-training)
Reflection AI
San Francisco, California, United States (On-site)
7d
Member of Technical Staff - Multimodal Post-training
xAI
Palo Alto, California, United States (On-site) $180k – $440k Yearly
7d
Staff Machine Learning Engineer, Virtual Collaborator
Anthropic
San Francisco, California, United States (Hybrid) $340k – $560k Yearly
7d
Member of Technical Staff - Reasoning Efficiency
xAI
Palo Alto, California, United States (On-site) $180k – $440k Yearly
7d
Applied Research Lead, Reinforcement Learning
Runway
New York, New York, United States or Remote (North America + 1 more) $280k – $380k Yearly
4w
Full Stack Software Engineer, Reinforcement Learning
Anthropic
San Francisco, California, United States (Hybrid) $300k – $405k Yearly
6d
Machine Learning Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid) $315k – $425k Yearly
6d
Machine Learning Research Engineer - Robotics
Scale
San Francisco, California, United States (On-site) $218.4k – $273k Yearly
4d
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site) $152k – $287.5k Yearly