Reinforcement Learning from Human Feedback Jobs
Browse 265 Reinforcement Learning from Human Feedback jobs on Inference Jobs.
101-120 of 265 jobs
6d
Research Engineer / Research Scientist, Biology & Life Sciences
Anthropic
San Francisco, California, United States (Hybrid)
$315k – $340k Yearly
2w
Research Engineer - Pretraining
Anthropic
London, England, United Kingdom (Hybrid)
£260k – £630k Yearly
2w
Tech Lead Manager - Agents
Perplexity
San Francisco, California, United States (On-site)
$300k – $385k Yearly
2w
Machine Learning Fellow - Human Frontier Collective (UK)
Scale
United Kingdom (Remote)
Up to $166.4k Hourly
3w
Principal Engineer, AI Model LifeCycle
Crusoe
San Francisco, California, United States (On-site)
$256k – $320k Yearly
3w
Staff Software Engineer, Model LifeCycle
Crusoe
San Francisco, California, United States (On-site)
$204k – $247k Yearly
2w
Senior Product Engineer - Training Platform
Baseten
San Francisco, California, United States (On-site)
$200k – $275k Yearly
2w
AI Software Engineer - Comet Agents
Perplexity
San Francisco, California, United States (On-site)
$210k – $385k Yearly
4w
Agentic AI Solution Engineering Intern - Summer 2026
NVIDIA
Austin, Texas, United States (On-site)
$20 – $71 Hourly
6d
Applied Research Lead, Model Scaling
Runway
New York, New York, United States or Remote (North America + 1 more)
$280k – $380k Yearly
2w
Program Manager, Human Data
OpenAI
San Francisco, California, United States (On-site)
$180k – $240k Yearly
1w
X Memes and Headline Commentary Tutor
xAI
Palo Alto, California, United States or Remote (United States)
$45 – $100 Hourly