1. Home
  2. Jobs
  3. Reinforcement Learning from Human Feedback

Reinforcement Learning from Human Feedback Jobs

Browse 264 Reinforcement Learning from Human Feedback jobs on Inference Jobs.

61-80 of 264 jobs

3wNV

Research Scientist, Electronic Design Automation - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)$168k – $264.5k Yearly
2wSI

Product Manager, Intelligence

Sierra

San Francisco, California, United States (On-site)$175k – $350k Yearly
3wNV

Senior Scientific Machine Learning Engineer – Earth-2

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
6dAN

Research Engineer / Scientist, Alignment Science, London

Anthropic

London, England, United Kingdom (Hybrid)£250k – £270k Yearly
4wSC
3wNV

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
6dXA

Member of Technical Staff - Reasoning Efficiency

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wNV

Senior Capability Development Engineer

NVIDIA

Shenzhen Shi, Guangdong, China (On-site)
5dAN

[Expression of Interest] Research Scientist/Engineer, Honesty

Anthropic

New York, New York, United States (Hybrid)$315k – $340k Yearly
2wRA

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
2wPE

AI Research Lead

Perplexity

San Francisco, California, United States (On-site)$300k – $470k Yearly
3wNV

Senior Machine Learning Performance Engineer - Physics

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
2wOP

Software Engineer, Applied Evals

OpenAI

San Francisco, California, United States (Hybrid)$255k – $325k Yearly
3wAN

Research Engineer – Cybersecurity RL

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
2wAN

Research Engineer, Environment Scaling

Anthropic

United States (Hybrid)$350k – $850k Yearly
6dAN

Research Engineer / Scientist, Alignment Science

Anthropic

San Francisco, California, United States (Hybrid)$315k – $340k Yearly
2wPE

Model Behavior Architect

Perplexity

San Francisco, California, United States (On-site)$180k – $260k Yearly
2wOP

Research Scientist, Mathematical Sciences

OpenAI

San Francisco, California, United States (Hybrid)$380k – $460k Yearly
2wAI

Research Intern - Robotic Hardware, Simulation and Data

Applied Intuition

Sunnyvale, California, United States (On-site)