RL Algorithms Jobs
Browse 168 RL Algorithms jobs on Inference Jobs.
21-40 of 168 jobs
1wTM
Research, Post-Training
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
1wXA
Member of Technical Staff - Multimodal Post-training
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wXA
Member of Technical Staff - Coding Agents
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
6dLA
Applied Research Engineer, Agents
Labelbox
San Francisco, California, United States (Hybrid)$250k – $300k Yearly
1wTM
Research, Post-Training Data
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
1wXA
Member of Technical Staff - Reasoning Post-training
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
6dAN
[Expression of Interest] Research Scientist/Engineer, Alignment Finetuning
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly
7dAN
Research Engineer, Production Model Post-Training - London
Anthropic
London, England, United Kingdom (Hybrid)£270k – £340k Yearly
6dLA
2wOP
Researcher, Robustness & Safety Training
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
1wSC
Machine Learning Research Scientist / Research Engineer, Post-Training
Scale
San Francisco, California, United States (On-site)$252k – $315k Yearly
7dAN
Research Engineer, Production Model Post Training
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly
6dAN
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)$315k – $340k Yearly
2wRA
Member of Technical Staff - Pre-Training
Reflection AI
San Francisco, California, United States (On-site)
2wRA
Member of Technical Staff - Alignment Lead
Reflection AI
San Francisco, California, United States (On-site)