RL Training Pipelines Jobs
Browse 23 RL Training Pipelines jobs on Inference Jobs.
23 jobs
6d ago
OP
1w ago
AN
Research Engineer, Machine Learning (RL Velocity)
Anthropic
San Francisco, California, United States (Hybrid)$500K – $850K Yearly
6d ago
CO
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Paris, FR or Remote (Eastern Time Zone, United States + 27 more)
2w ago
FI
6d ago
OP
Researcher, Artifacts - Agent Post-Training
OpenAI
California, United States (Remote)$250K – $380K Yearly
15h ago
PO
Member of Engineering (Reinforcement Learning)
Poolside
Europe, Middle East, and Africa, North America (Remote)
5d ago
OP
Researcher, Computer Use - Agent Post-Training
OpenAI
San Francisco, California, United States (On-site)$250K – $380K Yearly
3d ago
XA
Member of Technical Staff - Post-Training and RL
xAI
Palo Alto, California, United States (On-site)$180K – $600K Yearly
2w ago
XA
Member of Technical Staff - RL Infrastructure [data, evals, agent]
xAI
Palo Alto, California, United States (On-site)$180K – $440K Yearly
6d ago
OP
2w ago
TM
Research Engineer, Infrastructure, RL Systems
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
1w ago
AN
Research Engineer, RL Infrastructure and Reliability (Knowledge Work)
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2w ago
LA
Forward Deployed Engineer, RL Environments
Labelbox
San Francisco, California, United States (Hybrid)$140K – $200K Yearly
2w ago
AN
Research Engineer, Performance RL
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2w ago
AN
Machine Learning Systems Engineer, RL Engineering
Anthropic
San Francisco, California, United States (Hybrid)$500K – $850K Yearly
1w ago
AN
Research Engineer, Machine Learning (RL Velocity)
Anthropic
London, England, United Kingdom (Hybrid)£370K – £630K Yearly
2w ago
BA
2w ago
PL