Model Safety Jobs
Browse 1,050 Model Safety jobs on Inference Jobs.
1,050 jobs
2wOP
Model Policy Manager, Chemical & Biological Risk
OpenAI
San Francisco, California, United States (Hybrid)$207k – $295k Yearly
6dAN
Research Engineer, Model Evaluations
Anthropic
San Francisco, California, United States (Hybrid)$300k – $405k Yearly
3wAN
Research Product Manager, Model Behaviors
Anthropic
San Francisco, California, United States (Hybrid)$305k – $385k Yearly
2wRA
Member of Technical Staff - Safety Lead
Reflection AI
San Francisco, California, United States (On-site)
2wOP
Product Manager, API Model Behavior
OpenAI
San Francisco, California, United States (On-site)$325k – $405k Yearly
6dAN
Research Engineer, Production Model Post Training
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly
2wAI
System Safety Engineer Autonomous Driving - Autonomy
Applied Intuition
Sunnyvale, California, United States (On-site)$118k – $250k Yearly
2wOP
Technical Program Manager – Adversarial Model Research
OpenAI
San Francisco, California, United States (Hybrid)$230k – $285k Yearly
6dAN
Research Engineer, Production Model Post-Training - London
Anthropic
London, England, United Kingdom (Hybrid)£270k – £340k Yearly
1wOP
Researcher, Robustness & Safety Training
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
5dAN
Developer Relations, MCP
Anthropic
San Francisco, California, United States (Hybrid)$305k – $385k Yearly
5dAN
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)$315k – $340k Yearly
2wOP
2wOP
TLM, Machine Learning, Integrity
OpenAI
San Francisco, California, United States (On-site)$405k – $490k Yearly
2wOP
Research Engineer, Privacy
OpenAI
San Francisco, California, United States (On-site)$380k – $460k Yearly