Model Safety Jobs
Browse 1,050 Model Safety jobs on Inference Jobs.
21-40 of 1,050 jobs
3wXA
2wOP
Researcher, Preparedness
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
2wAN
National Security Policy Lead
Anthropic
San Francisco, California, United States (Hybrid)$295k – $345k Yearly
2wOP
Research Engineer, Frontier Evals & Environments
OpenAI
San Francisco, California, United States (On-site)$200k – $370k Yearly
5dAN
Engineering Manager, ML Acceleration
Anthropic
San Francisco, California, United States (Hybrid)$425k – $560k Yearly
4wSC
Engineering Manager, AgentOps
Scale
San Francisco, California, United States (Hybrid)$216.2k – $270.3k Yearly
6dAN
Research Engineer / Scientist, Alignment Science
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly
1wOP
Product Manager, Model Behavior
OpenAI
San Francisco, California, United States (On-site)$255k – $325k Yearly
6dAN
Research Engineer, AI Observability
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wAN
2wOP
Developer Experience Engineer
OpenAI
San Francisco, California, United States (On-site)$280k – $345k Yearly
5dAN
Research Engineer, Frontier Red Team (Hardware Lead)
Anthropic
San Francisco, California, United States (Hybrid)$850k – $850k Yearly
2dAN
5dAN
[Expression of Interest] Research Manager, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$340k – $425k Yearly
6dAN
ML Infrastructure Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
4dAN
Offensive Security Research Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
5dAN
Research Engineer, Frontier Red Team (Autonomy)
Anthropic
San Francisco, California, United States (Hybrid)$350k – $850k Yearly