AI Safety Researcher Jobs
Browse 1,656 AI Safety Researcher jobs on Inference Jobs.
1,656 jobs
2wOP
2wOP
Researcher, Preparedness
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
2wOP
Researcher, Interpretability
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
2wOP
Researcher, Robustness & Safety Training
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
5dAN
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)$315k – $340k Yearly
6dAN
Research Engineer / Scientist, Alignment Science
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly
5dAN
Research Engineer / Scientist, Frontier Red Team (Cyber)
Anthropic
San Francisco, California, United States (Hybrid)$350k – $850k Yearly
6dAN
Research Scientist, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$315k – $560k Yearly
2wOP
Researcher, Misalignment Research
OpenAI
New York, New York, United States or Remote (New York, United States)$380k – $460k Yearly
6dAN
Research Engineer / Scientist, Alignment Science, London
Anthropic
London, England, United Kingdom (Hybrid)£250k – £270k Yearly
3wAN
Research Engineer – Cybersecurity RL
Anthropic
San Francisco, California, United States (Hybrid)$300k – $405k Yearly
6dAN
Research Engineer, AI Observability
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
3wAN
Research Scientist, Societal Impacts
Anthropic
San Francisco, California, United States (Hybrid)$350k – $850k Yearly
5dAN
Research Engineer, Frontier Red Team (Hardware Lead)
Anthropic
San Francisco, California, United States (Hybrid)$850k – $850k Yearly
2wOP
Security Researcher, Trusted Computing and Cryptography
OpenAI
United States or Remote (United States)$324k – $490k Yearly
5dAN
Research Engineer, Frontier Red Team (Autonomy)
Anthropic
San Francisco, California, United States (Hybrid)$350k – $850k Yearly
5dAN
[Expression of Interest] Research Manager, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$340k – $425k Yearly
2dAN