AI Safety Research Jobs
Explore AI Safety Research roles on Inference Jobs and apply today.
2w agoAN
[Expression of Interest] Research Manager, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$350K – $500K Yearly
3mo agoOP
Researcher, Interpretability
OpenAI
San Francisco, California, United States (On-site)$310K – $460K Yearly
2mo agoAN
Research Engineer, AI Observability
Anthropic
San Francisco, California, United States (Hybrid)$320K – $405K Yearly
2w agoAN
Research Engineer / Scientist, Alignment Science, London
Anthropic
London, England, United Kingdom (Hybrid)£260K – £370K Yearly
2w agoAN
Research Engineer / Scientist, Alignment Science
Anthropic
San Francisco, California, United States (Hybrid)$350K – $500K Yearly
3mo agoAN
Research Scientist, Societal Impacts
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2mo agoOP
Model Policy Manager, Chemical & Biological Risk
OpenAI
San Francisco, California, US$207K – $295K Yearly
3mo agoOP
Technical Program Manager – Adversarial Model Research
OpenAI
San Francisco, California, US$230K – $285K Yearly
2w agoAN
Research Scientist, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
3mo agoAN
Research Product Manager, Model Behaviors
Anthropic
San Francisco, California, United States (Hybrid)$305K – $385K Yearly
3mo agoAN
Research Engineer / Scientist, Frontier Red Team (Cyber)
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
1mo agoAN
Policy Manager, Chemical Weapons and High Yield Explosives
Anthropic
San Francisco, California, United States (Hybrid)$245K – $285K Yearly
3mo agoAN
Research Engineer, Frontier Red Team (Autonomy)
Anthropic
San Francisco, California, United States (Hybrid)$350K – $850K Yearly
1mo agoNV
Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$168K – $264.5K Yearly
2w agoAN
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)$350K – $500K Yearly
2mo agoOP
Researcher, Automated Red Teaming
OpenAI
San Francisco, California, United States (On-site)$295K – $445K Yearly
2w agoAN
Research Engineer – Cybersecurity RL
Anthropic
San Francisco, California, United States (Hybrid)$300K – $405K Yearly
2mo agoAN
Research Engineer / Research Scientist, Tokens
Anthropic
New York, New York, United States (Hybrid)$350K – $500K Yearly
2w agoAN
Senior Research Scientist, Reward Models
Anthropic
San Francisco, California, United States (Hybrid)$350K – $500K Yearly
3mo agoOP
Research Engineer, Frontier Evals & Environments
OpenAI
San Francisco, California, United States (On-site)$200K – $370K Yearly