AI Safety Jobs in San Francisco, California, United States

Discover AI Safety roles in San Francisco, California, United States on Inference Jobs and apply today.

3mo agoOP

TLM, Machine Learning, Integrity

OpenAI

San Francisco, California, United States (On-site)$405K – $490K Yearly
2mo agoOP

Researcher, Automated Red Teaming

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
7d agoAN

Safeguards Policy Analyst, Fraud & Scams

Anthropic

San Francisco, California, United States (Hybrid)$245K – $285K Yearly
3mo agoAN

Research Engineer / Scientist, Frontier Red Team (Cyber)

Anthropic

San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2w agoAN

Research Engineer – Cybersecurity RL

Anthropic

San Francisco, California, United States (Hybrid)$300K – $405K Yearly
3mo agoAN

Technical CBRN-E Threat Investigator

Anthropic

San Francisco, California, United States (Hybrid)$230K – $290K Yearly
2mo agoAN

Safeguards Analyst, Account Abuse

Anthropic

San Francisco, California, United States (Hybrid)$230K – $310K Yearly
2w agoAN

Senior Research Scientist, Reward Models

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
3mo agoOP
4w agoPE

Member of Technical Staff - Secure Intelligence Institute

Perplexity

San Francisco, California, United States (On-site)$220K – $405K Yearly
3mo agoAN

Technical Policy Manager, Cyber Harms

Anthropic

San Francisco, California, United States (Hybrid)$320K – $405K Yearly
2mo agoOP

Researcher, Frontier Cybersecurity Risks

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
4w agoAN

Vendor and Contract Manager, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$245K – $285K Yearly
3mo agoAN

Research Engineer / Scientist, Societal Impacts

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
2w agoAN

Machine Learning Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
1mo agoAN

Technical Influence Operations Threat Investigator

Anthropic

San Francisco, California, United States (Hybrid)$230K – $290K Yearly
3mo agoOP

Fullstack Engineer, Safety Engineering

OpenAI

San Francisco, California, United States (On-site)$210K – $325K Yearly