AI Safety jobs in California, United States

Discover AI Safety roles in California, United States on Inference Jobs and apply today.

21-40 of 691 jobs

OP2w

Research Engineer, Frontier Evals & Environments

OpenAI

San Francisco, California, United States (On-site)

$200k – $370k Yearly

OP2w

Research Engineer, Privacy

OpenAI

San Francisco, California, United States (On-site)

$380k – $460k Yearly

OP2w

Developer Experience Engineer

OpenAI

San Francisco, California, United States (On-site)

$280k – $345k Yearly

AN3w

Technical Program Manager, Safeguards – Infrastructure & Evals

Anthropic

San Francisco, California, United States (Hybrid)

$290k – $365k Yearly

AN2w

Technical Program Manager, Research

Anthropic

San Francisco, California, United States (Hybrid)

$290k – $365k Yearly

AN2w

Research Engineer / Scientist, Societal Impacts

Anthropic

San Francisco, California, United States (Hybrid)

$350k – $500k Yearly

AN4w

Research Product Manager, Model Behaviors

Anthropic

San Francisco, California, United States (Hybrid)

$305k – $385k Yearly

OP2w

Product Manager, API Model Behavior

OpenAI

San Francisco, California, United States (On-site)

$325k – $405k Yearly

AN2w

Product Management, Research

Anthropic

San Francisco, California, United States (Hybrid)

$305k – $385k Yearly

OP2w

Model Policy Manager, Chemical & Biological Risk

OpenAI

San Francisco, California, United States (Hybrid)

$207k – $295k Yearly

SC4w

Engineering Manager, AgentOps

Scale

San Francisco, California, United States (Hybrid)

$216.2k – $270.3k Yearly

SC4w

Communications Manager, Corporate & Product

Scale

San Francisco, California, United States (On-site)

$151.2k – $189k Yearly

D-2w

AI Security Architect, Principal

d-Matrix

Santa Clara, California, United States or Remote (United States)

$220k – $300k Yearly

OP2w

Security Researcher, Trusted Computing and Cryptography

OpenAI

United States or Remote (United States)

$324k – $490k Yearly

OP2w

Software Engineer, Ads Integrity

OpenAI

San Francisco, California, United States (On-site)

$200k – $370k Yearly

AN3w

Forward Deployed Engineer, Custom Agents

Anthropic

San Francisco, California, United States (Hybrid)

$280k – $400k Yearly

AN3w

Policy Manager, Harmful Persuasion

Anthropic

San Francisco, California, United States (Hybrid)

$245k – $330k Yearly

OP3w

Product Policy –Policy Manager (Child Safety)

OpenAI

San Francisco, California, United States (Hybrid)

$261k – $290k Yearly

OP2w

Technical Program Manager – Adversarial Model Research

OpenAI

San Francisco, California, United States (Hybrid)

$230k – $285k Yearly

AN4w

Prompt Engineer, Agent Prompts & Evals

Anthropic

San Francisco, California, United States (Hybrid)

$320k – $405k Yearly