AI Safety jobs
Explore AI Safety roles on Inference Jobs and apply today.
41-60 of 1,658 jobs
Research Engineer, Model Evaluations
Anthropic
San Francisco, California, United States (Hybrid)
$300k – $405k Yearly
Engagement Manager, Applied AI
Anthropic
San Francisco, California, United States (Hybrid)
$200k – $300k Yearly
Research Engineer, Frontier Red Team (Autonomy)
Anthropic
San Francisco, California, United States (Hybrid)
$350k – $850k Yearly
ML Infrastructure Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid)
$320k – $405k Yearly
Research Engineer, Production Model Post Training
Anthropic
San Francisco, California, United States (Hybrid)
$315k – $340k Yearly
Research Engineer, Frontier Evals & Environments
OpenAI
San Francisco, California, United States (On-site)
$200k – $370k Yearly
Research Engineer, Production Model Post-Training - London
Anthropic
London, England, United Kingdom (Hybrid)
£270k – £340k Yearly
Data Scientist, Integrity Measurement
OpenAI
San Francisco, California, United States (On-site)
$293k – $385k Yearly
Research Engineer, Privacy
OpenAI
San Francisco, California, United States (On-site)
$380k – $460k Yearly
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)
$425k – $560k Yearly
Developer Experience Engineer
OpenAI
San Francisco, California, United States (On-site)
$280k – $345k Yearly
Technical Program Manager, Safeguards – Infrastructure & Evals
Anthropic
San Francisco, California, United States (Hybrid)
$290k – $365k Yearly
Technical Program Manager, Research
Anthropic
San Francisco, California, United States (Hybrid)
$290k – $365k Yearly
Software Engineer, Functional Safety
Tenstorrent
Santa Clara, California, United States (Hybrid)
$100k – $500k Yearly
Research Engineer / Scientist, Societal Impacts
Anthropic
San Francisco, California, United States (Hybrid)
$350k – $500k Yearly
Engineering Manager, ML Acceleration
Anthropic
San Francisco, California, United States (Hybrid)
$425k – $560k Yearly
Technical CBRN-E Threat Investigator
Anthropic
San Francisco, California, United States (Hybrid)
$230k – $290k Yearly
Research Product Manager, Model Behaviors
Anthropic
San Francisco, California, United States (Hybrid)
$305k – $385k Yearly
Principal Hardware Functional Safety Expert
NVIDIA
Santa Clara, California, United States (Hybrid)
$272k – $431.3k Yearly