Safety Mechanisms Jobs
Browse 283 Safety Mechanisms jobs on Inference Jobs.
61-80 of 283 jobs
5dAN
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)$315k – $340k Yearly
1wAI
Verification and Validation - Lead
Applied Intuition
Sunnyvale, California, United States (On-site)$200k – $320k Yearly
6dOP
Data Scientist, Integrity Measurement
OpenAI
San Francisco, California, United States (On-site)$293k – $385k Yearly
6dOP
Researcher, Frontier Cybersecurity Risks
OpenAI
San Francisco, California, United States (On-site)$295k – $445k Yearly
6dAN
Research Engineer, Production Model Post Training
Anthropic
San Francisco, California, United States (Hybrid)$315k – $340k Yearly
2wOP
Model Policy Manager, Chemical & Biological Risk
OpenAI
San Francisco, California, United States (Hybrid)$207k – $295k Yearly
6dAN
Technical CBRN-E Threat Investigator
Anthropic
San Francisco, California, United States (Hybrid)$230k – $290k Yearly
6dAN
Technical Policy Manager, Cyber Harms
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
5dAN
Engineering Manager, Cloud Security
Anthropic
San Francisco, California, United States (Hybrid)$405k – $405k Yearly
4dAN
Software Engineer, Account Abuse
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wOP
Software Engineer, Applied Foundations - London
OpenAI
London, England, United Kingdom (On-site)$200k – $370k Yearly
2wOP
Software Engineer, Youth Well-Being
OpenAI
San Francisco, California, United States (On-site)$255k – $405k Yearly
3dAI
Software Architect - Fallback Stack
Applied Intuition
Sunnyvale, California, United States (On-site)$145k – $245k Yearly
2wOP
Software Engineer, Ads Integrity
OpenAI
San Francisco, California, United States (On-site)$200k – $370k Yearly
6dAN
Privacy Research Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid)$320k – $485k Yearly
3wAN
Policy Manager, Harmful Persuasion
Anthropic
San Francisco, California, United States (Hybrid)$245k – $330k Yearly
4wCO
2wAI
4dAN
Offensive Security Research Engineer, Safeguards
Anthropic
San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wOP
Security Researcher, Trusted Computing and Cryptography
OpenAI
United States or Remote (United States)$324k – $490k Yearly