1. Home
  2. Jobs
  3. Safety Mechanisms

Safety Mechanisms Jobs

Browse 283 Safety Mechanisms jobs on Inference Jobs.

61-80 of 283 jobs

5dAN

[Expression of Interest] Research Scientist/Engineer, Honesty

Anthropic

New York, New York, United States (Hybrid)$315k – $340k Yearly
1wAI

Verification and Validation - Lead

Applied Intuition

Sunnyvale, California, United States (On-site)$200k – $320k Yearly
6dOP

Data Scientist, Integrity Measurement

OpenAI

San Francisco, California, United States (On-site)$293k – $385k Yearly
6dOP

Researcher, Frontier Cybersecurity Risks

OpenAI

San Francisco, California, United States (On-site)$295k – $445k Yearly
6dAN

Research Engineer, Production Model Post Training

Anthropic

San Francisco, California, United States (Hybrid)$315k – $340k Yearly
2wOP

Model Policy Manager, Chemical & Biological Risk

OpenAI

San Francisco, California, United States (Hybrid)$207k – $295k Yearly
6dAN

Technical CBRN-E Threat Investigator

Anthropic

San Francisco, California, United States (Hybrid)$230k – $290k Yearly
6dAN

Technical Policy Manager, Cyber Harms

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
5dAN

Engineering Manager, Cloud Security

Anthropic

San Francisco, California, United States (Hybrid)$405k – $405k Yearly
4dAN

Software Engineer, Account Abuse

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wOP

Software Engineer, Applied Foundations - London

OpenAI

London, England, United Kingdom (On-site)$200k – $370k Yearly
2wOP

Software Engineer, Youth Well-Being

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
3dAI

Software Architect - Fallback Stack

Applied Intuition

Sunnyvale, California, United States (On-site)$145k – $245k Yearly
2wOP

Software Engineer, Ads Integrity

OpenAI

San Francisco, California, United States (On-site)$200k – $370k Yearly
6dAN

Privacy Research Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$320k – $485k Yearly
3wAN

Policy Manager, Harmful Persuasion

Anthropic

San Francisco, California, United States (Hybrid)$245k – $330k Yearly
4wCO

EHS Global Audit Manager

CoreWeave

Livingston, New Jersey, United States (Hybrid)$122k – $165k Yearly
2wAI

Scenario Engineer

Applied Intuition

Sunnyvale, California, United States (On-site)$87k – $130k Yearly
4dAN

Offensive Security Research Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wOP

Security Researcher, Trusted Computing and Cryptography

OpenAI

United States or Remote (United States)$324k – $490k Yearly