1. Home
  2. Jobs
  3. Safety Mechanisms

Safety Mechanisms Jobs

Browse 283 Safety Mechanisms jobs on Inference Jobs.

41-60 of 283 jobs

2wOP

Researcher, Misalignment Research

OpenAI

New York, New York, United States or Remote (New York, United States)$380k – $460k Yearly
6dAN

Research Engineer / Scientist, Alignment Science, London

Anthropic

London, England, United Kingdom (Hybrid)£250k – £270k Yearly
3dXA

Security Specialist

xAI

Memphis, Tennessee, United States (On-site)
5dAN

Research Engineer / Scientist, Frontier Red Team (Cyber)

Anthropic

San Francisco, California, United States (Hybrid)$350k – $850k Yearly
2wOP

TLM, Machine Learning, Integrity

OpenAI

San Francisco, California, United States (On-site)$405k – $490k Yearly
6dAN

Research Engineer, Model Evaluations

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
6dAN

Research Engineer / Scientist, Alignment Science

Anthropic

San Francisco, California, United States (Hybrid)$315k – $340k Yearly
4wCR

Data Centre Construction Safety Manager

Crusoe

Abilene, Texas, United States (On-site)$135k – $170k Yearly
2wOP

Trust & Safety Operations Analyst

OpenAI

San Francisco, California, United States (Hybrid)$210k – $280k Yearly
2wOP

Research Engineer, Privacy

OpenAI

San Francisco, California, United States (On-site)$380k – $460k Yearly
3wAN

Safeguards Analyst, Account Abuse

Anthropic

San Francisco, California, United States (Hybrid)$230k – $310k Yearly
4dOP

Researcher, Automated Red Teaming

OpenAI

San Francisco, California, United States (On-site)$295k – $445k Yearly
2wOP

Research Engineer, Frontier Evals & Environments

OpenAI

San Francisco, California, United States (On-site)$200k – $370k Yearly
3wAN

Research Engineer – Cybersecurity RL

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
5dAN

Staff Red Team Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
2wOP

Technical Program Manager – Adversarial Model Research

OpenAI

San Francisco, California, United States (Hybrid)$230k – $285k Yearly
4wAI

Head of Flight Test and Safety

Applied Intuition

Washington, District of Columbia, United States (On-site)$180k – $230k Yearly
3wAN

Technical Program Manager, Safeguards – Infrastructure & Evals

Anthropic

San Francisco, California, United States (Hybrid)$290k – $365k Yearly