1. Home
  2. Jobs
  3. Adversarial Training

Adversarial Training Jobs

Explore Adversarial Training roles on Inference Jobs and apply today.

3mo agoOP

Researcher, Robustness & Safety Training

OpenAI

San Francisco, California, United States (On-site)$310K – $460K Yearly
4w agoSC

Research Scientist, Agent Robustness

Scale

San Francisco, California, United States (On-site)$197.4K – $246.8K Yearly
2mo agoOP

Researcher, Automated Red Teaming

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
2w agoAN

Machine Learning Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
3mo agoOP

Researcher, Misalignment Research

OpenAI

New York, United States or Remote (New York, United States)$380K – $460K Yearly
1mo agoOP

Researcher, Loss of Control

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
3mo agoOP

Technical Lead, Safety Research

OpenAI

San Francisco, California, United States (Hybrid)$460K – $555K Yearly
3mo agoRA

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
3w agoSC

Research Scientist, Frontier Risk Evaluations

Scale

San Francisco, California, United States (On-site)$197.4K – $246.8K Yearly
1mo agoOP

Data Scientist, Preparedness

OpenAI

San Francisco, California, United States (On-site)$347K – $400K Yearly
1mo agoOP

Threat Modeler Lead

OpenAI

San Francisco, California, United States (On-site)$325K – $325K Yearly
3mo agoAN

Technical Policy Manager, Cyber Harms

Anthropic

San Francisco, California, United States (Hybrid)$320K – $405K Yearly
3mo agoOP

Researcher, Trustworthy AI

OpenAI

San Francisco, California, United States (On-site)$380K – $380K Yearly