1. Home
  2. Jobs
  3. Adversarial Evaluations

Adversarial Evaluations Jobs

Browse 23 Adversarial Evaluations jobs on Inference Jobs.

23 jobs
4d agoOpenAI logoOP

Researcher, Misalignment Research

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
2w agoScale logoSC

Research Scientist, Frontier Risk Evaluations

Scale

San Francisco, California, United States (On-site)$197.4K – $246.8K Yearly
2w agoScale logoSC

Research Scientist, Agent Robustness

Scale

San Francisco, California, United States (On-site)$197.4K – $246.8K Yearly
2d agoReflection AI logoRA

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
2w agoOpenAI logoOP

Data Scientist, Preparedness

OpenAI

San Francisco, California, United States (On-site)$347K – $400K Yearly
5d agoOpenAI logoOP

Researcher, Robustness & Safety Training

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
2w agoScale logoSC

Research Scientist, Safety Post Training

Scale

San Francisco, California, United States (On-site)$216K – $270K Yearly
2w agoOpenAI logoOP

Researcher, Automated Red Teaming

OpenAI

San Francisco, California, United States (On-site)$295K – $445K Yearly
2w agoOpenAI logoOP

Threat Modeler Lead

OpenAI

San Francisco, California, United States (On-site)$325K – $325K Yearly
2w agoScale logoSC

Machine Learning Engineer - Model Evaluations, Public Sector

Scale

San Francisco, California, United States (On-site)$216.3K – $300.3K Yearly
2w agoAnthropic logoAN

Research Engineer, Safeguards Labs

Anthropic

San Francisco, California, United States (Hybrid)$350K – $850K Yearly
5d agoOpenAI logoOP

Researcher, Trustworthy AI

OpenAI

San Francisco, California, United States (On-site)$380K – $380K Yearly
2w agoAnthropic logoAN

Machine Learning Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
2w agoNVIDIA logoNV

Senior Deep Learning Engineer - Model Evaluation & AI Systems

NVIDIA

Santa Clara, California, United States (On-site)$224K – $431.3K Yearly
2w agoMistral AI logoMA
6d agoGoogle DeepMind logoGD

Research Scientist, Gemini Safety

Google DeepMind

Mountain View, California, United States (On-site)
4d agoOpenAI logoOP

Product Manager, Cyber Safety

OpenAI

San Francisco, California, United States (On-site)$293K – $325K Yearly
Subscribe to this search

Get email updates when new jobs match this search.