1. Home
  2. Jobs
  3. Interpretability

Interpretability Jobs

Explore Interpretability roles on Inference Jobs and apply today.

3mo agoOP

Researcher, Interpretability

OpenAI

San Francisco, California, United States (On-site)$310K – $460K Yearly
2w agoAN
2w agoAN

Research Scientist, Interpretability

Anthropic

San Francisco, California, United States (Hybrid)$350K – $850K Yearly
2w agoAN

Research Engineer, Interpretability

Anthropic

San Francisco, California, United States (Hybrid)$315K – $560K Yearly
2w agoAN

Machine Learning Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
2w agoAN

Senior Research Scientist, Reward Models

Anthropic

San Francisco, California, United States (Hybrid)$350K – $500K Yearly
2w agoAN
4w agoAN
3mo agoAN

Research Scientist, Societal Impacts

Anthropic

San Francisco, California, United States (Hybrid)$350K – $850K Yearly
3mo agoRA

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
4w agoSC

Research Scientist, AI Controls and Monitoring

Scale

San Francisco, California, United States (On-site)$197.4K – $246.8K Yearly
4w agoBA
5d agoAN

Anthropic Fellows Program — AI Safety

Anthropic

United States + 2 more (Remote)$3.9K – $3.9K Weekly
2mo agoAN
4w agoCR

Integration Lead

Crusoe

Brighton, Colorado, United States (On-site)$41 – $46 Hourly
1w agoPR
2mo agoOP
1w agoPR