1. Home
  2. Jobs
  3. Mechanistic Interpretability

Mechanistic Interpretability Jobs

Browse 13 Mechanistic Interpretability jobs on Inference Jobs.

13 jobs

5dAN

[Expression of Interest] Research Manager, Interpretability

Anthropic

San Francisco, California, United States (Hybrid)$340k – $425k Yearly
1wOP

Researcher, Interpretability

OpenAI

San Francisco, California, United States (On-site)$310k – $460k Yearly
5dAN

Research Scientist, Interpretability

Anthropic

San Francisco, California, United States (Hybrid)$315k – $560k Yearly
5dAN

Research Engineer, Interpretability

Anthropic

San Francisco, California, United States (Hybrid)$315k – $560k Yearly
3dXA

Member of Technical Staff, Interpretability

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dAN

Research Engineer / Scientist, Alignment Science, London

Anthropic

London, England, United Kingdom (Hybrid)£250k – £270k Yearly
3wAN

Research Scientist, Societal Impacts

Anthropic

San Francisco, California, United States (Hybrid)$350k – $850k Yearly
2wRA

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
2wRA

Member of Technical Staff - Evaluations

Reflection AI

San Francisco, California, United States (On-site)
2wPE

Model Behavior Architect

Perplexity

San Francisco, California, United States (On-site)$180k – $260k Yearly
5dAN

[Expression of Interest] Research Scientist/Engineer, Honesty

Anthropic

New York, New York, United States (Hybrid)$315k – $340k Yearly
4wSC

AI Product Manager, Insights

Scale

San Francisco, California, United States (On-site)$206.8k – $258.5k Yearly