Mechanistic Interpretability Jobs
Browse 13 Mechanistic Interpretability jobs on Inference Jobs.
13 jobs
5dAN
[Expression of Interest] Research Manager, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$340k – $425k Yearly
1wOP
Researcher, Interpretability
OpenAI
San Francisco, California, United States (On-site)$310k – $460k Yearly
5dAN
Research Scientist, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$315k – $560k Yearly
5dAN
Research Engineer, Interpretability
Anthropic
San Francisco, California, United States (Hybrid)$315k – $560k Yearly
3dXA
Member of Technical Staff, Interpretability
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dAN
Research Engineer / Scientist, Alignment Science, London
Anthropic
London, England, United Kingdom (Hybrid)£250k – £270k Yearly
3wAN
Research Scientist, Societal Impacts
Anthropic
San Francisco, California, United States (Hybrid)$350k – $850k Yearly
2wRA
Member of Technical Staff - Safety Lead
Reflection AI
San Francisco, California, United States (On-site)
2wRA
Member of Technical Staff - Evaluations
Reflection AI
San Francisco, California, United States (On-site)
2wPE
Model Behavior Architect
Perplexity
San Francisco, California, United States (On-site)$180k – $260k Yearly
5dAN
[Expression of Interest] Research Scientist/Engineer, Honesty
Anthropic
New York, New York, United States (Hybrid)$315k – $340k Yearly
4wSC
AI Product Manager, Insights
Scale
San Francisco, California, United States (On-site)$206.8k – $258.5k Yearly