Interpretability Jobs in San Francisco, California, United States

Discover Interpretability roles in San Francisco, California, United States on Inference Jobs and apply today.

3mo ago

Researcher, Interpretability

OpenAI

San Francisco, California, United States (On-site) $310K – $460K Yearly
2w ago

Research Scientist, Interpretability

Anthropic

San Francisco, California, United States (Hybrid) $350K – $850K Yearly
2w ago

Research Engineer, Interpretability

Anthropic

San Francisco, California, United States (Hybrid) $315K – $560K Yearly
2w ago

Machine Learning Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid) $350K – $500K Yearly
2w ago

Senior Research Scientist, Reward Models

Anthropic

San Francisco, California, United States (Hybrid) $350K – $500K Yearly
3mo ago

Research Scientist, Societal Impacts

Anthropic

San Francisco, California, United States (Hybrid) $350K – $850K Yearly
3mo ago

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
4w ago

Research Scientist, AI Controls and Monitoring

Scale

San Francisco, California, United States (On-site) $197.4K – $246.8K Yearly
5d ago

Anthropic Fellows Program — AI Safety

Anthropic

United States + 2 more (Remote) $3.9K Weekly
3mo ago

Inference Engineer

Cartesia

San Francisco, California, United States (On-site) $180K – $250K Yearly
3mo ago

Deployment Strategist

HappyRobot

San Francisco, California, United States (On-site) $150K – $200K Yearly
3mo ago

Researcher, Training

OpenAI

San Francisco, California, United States (Hybrid) $360K – $440K Yearly