AI Evaluation jobs in San Francisco, California, United States

Discover AI Evaluation roles in San Francisco, California, United States on Inference Jobs and apply today.

41-60 of 657 jobs

SC2w

Machine Learning Fellow - Human Frontier Collective (US)

Scale

United States (Remote)

SC4w

Engineering Manager, AgentOps

Scale

San Francisco, California, United States (Hybrid)

$216.2k – $270.3k Yearly

DE2w

Senior Software Engineer, Agent Orchestration

Decagon

San Francisco, California, United States (On-site)

$250k – $330k Yearly

XA4w

AI Finance Tutor - Portfolio Management

xAI

United States (Remote)

DE2w

Engineering Manager, Agent Orchestration

Decagon

San Francisco, California, United States (On-site)

$300k – $415k Yearly

CA2w

Researcher, Evals

Cartesia

San Francisco, California, United States (On-site)

$220k – $350k Yearly

XA4w

AI Finance Tutor - Sell-Side

xAI

United States (Remote)

XA2w

AI Tutor - Video Specialist

xAI

Worldwide (Remote)

$93.6k – $156k Yearly

OP2w

Product Manager, API Model Behavior

OpenAI

San Francisco, California, United States (On-site)

$325k – $405k Yearly

OP2w

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)

$255k – $405k Yearly

PE2w

AI Research Lead

Perplexity

San Francisco, California, United States (On-site)

$300k – $470k Yearly

OP2w

TLM, Machine Learning, Integrity

OpenAI

San Francisco, California, United States (On-site)

$405k – $490k Yearly

OP2w

Research Engineer, Frontier Evals & Environments - Finance

OpenAI

San Francisco, California, United States (On-site)

$200k – $370k Yearly

XA1w

AI Economics Tutor

xAI

United States (Remote)

$45 – $100 Hourly

XA4w

AI Finance Tutor - Quantitative Finance

xAI

United States (Remote)

$45 – $100 Hourly

XA2w

AI Tutor - Image Specialist

xAI

Worldwide (Remote)

OP2w

Model Policy Manager, Chemical & Biological Risk

OpenAI

San Francisco, California, United States (Hybrid)

$207k – $295k Yearly

DE2w

Senior Research Engineer

Decagon

San Francisco, California, United States (On-site)

£200k – £300k Yearly

OP2w

Technical Program Manager – Adversarial Model Research

OpenAI

San Francisco, California, United States (Hybrid)

$230k – $285k Yearly

XA1w

Model Behavior Tutor - Epistemic Rigor & Truthfulness

xAI

United States (Remote)

$50 – $70 Hourly