Evaluation Research jobs

Explore Evaluation Research roles on Inference Jobs and apply today.

181-200 of 517 jobs

2w

Staff Research Engineer

Decagon

San Francisco, California, United States (On-site)

$350k – $475k Yearly

2w

Community ML Research Engineer, non-AI scientific fields - US Remote

Hugging Face

New York, New York, United States or Remote (United States)

4w

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

2w

Software Engineer - Data Flywheel

Perplexity

London, England, United Kingdom (On-site)

$210k – $385k Yearly

2w

Expression of Interest (UK)

Heidi

London, England, United Kingdom or Remote (United Kingdom)

2w

Staff Research Engineer, Voice

Decagon

San Francisco, California, United States (On-site)

$350k – $475k Yearly

2w

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)

$255k – $405k Yearly

2w

STEM Fellow - Human Frontier Collective

Scale

United States (Remote)

1w

AI Architect

Scale

San Francisco, California, United States (On-site)

$190k – $230k Yearly

4w

AI Finance Tutor - Financial Analyst

xAI

United States (Remote)

6d

Research Communications Manager

OpenAI

San Francisco, California, United States (Hybrid)

$185k – $205k Yearly

4d

Visiting Faculty

Scale

United States (On-site)

2w

General Evaluator, HCP Sales

Hippocratic AI

United States or Remote (United States)

2w

Human Frontier Collective Fellow - GenAI (Remote)

Scale

United States (Remote)

2w

Data Scientist/Engineer – Online Metrics

Perplexity

London, England, United Kingdom (On-site)

4w

Communications Manager, Corporate & Product

Scale

San Francisco, California, United States (On-site)

$151.2k – $189k Yearly

2w

General Evaluator, Patient Services / Access Specialist (Non-Clinical)

Hippocratic AI

United States or Remote (United States)

2w

Technical Program Manager, Quality

Sesame

San Francisco, California, United States (On-site)

$200k – $260k Yearly

4w

Member of Technical Staff - RL Infrastructure [data, evals, agent]

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

2w

Engineering Manager, AI Quality

Harvey

New York, New York, United States (On-site)

$260k – $330k Yearly