LLM-as-a-judge systems jobs
Explore LLM-as-a-judge systems roles on Inference Jobs and apply today.
Strategic Projects Lead, Enterprise Evaluations
Scale
San Francisco, California, United States (On-site)
$198k – $247.5k Yearly
Research Engineer / Scientist, Alignment Science, London
Anthropic
London, England, United Kingdom (Hybrid)
£250k – £270k Yearly
Model Policy Manager, Chemical & Biological Risk
OpenAI
San Francisco, California, United States (Hybrid)
$207k – $295k Yearly
Research Engineer, AI Observability
Anthropic
San Francisco, California, United States (Hybrid)
$320k – $405k Yearly
Member of Engineering (Pre-training / Data)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
Applied Research Engineer
Labelbox
San Francisco, California, United States (Hybrid)
$250k – $300k Yearly
Applied Legal Researcher
Harvey
San Francisco, California, United States (On-site)
$180k – $220k Yearly
Policy Manager, Harmful Persuasion
Anthropic
San Francisco, California, United States (Hybrid)
$245k – $330k Yearly
Member of Technical Staff - Reasoning Efficiency
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
Member of Technical Staff - Evaluations
Reflection AI
San Francisco, California, United States (On-site)
Applied Research Intern
Labelbox
San Francisco, California, United States (Hybrid)
$35 – $45 Hourly
Machine Learning Research Scientist / Research Engineer, Post-Training
Scale
San Francisco, California, United States (On-site)
$252k – $315k Yearly
Member of Technical Staff, Pretraining evaluations
Cohere
London, England, United Kingdom or Remote (Worldwide)
Agentic AI Solution Engineering Intern - Summer 2026
NVIDIA
Austin, Texas, United States (On-site)
$20 – $71 Hourly
Research Engineer / Scientist, Alignment Science
Anthropic
San Francisco, California, United States (Hybrid)
$315k – $340k Yearly