1. Home
  2. Companies
  3. Braintrust
BR

Braintrust

Braintrust builds an AI observability platform for measuring, evaluating, and improving AI systems in production. The platform integrates LLM evaluation into standard engineering workflows, serving companies including Notion, Stripe, Zapier, Vercel, and Ramp. The system enables teams to iterate on AI applications through real-time data pipelines that convert production data into evaluation feedback, with interfaces designed for both engineering iteration and product prototyping.

The technical architecture centers on evaluation tooling that supports daily feature deployment cadence. The platform provides UI-based prototyping for non-engineers and real-time review workflows for cross-functional teams. Core infrastructure runs on Go, Python, and Node.js, with Postgres and Redis for data persistence and caching, deployed on AWS via Terraform and Docker.

The team operates as a small group focused on developer tooling problems: building data pipelines for production AI systems, creating evaluation interfaces for LLM performance measurement, and developing workflows that reduce latency in feedback loops. Technical domains span AI development, model evaluation frameworks, real-time data infrastructure, and engineering workflow optimization.

Open roles at Braintrust

Explore 44 open positions at Braintrust and find your next opportunity.

BR2w

Recruiting Coordinator

Braintrust

San Francisco, California, United States (On-site)

BR2w

Commercial Counsel

Braintrust

San Francisco, California, United States (On-site)

BR3w

Regional Sales Director, Commercial

Braintrust

New York, New York, United States (On-site)

BR3w

Software Engineer, Growth

Braintrust

San Francisco, California, United States (On-site)