1. Home
  2. AI Companies
  3. Braintrust
BR

Braintrust

About

Braintrust builds an AI observability platform for measuring, evaluating, and improving AI systems in production. The platform integrates LLM evaluation into standard engineering workflows, serving companies including Notion, Stripe, Zapier, Vercel, and Ramp. The system enables teams to iterate on AI applications through real-time data pipelines that convert production data into evaluation feedback, with interfaces designed for both engineering iteration and product prototyping.

The technical architecture centers on evaluation tooling that supports daily feature deployment cadence. The platform provides UI-based prototyping for non-engineers and real-time review workflows for cross-functional teams. Core infrastructure runs on Go, Python, and Node.js, with Postgres and Redis for data persistence and caching, deployed on AWS via Terraform and Docker.

The team operates as a small group focused on developer tooling problems: building data pipelines for production AI systems, creating evaluation interfaces for LLM performance measurement, and developing workflows that reduce latency in feedback loops. Technical domains span AI development, model evaluation frameworks, real-time data infrastructure, and engineering workflow optimization.

Open roles at Braintrust

Explore 37 open positions at Braintrust and find your next opportunity.

BR

Eval Engineer

Braintrust

San Francisco, California, United States (On-site)

3w ago
BR

Solutions Engineer (East Region)

Braintrust

New York, New York, United States (On-site)

3w ago
BR

Solutions Engineer (Central Region)

Braintrust

Texas, United States + 2 more (Remote)

3w ago
BR

Documentation Engineer

Braintrust

San Francisco, California, United States (On-site)

1mo ago
BR

Brand Lead

Braintrust

San Francisco, California, United States (On-site)

2mo ago
BR

Commercial Counsel

Braintrust

San Francisco, California, United States (On-site)

2mo ago
BR

Regional Sales Director, Commercial

Braintrust

New York, New York, United States (On-site)

2mo ago
BR

Regional Sales Director, Enterprise

Braintrust

San Francisco, California, United States (On-site)

2mo ago
BR

Regional Sales Director, Commercial

Braintrust

San Francisco, California, United States (On-site)

2mo ago
BR

Recruiting Coordinator

Braintrust

San Francisco, California, United States (On-site)

2mo ago
BR

Open Source Engineer - Python

Braintrust

San Francisco, California, United States or Remote (California, United States + 2 more)

3mo ago
BR

Account Executive, Digital Natives (East Region)

Braintrust

New York, New York, United States (On-site)

3mo ago
BR

Cloud Infrastructure Engineer

Braintrust

San Francisco, California, United States (On-site)

3mo ago
BR

Design Engineer

Braintrust

San Francisco, California, United States (On-site)

3mo ago
BR

Account Executive, EMEA

Braintrust

London, England, United Kingdom (On-site)

3mo ago
BR

Strategic Account Executive (Central Region)

Braintrust

Denver, Colorado, United States or Remote (Colorado, United States + 2 more)

3mo ago
BR

Sales Development Representative

Braintrust

San Francisco, California, United States (On-site)

3mo ago
BR

Field Marketing Manager

Braintrust

San Francisco, California, United States (On-site)

3mo ago
BR

GTM Sourcer

Braintrust

San Francisco, California, United States (Hybrid)

3mo ago
BR

Open Source Engineer - Go

Braintrust

San Francisco, California, United States or Remote (United States)

3mo ago

Similar companies

TM

Thinking Machines Lab

Thinking Machines Lab is a 2025-founded AI research and product company led by Mira Murati, former CTO of OpenAI. The organization addresses a concentration problem: training methods for frontier AI systems have remained largely confined to top labs, constraining public understanding and limiting users' ability to customize systems to specific needs. The team - comprising scientists and engineers who previously built ChatGPT, Character.ai, and contributed to PyTorch - focuses on making AI systems more widely understood, customizable, and generally capable through open science publications and code releases. The company's technical work centers on multimodal systems designed to adapt across the full spectrum of human expertise, with an explicit architectural preference for human–AI collaboration over full autonomy. Their stack includes Python, Rust, PyTorch, React/TypeScript, Kubernetes, and Spark. Development priorities span training and analysis of frontier models, multimodal system design, and foundational ML framework work - reflecting the team's prior experience building widely-deployed products and infrastructure. The operational model emphasizes open science: research findings and implementations are released publicly rather than held proprietary. This approach targets both the customizability bottleneck - where users cannot effectively tune systems to domain-specific requirements - and the knowledge distribution problem that limits informed discourse about frontier model development. Product outputs include multimodal systems and published research artifacts alongside the methodological contributions inherent in their open release practice.

58 jobs
MO

Modal

Modal operates a serverless compute platform designed to minimize infrastructure friction for ML inference, fine-tuning, and batch workloads. The platform provides instant GPU access with usage-based pricing, targeting teams that need to ship compute-intensive applications without managing scheduling, container orchestration, or resource allocation. The architecture is built on custom infrastructure components - an in-house file system, container runtime, scheduler, and image builder - optimized for the latency and throughput characteristics of AI workloads. The technical stack spans Python, Rust, and Go at the systems level, with PyTorch, CUDA, vLLM, and TensorRT support for ML frameworks. This reflects prioritization of both developer ergonomics (Python interface) and low-level performance (Rust/Go for runtime components). The custom infrastructure signals investment in controlling the full vertical - from container initialization through GPU scheduling - rather than composing existing orchestration layers. The team operates across New York, Stockholm, and San Francisco, and includes creators of open-source projects like Seaborn and Luigi, alongside academic researchers and engineers with experience building production systems. The platform positions itself around developer experience as a core constraint, with infrastructure complexity abstracted to reduce operational overhead for data and AI teams.

28 jobs