About

Baseten builds AI infrastructure for deploying and scaling models in production, with work spanning everything from kernel-level optimization for inference performance to developer tooling. The platform ships daily and measures success by the real-world impact of the AI products running on it rather than by vanity metrics. Engineers embed directly with customers to surface operational bottlenecks, then optimize relentlessly: work ranges from TensorRT-LLM and CUDA kernel tuning to building developer tools that reduce deployment friction.

The stack centers on inference at scale: TensorRT-LLM and PyTorch for model execution, NVIDIA Triton Inference Server for serving, Kubernetes (EKS) with Karpenter for autoscaling, and Knative for event-driven workloads on AWS EC2. Infrastructure decisions prioritize shipping velocity over process: small teams with real ownership iterate rapidly on production reliability, latency (including tail latency), and cost efficiency. Docker containerization and PostgreSQL round out the core operational dependencies.

The team is internationally distributed, made up of engineers and designers who take craft seriously without performative posturing. Customer-embedded engineering informs both platform architecture and developer experience tradeoffs, creating tight feedback loops between deployment reality and infrastructure evolution. Since its founding, the company's approach has centered on hands-on problem solving and rapid iteration rather than abstraction layers that delay production learning.

Open roles at Baseten

Explore 58 open positions at Baseten and find your next opportunity.

Senior Software Engineer - New Products

Baseten

San Francisco, California, United States (On-site)

$165K – $330K Yearly, posted 2d ago

Software Engineer - Infrastructure

Baseten

San Francisco, California, US or Remote (United States)

$165K – $330K Yearly, posted 4d ago

Manager, Solutions Architect

Baseten

Worldwide (Remote)

$165K – $330K Yearly, posted 4d ago

Senior Software Engineer - Model Training

Baseten

San Francisco, California, US or Remote (Worldwide)

$165K – $330K Yearly, posted 4d ago

Engineering Manager, Model Library

Baseten

United States (Remote)

$165K – $330K Yearly, posted 4d ago

Engineering Manager, Internal Platform

Baseten

Worldwide (Remote)

$240K – $260K Yearly, posted 4d ago

Senior Sales Recruiter

Baseten

San Francisco, California, US or Remote (California, United States)

$160K – $200K Yearly, posted 4d ago

Software Engineer - Voice AI (Inference Runtime)

Baseten

United States (Remote)

$165K – $330K Yearly, posted 4d ago

AI Solutions Engineer

Baseten

San Francisco, California, US or Remote (United States)

$165K – $330K Yearly, posted 4d ago

OS / K8s Systems Engineer

Baseten

Worldwide (Remote)

$165K – $330K Yearly, posted 4d ago

Strategic Finance, GTM

Baseten

San Francisco, California, United States (On-site)

$190K – $210K Yearly, posted 4d ago

Integrated Marketing Manager

Baseten

Worldwide (Remote)

$140K – $170K Yearly, posted 4d ago

Data Center Network Engineer

Baseten

Worldwide (Remote)

$180K – $360K Yearly, posted 4d ago

Software Engineer - Model Performance

Baseten

San Francisco, California, US or Remote (Worldwide)

$180K – $360K Yearly, posted 4d ago

Capacity and Infrastructure Lead

Baseten

United States (Remote)

$220K – $260K Yearly, posted 4d ago

Software Engineer - Model API's

Baseten

San Francisco, California, US or Remote (Worldwide)

$180K – $360K Yearly, posted 4d ago

Forward Deployed Engineer

Baseten

San Francisco, California, US or Remote (United States)

$165K – $330K Yearly, posted 4d ago

Senior Product Engineer - Training Platform

Baseten

San Francisco, California, US or Remote (Worldwide)

$165K – $330K Yearly, posted 4d ago

Applied AI Inference Engineer

Baseten

San Francisco, California, US or Remote (California, United States + 1 more)

$165K – $330K Yearly, posted 4d ago

Similar companies

Together AI

Together AI is a research-driven AI cloud infrastructure provider enabling developers and enterprises to train, fine-tune, and deploy open-source generative AI models at scale.

48 jobs

Braintrust

Braintrust is the AI observability platform helping teams measure, evaluate, and improve AI in production. Trusted by companies like Notion, Stripe, Zapier, Vercel, and Ramp.

32 jobs

Modal

Modal is a serverless compute platform for AI and data teams that enables running compute-intensive workloads like ML inference, fine-tuning, and batch jobs with instant GPU access and usage-based pricing.

28 jobs

Runpod

Runpod provides cloud infrastructure for AI developers, offering GPU computing services for training, deploying, and scaling AI models.

18 jobs

Lambda

Lambda is an AI-only company providing cloud GPUs, on-demand clusters, and hardware for AI training and inference, building the infrastructure powering AI services used by hundreds of millions of people.

14 jobs

Bento

Bento provides an open-source framework and enterprise platform for deploying and operating AI/ML model inference in production with control over performance, scaling, and operational complexity.