Baseten builds AI infrastructure for deploying and scaling models in production, with work spanning everything from kernel-level optimization for inference performance to developer tooling. The platform ships daily and measures success by the real-world impact of the AI products running on it rather than by vanity metrics. Engineers embed directly with customers to surface operational bottlenecks, then optimize obsessively. The work ranges from TensorRT-LLM and CUDA kernel tuning to building developer tools that reduce deployment friction.

The stack centers on inference at scale: TensorRT-LLM and PyTorch for model execution, NVIDIA Triton Inference Server for serving, Kubernetes (EKS) with Karpenter for autoscaling, and Knative for event-driven workloads on AWS EC2. Infrastructure decisions prioritize shipping velocity over process: small teams with real ownership iterate rapidly on production reliability, latency (including tail behavior), and cost efficiency. Docker containerization and PostgreSQL round out the core operational dependencies.

The team is internationally distributed and made up of engineers and designers who take craft seriously without performative posturing. Customer-embedded engineering informs both platform architecture and developer-experience tradeoffs, creating tight feedback loops between deployment reality and infrastructure evolution. Since the company's founding, the approach has centered on hands-on problem solving and rapid iteration rather than abstraction layers that delay production learning.

Open roles at Baseten

Explore 27 open positions at Baseten and find your next opportunity.

Posted 2d ago

Software Engineer - AI Enablement

Baseten

San Francisco, California, United States (On-site)

$150k – $230k Yearly

Posted 2d ago

Senior Software Engineer - New Products

Baseten

San Francisco, California, United States (On-site)

$185k – $285k Yearly

Posted 2d ago

Solution Architect

Baseten

San Francisco, California, United States (On-site)

$165k – $275k Yearly

Posted 3d ago

Senior Software Engineer - Infrastructure

Baseten

San Francisco, California, United States (On-site)

$150k – $230k Yearly

Posted 3d ago

Software Engineer — GPU Networking & Distributed Systems

Baseten

San Francisco, California, United States (On-site)

$150k – $250k Yearly

Posted 1w ago

Sales Manager - Emerging

Baseten

San Francisco, California, United States (Hybrid)

$280k – $320k Yearly

Posted 1w ago

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)

C$130k – C$200k Yearly

Posted 1w ago

Senior Business Recruiter

Baseten

San Francisco, California, United States (On-site)

$160k – $200k Yearly

Posted 1w ago

Corporate Finance Lead

Baseten

San Francisco, California, United States (On-site)

$200k – $250k Yearly

Posted 1w ago

Software Engineer - Model APIs

Baseten

San Francisco, California, United States (On-site)

$150k – $230k Yearly

Posted 1w ago

Partnerships Product Marketing Manager

Baseten

San Francisco, California, United States (On-site)

$160k – $190k Yearly

Posted 1w ago

Technical Recruiter

Baseten

San Francisco, California, United States (On-site)

$160k – $210k Yearly

Posted 1w ago

Account Executive - AI Native: Strategic

Baseten

San Francisco, California, United States (On-site)

$230k – $300k Yearly

Posted 1w ago

Senior Sales Recruiter

Baseten

San Francisco, California, United States (On-site)

$160k – $200k Yearly

Posted 1w ago

Senior/Staff Product Designer

Baseten

San Francisco, California, United States (On-site)

$175k – $225k Yearly

Posted 1w ago

AI Solutions Engineer

Baseten

San Francisco, California, United States (On-site)

$160k – $275k Yearly

Posted 1w ago

Senior Software Engineer - Enterprise Platform

Baseten

San Francisco, California, United States (On-site)

$200k – $270k Yearly

Posted 1w ago

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)

$220k – $285k Yearly

Posted 1w ago

Software Engineer - Core Product

Baseten

San Francisco, California, United States (On-site)

$150k – $230k Yearly

Posted 1w ago

Account Executive - AI Native: Strategic

Baseten

New York, New York, United States (On-site)

$230k – $300k Yearly