About

Vast.ai operates a peer-to-peer GPU marketplace connecting over 10,000 GPUs across 40 data centers with users requiring compute for training, fine-tuning, and inference workloads. The platform aggregates capacity from data centers and individual providers running Vast's hosting software, offering on-demand, interruptible, and auction-based pricing models that price 3-5x below traditional cloud providers. Instance deployment occurs in seconds, with the marketplace enabling direct comparison of price-performance across heterogeneous hardware.

The architecture surfaces a pricing-availability trade-off inherent to peer-to-peer models: cost savings derive from utilizing underutilized capacity, but availability and reliability vary by provider. Interruptible instances present the sharpest cost-performance point but require fault-tolerant workloads and checkpointing discipline. The platform supports standard ML frameworks (PyTorch, TensorFlow) and containerized deployments via Docker. Enterprise offerings provide dedicated clusters with SLAs, SOC 2 Type I certification, and access to ISO 27001 certified facilities, trading marketplace economics for operational predictability.

The technical stack spans Python and C++ for core platform services, PostgreSQL for marketplace state, Redis for coordination, and Terraform for infrastructure provisioning. CUDA support is foundational for GPU workloads. The system must handle heterogeneous provider configurations, node churn, and pricing dynamics across thousands of GPUs while maintaining search and allocation latency suitable for rapid instance provisioning. Founded in 2018, the company positions itself as infrastructure for cost-sensitive training and inference at scale.

Open roles at Vast.ai

Explore 9 open positions at Vast.ai and find your next opportunity.

Vast.ai logoVA

AI Agent Researcher

Vast.ai

San Francisco, California, United States (On-site)

$160K – $320K Yearly2w ago
Vast.ai logoVA

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site)

$160K – $320K Yearly2w ago
Vast.ai logoVA

Security Engineer

Vast.ai

Los Angeles, California, United States (On-site)

$145K – $185K Yearly2w ago
Vast.ai logoVA

QA Associate

Vast.ai

Los Angeles, California, United States (On-site)

$40 – $40 Hourly2w ago
Vast.ai logoVA

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site)

$160K – $320K Yearly2w ago
Vast.ai logoVA

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)

$160K – $320K Yearly2w ago
Vast.ai logoVA

C++ Software Engineer — Systems

Vast.ai

San Francisco, California, United States (On-site)

$120K – $180K Yearly2w ago
Vast.ai logoVA

QA Engineer

Vast.ai

Los Angeles, California, United States (On-site)

$40 – $40 Hourly2w ago
Vast.ai logoVA

Senior Infrastructure Engineer

Vast.ai

San Francisco, California, United States (On-site)

$180K – $300K Yearly2w ago

Similar companies

Crusoe logoCR

Crusoe

Crusoe is a vertically integrated AI infrastructure company that builds and operates sustainable data centers and cloud computing platforms powered by clean energy sources.

263 jobs
CoreWeave logoCO

CoreWeave

CoreWeave is an AI-native cloud platform providing specialized GPU infrastructure for training and deploying AI workloads at scale.

256 jobs
Together AI logoTA

Together AI

Together AI is a research-driven AI cloud infrastructure provider enabling developers and enterprises to train, fine-tune, and deploy open-source generative AI models at scale.

48 jobs
Modal logoMO

Modal

Modal is a serverless compute platform for AI and data teams that enables running compute-intensive workloads like ML inference, fine-tuning, and batch jobs with instant GPU access and usage-based pricing.

28 jobs
Runpod logoRU

Runpod

RunPod provides cloud infrastructure for AI developers, offering GPU computing services for training, deploying, and scaling AI models.

18 jobs
Mithril logoMI

Mithril

Mithril orchestrates multi-cloud GPU, CPU, and storage resources through a single interface with transparent pricing, supporting both reserved and spot-based compute for ML training and inference.