About

Modal operates a serverless compute platform designed to minimize infrastructure friction for ML inference, fine-tuning, and batch workloads. The platform provides instant GPU access with usage-based pricing, targeting teams that need to ship compute-intensive applications without managing scheduling, container orchestration, or resource allocation. The architecture is built on custom infrastructure components - an in-house file system, container runtime, scheduler, and image builder - optimized for the latency and throughput characteristics of AI workloads.

The technical stack spans Python, Rust, and Go at the systems level, with PyTorch, CUDA, vLLM, and TensorRT support for ML frameworks. This reflects prioritization of both developer ergonomics (Python interface) and low-level performance (Rust/Go for runtime components). The custom infrastructure signals investment in controlling the full vertical - from container initialization through GPU scheduling - rather than composing existing orchestration layers.

The team operates across New York, Stockholm, and San Francisco, and includes creators of open-source projects like Seaborn and Luigi, alongside academic researchers and engineers with experience building production systems. The platform positions itself around developer experience as a core constraint, with infrastructure complexity abstracted to reduce operational overhead for data and AI teams.

Open roles at Modal

Explore 28 open positions at Modal and find your next opportunity.

Modal logoMO

Forward Deployed ML Engineer

Modal

New York, United States (On-site)

$180K – $250K Yearly3d ago
Modal logoMO

Founding GTM Talent Partner

Modal

San Francisco, California, United States (On-site)

$150K – $195K Yearly3d ago
Modal logoMO

Controller

Modal

New York, United States (On-site)

$220K – $235K Yearly3d ago
Modal logoMO

Business Operations Manager

Modal

New York, United States (On-site)

$140K – $190K Yearly3d ago
Modal logoMO

Member of Technical Staff - Reliability Engineering

Modal

New York, United States (On-site)

$150K – $350K Yearly3d ago
Modal logoMO

Security Field Engineer

Modal

New York, United States (On-site)

$150K – $270K Yearly3d ago
Modal logoMO

Forward Deployed Engineer - Systems

Modal

New York, United States (On-site)

$180K – $240K Yearly3d ago
Modal logoMO

Member of Technical Staff - ML Performance

Modal

New York, United States (On-site)

$150K – $350K Yearly3d ago
Modal logoMO

Technical Content Marketing

Modal

New York, United States (On-site)

$130K – $250K Yearly3d ago
Modal logoMO

Systems Engineering Manager

Modal

Stockholm, Stockholm, Sweden (On-site)

$175K – $250K Yearly3d ago
Modal logoMO

Product Marketing Manager

Modal

New York, United States (On-site)

$190K – $260K Yearly3d ago
Modal logoMO

Forward Deployed Engineer - Systems

Modal

Stockholm, Stockholm, Sweden (On-site)

3d ago
Modal logoMO

Compute Strategy & Operations Lead

Modal

New York, United States (On-site)

$200K – $300K Yearly3d ago
Modal logoMO

Member of Technical Staff - Systems

Modal

New York, United States (On-site)

$200K – $350K Yearly3d ago
Modal logoMO

Developer Relations Engineer

Modal

San Francisco, California, United States (On-site)

$175K – $275K Yearly3d ago
Modal logoMO

Member of Technical Staff - Product (Backend)

Modal

New York, United States (On-site)

$150K – $300K Yearly3d ago
Modal logoMO

Member of Technical Staff - Product (Growth)

Modal

New York, United States (On-site)

$150K – $300K Yearly3d ago
Modal logoMO

Account Executive - Enterprise

Modal

New York, United States (On-site)

$300K – $300K Yearly3d ago
Modal logoMO

Support Engineer

Modal

New York, United States (On-site)

$150K – $220K Yearly3d ago
Modal logoMO

Infrastructure Security Engineer

Modal

New York, United States (On-site)

$150K – $270K Yearly3d ago

Similar companies

Baseten logoBA

Baseten

Baseten is an AI infrastructure platform providing the tooling, expertise, and hardware needed to deploy and scale AI models in production.

58 jobs
Together AI logoTA

Together AI

Together AI is a research-driven AI cloud infrastructure provider enabling developers and enterprises to train, fine-tune, and deploy open-source generative AI models at scale.

48 jobs
d-Matrix logoD-

d-Matrix

d-Matrix builds purpose-built AI inference computing platforms to make generative AI commercially viable, efficient, and sustainable through digital in-memory compute technology.

43 jobs
Runpod logoRU

Runpod

RunPod provides cloud infrastructure for AI developers, offering GPU computing services for training, deploying, and scaling AI models.

18 jobs
Vast.ai logoVA

Vast.ai

Vast.ai is the market leader for low cost GPU rentals, connecting data centers and professionals with users who need AI compute at prices 3-5X cheaper than traditional cloud providers.

9 jobs
fal.ai logoFA

fal.ai

fal.ai operates serverless GPU compute and a model gallery for deploying generative media inference - image, video, audio, and 3D - at production scale.