About

Modal operates a serverless compute platform designed to minimize infrastructure friction for ML inference, fine-tuning, and batch workloads. The platform provides instant GPU access with usage-based pricing, targeting teams that need to ship compute-intensive applications without managing scheduling, container orchestration, or resource allocation. The architecture is built on custom infrastructure components - an in-house file system, container runtime, scheduler, and image builder - optimized for the latency and throughput characteristics of AI workloads.

The technical stack spans Python, Rust, and Go at the systems level, with PyTorch, CUDA, vLLM, and TensorRT support for ML frameworks. This reflects prioritization of both developer ergonomics (Python interface) and low-level performance (Rust/Go for runtime components). The custom infrastructure signals investment in controlling the full vertical - from container initialization through GPU scheduling - rather than composing existing orchestration layers.

The team operates across New York, Stockholm, and San Francisco, and includes creators of open-source projects like Seaborn and Luigi, alongside academic researchers and engineers with experience building production systems. The platform positions itself around developer experience as a core constraint, with infrastructure complexity abstracted to reduce operational overhead for data and AI teams.

Open roles at Modal

Explore 28 open positions at Modal and find your next opportunity.

Modal logoMO

Technical Content Marketing

Modal

New York, United States (On-site)

$130K – $250K Yearly3d ago
Modal logoMO

Member of Technical Staff - Systems

Modal

New York, United States (On-site)

$200K – $350K Yearly3d ago
Modal logoMO

Member of Technical Staff - Product (Growth)

Modal

New York, United States (On-site)

$150K – $300K Yearly3d ago
Modal logoMO

Systems Engineering Manager

Modal

New York, United States (On-site)

$250K – $350K Yearly2w ago
Modal logoMO

Solutions Architect

Modal

San Francisco, California, United States (Hybrid)

$200K – $280K Yearly2w ago
Modal logoMO

VP Finance

Modal

New York, United States (On-site)

$300K – $350K Yearly2w ago
Modal logoMO

Member of Technical Staff - ML Training Systems

Modal

New York, United States (On-site)

$150K – $350K Yearly2w ago
Modal logoMO

Forward Deployed Engineer - ML

Modal

Stockholm, Sweden (On-site)

2w ago

Similar companies

Baseten logoBA

Baseten

Baseten is an AI infrastructure platform providing the tooling, expertise, and hardware needed to deploy and scale AI models in production.

58 jobs
Together AI logoTA

Together AI

Together AI is a research-driven AI cloud infrastructure provider enabling developers and enterprises to train, fine-tune, and deploy open-source generative AI models at scale.

48 jobs
d-Matrix logoD-

d-Matrix

d-Matrix builds purpose-built AI inference computing platforms to make generative AI commercially viable, efficient, and sustainable through digital in-memory compute technology.

43 jobs
Runpod logoRU

Runpod

RunPod provides cloud infrastructure for AI developers, offering GPU computing services for training, deploying, and scaling AI models.

18 jobs
Vast.ai logoVA

Vast.ai

Vast.ai is the market leader for low cost GPU rentals, connecting data centers and professionals with users who need AI compute at prices 3-5X cheaper than traditional cloud providers.

9 jobs
fal.ai logoFA

fal.ai

fal.ai operates serverless GPU compute and a model gallery for deploying generative media inference - image, video, audio, and 3D - at production scale.