CA

Cartesia

About

Cartesia builds real-time multimodal AI models for voice applications, with production systems spanning text-to-speech and speech-to-text. The company emerged from Stanford's AI Lab, where the founding team - led by CEO Karan Goel - pioneered work on State Space Models (SSMs) before transitioning to commercial infrastructure. Their technical approach combines model innovation with systems engineering, focusing on the latency, throughput, and operational constraints that define production voice AI.

The core product line includes Sonic, a text-to-speech model designed for emotive, human-like output, and Ink, a recently launched speech-to-text system purpose-built for real-time voice applications. Both systems address the fundamental trade-offs in voice AI: achieving low-latency inference while maintaining quality at scale. The company's technical domains span foundation model development, real-time multimodal intelligence, and developer tooling - infrastructure that runs where users are rather than requiring server-side processing.

Cartesia's engineering stack runs on Python, Go, and TypeScript, supporting developers building voice interfaces that demand sub-second response times and reliable performance under production load. The team's research background in SSMs informs their approach to model efficiency and scalability, though the company now focuses on shipping production systems rather than pure research. Their stated mission centers on ubiquitous, interactive intelligence - systems that handle the operational complexity of real-time voice while remaining accessible to developers building conversational interfaces.

Open roles at Cartesia

Explore 26 open positions at Cartesia and find your next opportunity.

CA

Technical Lead Manager, Platform (India)

Cartesia

Bengaluru, Karnataka, India (On-site)

₹10M – ₹13M Yearly1w ago
CA

Product and Research Operations Analyst (Fixed Term Contract)

Cartesia

Westminster, London, England, United Kingdom (Hybrid)

£35 – £35 Hourly1w ago
CA

Technical Account Manager

Cartesia

San Francisco, California, United States (On-site)

$240K – $280K Yearly1w ago
CA

Forward Deployed Engineer (India)

Cartesia

India (Remote)

₹7M – ₹9M Yearly1mo ago
CA

Research Engineer, Data (India)

Cartesia

Bengaluru, Karnataka, India (On-site)

₹7M – ₹9M Yearly2mo ago
CA

Solutions Engineer

Cartesia

San Francisco, California, United States (On-site)

$160K – $220K Yearly2mo ago
CA

Software Engineer, Product

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly2mo ago
CA

Software Engineer, Databases

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly2mo ago
CA

Software Engineer, AI & Developer Experience

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly2mo ago
CA

Design Engineer

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly2mo ago
CA

Product Engineer Intern

Cartesia

San Francisco, California, United States (On-site)

$96K – $96K Yearly3mo ago
CA

Frontend Engineer, Marketing

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly3mo ago
CA

HR Business Partner

Cartesia

San Francisco, California, United States (On-site)

$200K – $270K Yearly3mo ago
CA

Founding Marketer

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly3mo ago
CA

Account Executive

Cartesia

San Francisco, California, United States (On-site)

$240K – $280K Yearly3mo ago
CA

Senior Applied Researcher, Audio Understanding

Cartesia

San Francisco, California, United States (On-site)

$200K – $350K Yearly3mo ago
CA

Researcher, Evals

Cartesia

San Francisco, California, United States (On-site)

$220K – $350K Yearly3mo ago
CA

Forward Deployed Engineer

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly3mo ago
CA

Research Engineer, Data

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly3mo ago
CA

Researcher: Model Architecture

Cartesia

San Francisco, California, United States (On-site)

$180K – $350K Yearly3mo ago

Similar companies

DE

Decagon

Decagon builds a conversational AI platform designed to replace or augment legacy customer support systems by deploying intelligent AI agents across chat, email, and voice channels. The company positions its technology as infrastructure for delivering concierge-level customer experiences at scale, targeting brands looking to support, onboard, and retain customers without proportional headcount growth. Led by CEO Jesse Zhang and founded by serial entrepreneurs, Decagon operates from the US and focuses on addressing the operational constraints of traditional customer support systems. The platform's core technical approach centers on Agent Operating Procedures (AOPs), a natural-language-to-code compilation system that allows non-technical users to define agent behavior while preserving technical team control over guardrails, integrations, and versioning. This design addresses a common trade-off in AI tooling: enabling rapid iteration by domain experts without sacrificing reliability controls or introducing configuration drift. The agent orchestration layer spans multiple channels and claims to amplify CX team impact by 10x, though specific benchmarks around latency, accuracy, or failure rate are not publicly detailed. Decagon's technical domains span conversational AI, natural language processing, multichannel messaging infrastructure, and automation systems. The platform emphasizes runtime guardrails and version management as first-class concerns, reflecting a systems-oriented approach to production deployment. The company claims to deliver always-on, personalized service, positioning its agents as operational infrastructure rather than experimental tooling. For engineers evaluating opportunities, the technical challenges likely involve scaling context-rich, stateful interactions across channels while maintaining consistency, handling edge cases in natural language understanding, and building abstraction layers that balance expressiveness with safety.

89 jobs
TO

Toma

Toma operates a voice AI platform for automotive dealerships, processing over 1,000,000 calls since launching in 2024. The system handles inbound phone operations - service scheduling, call routing, and follow-up automation - with safeguards designed to manage transfer latency and revenue leakage. Core technical challenge: maintaining conversational quality and intent detection accuracy across high-variance dealership scenarios (service appointments, parts inquiries, sales handoffs) while minimizing false transfers and dropped context. The platform implements transfer triggers, clawback mechanisms for mistimed handoffs, and follow-up alerts when human staff doesn't complete actions, addressing the operational complexity of human-AI transition points in production telephony. Infrastructure runs on AWS with a TypeScript/Next.js frontend, PostgreSQL via Prisma for state management, and tRPC for type-safe API boundaries. The voice AI layer must handle real-time constraints - low-latency speech recognition and synthesis, sub-second intent classification - while managing concurrent call volume and dealership-specific context (inventory, scheduling systems, staff availability). Trade-offs center on model selection for conversational understanding versus inference cost at scale, and the reliability surface area of integrating with legacy dealership management systems. Founded by engineers from Scale AI, Uber, Lyft, and Amazon; backed by Andreessen Horowitz and Y Combinator with $17 million Series A funding. Deployment spans dealerships across the United States, including Pohanka Automotive Group, SCHOMP, Hudson Automotive Group, and Bergey's. Primary bottlenecks likely involve tuning voice models for domain-specific terminology (vehicle makes, service codes, dealership jargon), managing tail latency in transfer decisions where milliseconds impact customer experience, and evaluating conversational success beyond simple call completion - did the AI correctly capture appointment details, route urgency appropriately, preserve customer satisfaction? The system's value proposition hinges on converting missed calls and staff bottlenecks into captured revenue, which requires high precision on intent classification and low false-negative rates on transfer triggers to avoid revenue loss from mishandled interactions.

1 job