1. Home
  2. AI Companies
  3. Perplexity

About

Perplexity operates an AI-powered answer engine processing over 150 million questions weekly across web, mobile, and enterprise platforms. Founded in 2022, the system combines real-time web search with multiple LLMs to deliver source-attributed answers. The architecture serves both consumer and enterprise workloads, with enterprise deployments requiring security guarantees for knowledge worker use cases including legal research partnerships with organizations like Latham & Watkins.

The technical stack runs on AWS infrastructure with Terraform for provisioning, Python and Go for backend services, and PyTorch with DeepSpeed and FSDP for model training and inference. Data pipelines use dbt, SQL, Snowflake, and Databricks. Frontend implementations use React and TypeScript, with Docker containerization and Open Policy Agent for access control. This architecture must handle tail latency and throughput requirements for real-time search retrieval paired with LLM inference at consumer scale, while maintaining source credibility verification in the critical path.

The engineering focus centers on information retrieval accuracy, model response quality, and citation reliability rather than advertising optimization. Production systems must balance inference cost against answer quality across multiple models, manage retrieval latency for real-time web indexing, and maintain reliability for both free-tier consumer traffic and enterprise SLA requirements. Pro tier monetization suggests capacity-based or model selection tiering rather than pure ad-based revenue.

Open roles at Perplexity

Explore 54 open positions at Perplexity and find your next opportunity.

Perplexity logoPE

AI Infra Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)

$220K – $405K Yearly3d ago
Perplexity logoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)

$220K – $485K Yearly3d ago
Perplexity logoPE

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site)

$300K – $470K Yearly2w ago
Perplexity logoPE

UK Internship Program

Perplexity

London, England, United Kingdom (Hybrid)

2w ago
Perplexity logoPE

Data Scientist/Engineer – Online Metrics

Perplexity

London, England, United Kingdom (On-site)

2w ago
Perplexity logoPE

Software Engineer - Data Flywheel

Perplexity

London, England, United Kingdom (On-site)

$210K – $385K Yearly2w ago
Perplexity logoPE

Data Scientist, Evals

Perplexity

London, England, United Kingdom (On-site)

$210K – $385K Yearly2w ago
Perplexity logoPE

iOS Engineer - Comet

Perplexity

San Francisco, California, United States (On-site)

$200K – $300K Yearly2w ago
Perplexity logoPE

Data Scientist

Perplexity

San Francisco, California, United States (On-site)

$210K – $330K Yearly2w ago
Perplexity logoPE

Backend Software Engineer

Perplexity

Worldwide (Remote)

$220K – $405K Yearly2w ago
Perplexity logoPE

Analytics Engineer

Perplexity

San Francisco, California, United States (On-site)

$210K – $330K Yearly2w ago
Perplexity logoPE

Demand Generation

Perplexity

United States (Remote)

$200K – $215K Yearly2w ago
Perplexity logoPE

Enterprise Customer Support Specialist

Perplexity

California, United States + 1 more (Remote)

$110K – $130K Yearly2w ago
Perplexity logoPE

Member of Technical Staff - Secure Intelligence Institute

Perplexity

San Francisco, California, United States (On-site)

$220K – $405K Yearly2w ago

Similar companies

Braintrust logoBR

Braintrust

Braintrust is the AI observability platform helping teams measure, evaluate, and improve AI in production. Trusted by companies like Notion, Stripe, Zapier, Vercel, and Ramp.

32 jobs
Lovable logoLO

Lovable

Lovable is an AI-powered platform that enables anyone to build production-ready web applications and websites through conversational prompts, democratizing software creation for non-technical users.

32 jobs
Parallel Web Systems logoPW

Parallel Web Systems

Parallel Web Systems builds web search and research APIs purpose-built for AI agents, providing production-grade infrastructure that outperforms humans and leading AI models on complex research benchmarks.

17 jobs
Pinecone logoPI

Pinecone

Pinecone is the leading vector database for building accurate and performant AI applications at scale in production.

2 jobs
Qdrant logoQD

Qdrant

Qdrant is an open-source vector database and similarity search engine written in Rust, powering AI applications with high-performance vector similarity search technology.

1 job
Inferact logoIN

Inferact

Inferact commercializes vLLM, an open-source LLM inference engine built by its founders, to reduce inference latency, cost, and serving complexity at scale.