1. Home
  2. AI Companies
  3. Deepnight
DE

Deepnight

About

This company hasn't shared a description yet.

Similar companies

NE

Nebius

Nebius is a Nasdaq-listed technology company (NBIS) building full-stack AI infrastructure from its Amsterdam headquarters, with GPU clusters deployed across Europe and the United States. Led by CEO Arkady Volozh, the company operates AI-optimized sustainable data centers - including a facility 60 kilometers from Helsinki and a new Vineland, New Jersey site - and has raised significant capital ($700 million from investors including Accel, NVIDIA, and Orbis). The engineering organization, numbering in the hundreds, maintains deep expertise in world-class infrastructure and runs an in-house AI R&D team that dogfoods the platform to validate it against production ML practitioner requirements. The infrastructure stack spans hyperscaler-scale features with supercomputer-grade performance characteristics. ISEG, Nebius's supercomputer, ranks among the world's most powerful systems. The platform integrates NVIDIA GPUs with NVIDIA InfiniBand networking, exposing workload orchestration through both Kubernetes and Slurm. The operational layer includes standard observability (Prometheus, Grafana), data infrastructure (PostgreSQL, Apache Spark), and ML tooling (MLflow, vLLM, Triton, Ray), with infrastructure-as-code managed via Terraform. This architecture targets the latency, throughput, and reliability requirements of AI training and inference workloads at scale. The company has secured a multi-billion dollar agreement with Microsoft to deliver dedicated AI infrastructure from its Vineland data center. Nebius serves startups, research institutes, and enterprises across healthcare and life sciences, robotics, finance, and entertainment verticals. The technical approach emphasizes production-grade infrastructure that handles the operational complexity of large-scale AI deployments - managing GPU utilization, network bottlenecks, and the cost-performance trade-offs inherent in serving diverse AI workloads from model training through inference serving.

484 jobs
QD

Qdrant

Qdrant is a Rust-based vector database designed for high-dimensional similarity search at scale, serving semantic search, recommendation systems, and retrieval-augmented generation workloads. The system has processed billions of vectors across production deployments, with adoption reflected in 10 million+ downloads and 23,000 GitHub stars. The architecture trades language-level memory safety and zero-cost abstractions for predictable performance characteristics under load, operating both as an open-source deployment target and a managed cloud service. The database handles multi-modal retrieval and real-time recommendation workloads for enterprises including HubSpot, Bayer, Bosch, and CB Insights, spanning e-commerce through healthcare verticals. The managed offering positions deployment time as a primary bottleneck reducer, though actual production reliability depends on vector dimensionality, query patterns, and infrastructure topology. The team of 75+ distributed across 20+ countries maintains both the core engine and cloud operations, with the stack including gRPC for service boundaries, Kubernetes for orchestration, and observability through Prometheus/Grafana/OpenTelemetry. Founded in 2021 by André Zayarni and Andrey Vasnetsov, the company operates a dual open-source and managed cloud business model. The technical focus centers on scalability trade-offs in nearest neighbor search - balancing index structure overhead, query latency distribution, and write throughput as vector counts scale. Deployment options span AWS, GCP, and Azure, with Terraform for infrastructure provisioning and Docker for containerization.

27 jobs
FU

FurtherAI

FurtherAI builds domain-specific AI infrastructure for commercial insurance workflows, targeting the document-heavy operational bottlenecks that dominate underwriting, claims processing, and policy comparison work. Their AI Workspace handles submission intake, underwriting audits, and compliance checks by parsing and normalizing unstructured data from broker letters, property schedules, Accord forms, and loss histories. The system reports 95–97% accuracy on these tasks compared to 70–77% for manual processing, addressing a workflow layer where precision directly impacts underwriting decisions and operational throughput. The platform is deployed by insurers, reinsurers, MGAs, and brokers writing over $15B in premiums across all 50 U.S. states. Technical focus areas include document understanding, NLP for insurance-specific language and formats, data normalization pipelines, and workflow automation that integrates with existing carrier systems. The core technical challenge is reliability at scale across heterogeneous document types and insurance product lines, where edge cases in policy language or submission format can propagate downstream into underwriting errors or compliance gaps. FurtherAI operates in a sector facing projected workforce reduction of 400,000 by 2026, with approximately 3 million insurance professionals currently handling manual document processing. The system architecture must handle the latency requirements of underwriting timelines while maintaining accuracy thresholds that meet regulatory and risk management standards. Key operational trade-offs include throughput on batch processing of submissions versus real-time responsiveness for urgent underwriting decisions, and the cost-accuracy frontier for document parsing models across different insurance product complexities.

18 jobs
PO

Poolside

Poolside is a frontier research lab founded in 2023, building foundation models and autonomous agents for enterprise software development through reinforcement learning from code execution feedback. The company operates millions of orchestrated coding environments seeded from open-source repositories across Python, Rust, Java, and Go, using actual code execution as the training signal rather than relying solely on language model scaling. This approach treats code execution environments as a safe playground for models to learn from experience, with the explicit constraint that feedback derives from whether code runs correctly. The technical infrastructure spans the full stack from foundation model development in PyTorch and CUDA to enterprise deployment systems built on Kubernetes, Terraform, and AWS. The company's thesis centers on reinforcement learning as the primary scaling axis beyond traditional language model approaches, with a stated focus on high-consequence software for high-consequence applications. Co-founded by Jason Warner and Eiso Kant, Poolside is headquartered in France and positions software engineering as the fastest path toward artificial general intelligence through its experience-based learning paradigm.

10 jobs
OP

OpenRouter

OpenRouter operates a unified API gateway that aggregates 300+ large language models from 60+ providers into a single interface, processing over 100 trillion tokens annually for more than 5 million developers. Founded in 2023 by Alex Atallah and backed by $40M Series A funding from Andreessen Horowitz, Menlo Ventures, and Sequoia Capital, the platform addresses multi-provider infrastructure complexity through intelligent routing, automatic failover, and consolidated billing across models from Anthropic, OpenAI, Google, Meta, and dozens of other providers. The technical architecture prioritizes reliability and operational flexibility through automatic fallbacks between providers, response healing for malformed JSON outputs, and customizable data policies. The platform standardizes access across heterogeneous model APIs while maintaining transparent per-token pricing without subscription tiers. Public usage rankings provide visibility into model performance patterns across the user base. OpenRouter's infrastructure handles workloads ranging from individual developer projects to enterprise-scale deployments, with completion insurance and routing logic designed to mitigate single-provider outages and rate limiting. The platform's tech stack includes React, Next.js, TypeScript, and Cloudflare Workers for edge deployment. Core operational focus centers on eliminating vendor lock-in while maintaining production-grade uptime across a rapidly expanding model catalog.

8 jobs