IN

Inferact

About

This company hasn't shared a description yet.

Similar companies

QD

Qdrant

Qdrant is a Rust-based vector database designed for high-dimensional similarity search at scale, serving semantic search, recommendation systems, and retrieval-augmented generation workloads. The system has processed billions of vectors across production deployments, with adoption reflected in 10 million+ downloads and 23,000 GitHub stars. The architecture trades language-level memory safety and zero-cost abstractions for predictable performance characteristics under load, operating both as an open-source deployment target and a managed cloud service. The database handles multi-modal retrieval and real-time recommendation workloads for enterprises including HubSpot, Bayer, Bosch, and CB Insights, spanning e-commerce through healthcare verticals. The managed offering positions deployment time as a primary bottleneck reducer, though actual production reliability depends on vector dimensionality, query patterns, and infrastructure topology. The team of 75+ distributed across 20+ countries maintains both the core engine and cloud operations, with the stack including gRPC for service boundaries, Kubernetes for orchestration, and observability through Prometheus/Grafana/OpenTelemetry. Founded in 2021 by André Zayarni and Andrey Vasnetsov, the company operates a dual open-source and managed cloud business model. The technical focus centers on scalability trade-offs in nearest neighbor search - balancing index structure overhead, query latency distribution, and write throughput as vector counts scale. Deployment options span AWS, GCP, and Azure, with Terraform for infrastructure provisioning and Docker for containerization.

27 jobs
MA

Magic

Magic operates 8,000 NVIDIA H100s on Google Cloud, training frontier code models designed to automate software engineering and AI research itself. The company has raised $515 million from Nat Friedman, Daniel Gross, CapitalG, and Sequoia to pursue direct AGI development through code generation - treating automated AI research as the primary bottleneck rather than incremental developer tooling. Technical focus spans large-scale pre-training, domain-specific reinforcement learning, ultra-long context windows, and inference-time compute scaling. The company's research program centers on fundamental problems in automating software engineering at scale, not incremental productivity improvements. Context window extension and inference-time compute are treated as first-class constraints rather than auxiliary features. Co-founded by Eric Steinberger and Sebastian De Ro, the team remains small and emphasizes ownership over execution - engineers and researchers work on meaningful problem subsets rather than predetermined roadmaps. Infrastructure operates at production scale: the H100 cluster represents committed capital toward training runs that matter, not research prototypes. The operational model assumes that code generation quality and AI research automation are the direct path to AGI, making software engineering the domain where model capabilities and safety research converge. Google Cloud provides the substrate, but the company owns its GPU allocation outright.

17 jobs
IN

Interaction

Interaction is a Palo Alto-based startup building Poke, an AI assistant that operates entirely within iMessage and SMS. The architecture constrains the system to function through messaging protocols rather than native apps, requiring the assistant to parse natural language commands, maintain conversational state, and execute actions across text, email, and calendar integrations - all mediated through message-based I/O. This introduces latency and throughput considerations inherent to SMS delivery networks and iMessage's API surface, alongside constraints on rich UI feedback mechanisms available to native applications. The company raised $15M in seed funding led by General Catalyst. The technical challenge centers on building proactive intelligence that surfaces relevant information from communication patterns while operating within the reliability and availability constraints of carrier networks and Apple's messaging infrastructure. Cross-platform integration across email and calendar systems adds complexity in authentication flows, permission models, and error handling when actions must be triggered through conversational interfaces rather than direct API calls. The team includes engineers from quantitative trading firms, MIT, Cambridge, and international science olympiad competitors. The stack includes Next.js, React, and SwiftUI, suggesting server-side processing for NLP workloads with client components for any companion interfaces. Production success depends on handling edge cases in natural language understanding, managing state across asynchronous message exchanges, and maintaining consistent behavior despite variable network conditions and platform-specific limitations in both iOS and carrier SMS systems.

1 job