
Together AI operates a purpose-built GPU cloud platform for training, fine-tuning, and deploying generative AI models. The infrastructure is designed without vendor lock-in, serving developers and organizations that need to run open-source models at scale. The engineering work centers on distributed systems, model optimization, and AI infrastructure - areas where trade-offs between throughput, latency, and operational complexity define production viability.

The company actively contributes to open-source projects including FlashAttention, Mamba, and RedPajama. Engineers and researchers work closely together, and new hires take ownership of substantial technical challenges from the start. The tech stack spans PyTorch, CUDA, TensorRT, TensorRT-LLM, vLLM, SGLang, and TGI, reflecting the need to support multiple inference backends and optimization paths. Work involves designing distributed inference engines and developing model architectures where performance characteristics (memory bandwidth utilization, kernel fusion opportunities, multi-GPU coordination overhead) directly determine which models can run economically in production.

Technical problems include optimizing inference for various model architectures across heterogeneous GPU clusters, managing the reliability and cost trade-offs in serving large language models, and building tooling that makes open-source AI accessible without sacrificing control over deployment parameters. The platform must handle the operational complexity of supporting diverse workloads: training runs with different parallelization strategies, fine-tuning jobs with varying dataset sizes, and inference deployments where tail latency matters.

Open roles at Together AI

Explore 36 open positions at Together AI and find your next opportunity.

5d

Senior Software Engineer - Together Cloud Platform

Together AI

San Francisco, California, United States (Hybrid)

$160k – $230k Yearly

1w

Staff Strategic Sourcing Manager (Hardware)

Together AI

San Francisco, California, United States (Hybrid)

$220k – $260k Yearly

1w

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)

$200k – $280k Yearly

1w

Sr. Manager, Cloud Sourcing

Together AI

San Francisco, California, United States (On-site)

$230k – $260k Yearly

1w

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)

$190k – $270k Yearly

1w

Sr. Manager, Capacity Planning

Together AI

San Francisco, California, United States (Hybrid)

$230k – $260k Yearly

2w

Senior Strategic Sourcing & Procurement Lead, Compute

Together AI

San Francisco, California, United States (Hybrid)

$150k – $200k Yearly

2w

Senior Software Engineer, Observability

Together AI

San Francisco, California, United States (Hybrid)

$160k – $260k Yearly

2w

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Together AI

Amsterdam, North Holland, Netherlands (Hybrid)

2w

Senior Counsel, Commercial

Together AI

San Francisco, California, United States (On-site)

$225k – $300k Yearly

2w

Senior Network Engineer (Amsterdam)

Together AI

Amsterdam, North Holland, Netherlands (On-site)

2w

Senior Backend Engineer - Together Cloud

Together AI

Amsterdam, North Holland, Netherlands (Hybrid)

2w

Research Intern, Model Shaping (Summer 2026)

Together AI

San Francisco, California, United States (On-site)

2w

Strategic Finance Manager

Together AI

San Francisco, California, United States (On-site)

$210k – $260k Yearly

3w

Product Marketing Director

Together AI

San Francisco, California, United States (Hybrid)

$250k – $295k Yearly

3w

Sales Development Engineer

Together AI

San Francisco, California, United States (On-site)

$90k – $150k Yearly