PL

LLM Inference Engineer

Periodic Labs
Posted onFeb 16, 2026
LocationMenlo Park, California, United States (On-site)
Employment typeFull-time

About Periodic Labs

We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identity and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission.

About the role

You will integrate, optimize, and operate large-scale inference systems to power AI scientific research. You will build and maintain high-performance serving infrastructure that delivers low-latency, high-throughput access to large language models across thousands of GPUs. You will work closely with researchers and engineers to integrate cutting-edge inference into large-scale reinforcement learning workloads. You will build tools and directly support frontier-scale experiments to make Periodic Labs the world’s best AI + science lab. You will make contributions to open-source LLM inference software.

You might thrive in this role if you have experience with:

  • Optimizing inference for the largest open-source model

  • High-performance model serving frameworks such as TensorRT-LLM, vLLM, SGLang

  • Distributed inference techniques (tensor/expert/pipeline parallelism, speculative decoding, KV cache management)

  • Optimizing GPU utilization and latency for reinforcement learning

Periodic Labs is building AI scientists and autonomous laboratories to accelerate scientific discovery in the physical sciences by combining frontier AI models with real-world experimental data.

Similar jobs

You might also be interested in...

HA2w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

TA6d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160k – $230k Yearly

NV2w

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $287.5k Yearly

TA1w

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)

$200k – $280k Yearly

TM6d

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly