1. Home
  2. Jobs
  3. United States
  4. California
  5. San Francisco
  6. AI Infrastructure
  7. GPU Systems Engineer – HPC / Parallel Computing
VA

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai
Posted onFeb 23, 2026
LocationSan Francisco, California, United States | Los Angeles, California, United States (On-site)
Employment typeFull-time
Salary$160k – $320k Yearly

About Us

Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity.

We are a small, growing, and highly motivated team dedicated to an ambitious technical plan. We operate with a flat mobile organizational structure where all contribute directly to the company’s mission. Leadership is earned by those who show initiative and deliver excellence. 

We seek engineers/researchers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of excellent research, coding, and communication skills.

LOCATION: On-site at our office in San Francisco or Westwood, Los Angeles.

About the Role

We’re looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You’ll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI.

  • Full-Time
  • On-site at either our SF or LA offices

Tech Stack

CUDA/C++, GPGPU, Python, Linux

Key Responsibilities

  • Design and optimize GPU kernels and tensor libraries
  • Translate HPC techniques into scalable AI inference solutions
  • Evaluate emerging architectures and resource management approaches
  • Collaborate with technical leadership to improve GPU infrastructure efficiency

Ideal Experience

  • Advanced C++ (C++17/20 preferred)
  • Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar)
  • Strong background in systems optimization and HPC performance tooling
  • Familiarity with distributed training/inference frameworks (bonus)

Interview Process

After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages:

  • Initial screening (virtual, 15 minutes)
  • Quick dive into Vast, systems and architectures (virtual, 30 minutes)
  • LLM-assisted coding assessment (virtual, 1 hour)
  • Meet and greet with coding assessment (on-site, 2 hours)

Our goal is to complete the interview process in two weeks.

Annual Salary Range

$160,000 – $320,000 + equity + benefits

Vast.ai is hiring across all experience levels with compensation commensurate with background, experience and potential.

Benefits

  • Comprehensive health, dental, vision, and life insurance
  • 401(k) with company match 
  • Meaningful early-stage equity
  • Onsite meals, snacks, and close collaboration with founders/tech leaders
  • Ambitious, fast-paced startup culture where initiative is rewarded

Vast.ai is the market leader for low cost GPU rentals, connecting data centers and professionals with users who need AI compute at prices 3-5X cheaper than traditional cloud providers.

Similar jobs

You might also be interested in...

TM5d

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

CE3d

Kernel Engineer

Cerebras

Sunnyvale, California, United States (On-site)

NV5d

Senior AI Developer Technology Engineer, Financial Sector

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $241.5k Yearly

NV4w

Deep Learning Algorithm Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $241.5k Yearly

NV3w

Senior HPC Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly