1. Home
  2. Jobs
  3. Inference-Time Compute

Inference-Time Compute jobs

Explore Inference-Time Compute roles on Inference Jobs and apply today.

161-180 of 512 jobs

CE5d

Kernel Engineer

Cerebras

Sunnyvale, California, United States (On-site)

NV3w

Senior Machine Learning Performance Engineer - Physics

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $287.5k Yearly

AI4w

Machine Learning Engineer - Defense

Applied Intuition

Washington, District of Columbia, United States (On-site)

$150k – $225k Yearly

CE3w

Kernel Optimization Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)

NV2w

Senior Deep Learning Compiler Engineer - PyTorch

NVIDIA

Berlin, Berlin, Germany (On-site)

zł 292.5k – zł 507k Yearly

OP2w

Senior Research Engineer/Scientist - Edge, Consumer Products

OpenAI

San Francisco, California, United States (Hybrid)

$380k – $460k Yearly

NV1w

Raytracing Compiler Engineer - Developer and Performance Technology

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

NV2w

Deep Learning Compiler Verification and Infra Development Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

CE6d

Compiler Engineer

Cerebras

Sunnyvale, California, United States (On-site)

CE1w

CoDesign & NextGen - New College Grad

Cerebras

Sunnyvale, California, United States (On-site)

$145k – $155k Yearly

NV2w

Senior Software Engineer - VLM Microservices for Neural Reconstruction

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $287.5k Yearly

D-4d

Software Engineering Intern - Kernels

d-Matrix

Ontario, Canada (Remote)

C$40 – C$70 Hourly

NV5d

Software Engineer – Hardware Dataflow

NVIDIA

Netherlands (Remote)

BA2w

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)

C$130k – C$200k Yearly

OP2w

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid)

$230k – $460k Yearly

TM1w

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

NV1w

GPU Compiler LLVM Backend Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

NV2d

Developer Technology Engineer - AI

NVIDIA

Beijing, Beijing, China (On-site)

CE2w

Senior/Staff- Engineer: Post Silicon- Bring Up

Cerebras

Bengaluru, Karnataka, India (On-site)

$175k – $275k Yearly

TE1w

Sr. Software Engineer, AI Compiler

Tenstorrent

Toronto, Ontario, Canada (Hybrid)

$100k – $500k Yearly