Micro-batching jobs

Explore Micro-batching roles on Inference Jobs and apply today.

41-60 of 60 jobs

Posted 1w ago

Senior GPU Low Power Architect

NVIDIA

Santa Clara, California, United States (On-site)

$136k – $264.5k Yearly

Posted 1w ago

Machine Learning Research Engineer, GenAI Applied ML

Scale

San Francisco, California, United States (On-site)

$176k – $220k Yearly

Posted 2w ago

Growth - Emails, Notifications and Lifecycle

OpenAI

New York, New York, United States (Hybrid)

$265k Yearly

Posted 1w ago

Formal Verification Lead

Tenstorrent

Santa Clara, California, United States (Hybrid)

$100k – $500k Yearly

Posted 2w ago

Senior ML Systems Engineer, Frameworks & Tooling

Cohere

London, England, United Kingdom (On-site)

Posted 2w ago

Software Engineer, Data Acquisition

OpenAI

San Francisco, California, United States (On-site)

$325k – $405k Yearly

Posted 5d ago

Strategy & Ops Engineer

HappyRobot

Madrid, Madrid, Spain or Remote (Spain)

Posted 2w ago

Software Engineer (MES)

Periodic Labs

Menlo Park, California, United States (On-site)

Posted 1w ago

Senior Performance Verification Engineer

NVIDIA

Santa Clara, California, United States (On-site)

$136k – $264.5k Yearly

Posted 2d ago

Principal GPU Memory Architect

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

Posted 1w ago

ASIC Design Engineer, BOOT, Functional Safety and Power Management

NVIDIA

Bengaluru, Karnataka, India (Hybrid)

Posted 2w ago

Distributed Training Engineer

Periodic Labs

Menlo Park, California, United States (Hybrid)

Posted 2w ago

Senior Web Infrastructure Engineer

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $287.5k Yearly

Posted 2w ago

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site)

$255k – $405k Yearly

Posted 2d ago

Senior Web Infrastructure Engineer

NVIDIA

Shanghai, Shanghai, China (On-site)

Posted 2w ago

Senior Verification Engineer, Chip Design

NVIDIA

Yokne'am, Northern District, Israel (On-site)

Posted 4w ago

Member of Engineering (Pre-training / Data Engineering)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)

Posted 1w ago

ML Runtime Optimization Engineer

Applied Intuition

Mountain View, California, United States (On-site)

$159.1k – $199.3k Yearly

Posted 1w ago

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site)

$315k – $560k Yearly

Posted 2w ago

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly