NVIDIA Inference Microservices jobs

Explore NVIDIA Inference Microservices roles on Inference Jobs and apply today.

81-100 of 979 jobs

1w ago

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)

$300k – $485k Yearly

1w ago

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid)

€235k – €295k Yearly

4w ago

Machine Learning Engineer - Defense

Applied Intuition

Washington, District of Columbia, United States (On-site)

$150k – $225k Yearly

2d ago

Lead Principal Engineer, Enterprise Agentic AI Platform

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

2w ago

Product Marketing Manager, CoreWeave Inference

CoreWeave

Livingston, New Jersey, United States (Hybrid)

$143k – $210k Yearly

2w ago

Director, Software Architecture

NVIDIA

Yokne'am, Northern District, Israel (On-site)

2w ago

UK Internship Program

Perplexity

London, England, United Kingdom (Hybrid)

2w ago

Senior Software Engineer - ML Infrastructure

Applied Intuition

Sunnyvale, California, United States (On-site)

$153k – $222k Yearly

4w ago

Senior HPC and AI Networking Performance Research and Analysis Engineer

NVIDIA

Shanghai, Shanghai, China (On-site)

2w ago

Performance Engineer - Inference

Cerebras

Toronto, Ontario, Canada (On-site)

1w ago

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)

6d ago

Senior Software Engineer, AI Inference Systems

NVIDIA

Toronto, Ontario, Canada (Hybrid)

C$170k – C$275k Yearly

1w ago

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

1w ago

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)

€295k – €355k Yearly

2d ago

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

2w ago

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)

5d ago

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

2w ago

ML Compiler Architect, Senior Principal

d-Matrix

Toronto, Ontario, Canada (Hybrid)

2w ago

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly