Home
Jobs
NVIDIA Inference Microservices

NVIDIA Inference Microservices jobs

Explore NVIDIA Inference Microservices roles on Inference Jobs and apply today.

81-100 of 979 jobs

AN1w

Senior/Staff Software Engineer, Inference

Anthropic · New York, New York, United States (Hybrid) · $300k – $485k Yearly

Anthropic

New York, New York, United States (Hybrid)

$300k – $485k Yearly

AN1w

Senior Software Engineer, Inference

Anthropic · Dublin, Dublin, Ireland (Hybrid) · €235k – €295k Yearly

Anthropic

Dublin, Dublin, Ireland (Hybrid)

€235k – €295k Yearly

AI4w

Machine Learning Engineer - Defense

Applied Intuition · Washington, District of Columbia, United States (On-site) · $150k – $225k Yearly

Applied Intuition

Washington, District of Columbia, United States (On-site)

$150k – $225k Yearly

NV2d

Lead Principal Engineer, Enterprise Agentic AI Platform

NVIDIA · Santa Clara, California, United States (On-site) · $272k – $431.3k Yearly

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

CO2w

Product Marketing Manager, CoreWeave Inference

CoreWeave · Livingston, New Jersey, United States (Hybrid) · $143k – $210k Yearly

CoreWeave

Livingston, New Jersey, United States (Hybrid)

$143k – $210k Yearly

NV2w

Director, Software Architecture

NVIDIA · Yokne'am, Northern District, Israel (On-site)

NVIDIA

Yokne'am, Northern District, Israel (On-site)

PE2w

UK Internship Program

Perplexity · London, England, United Kingdom (Hybrid)

Perplexity

London, England, United Kingdom (Hybrid)

AI2w

Senior Software Engineer - ML Infrastructure

Applied Intuition · Sunnyvale, California, United States (On-site) · $153k – $222k Yearly

Applied Intuition

Sunnyvale, California, United States (On-site)

$153k – $222k Yearly

NV4w

Senior HPC and AI Networking Performance Research and Analysis Engineer

NVIDIA · Shanghai, Shanghai, China (On-site)

NVIDIA

Shanghai, Shanghai, China (On-site)

CE2w

Performance Engineer - Inference

Cerebras · Toronto, Ontario, Canada (On-site)

Cerebras

Toronto, Ontario, Canada (On-site)

CE1w

Senior Research Engineer - Inference ML

Cerebras · Sunnyvale, California, United States (Hybrid)

Cerebras

Sunnyvale, California, United States (Hybrid)

NV6d

Senior Software Engineer, AI Inference Systems

NVIDIA · Toronto, Ontario, Canada (Hybrid) · C$170k – C$275k Yearly

NVIDIA

Toronto, Ontario, Canada (Hybrid)

C$170k – C$275k Yearly

TM1w

Research Engineer, Infrastructure, Inference

Thinking Machines Lab · San Francisco, California, United States (On-site) · $350k – $475k Yearly

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

NE1w

ML Engineer, Large Language Models (LLM Training & Inference Optimization)

Nebius · Europe (Remote)

Nebius

Europe (Remote)

AN1w

Staff Software Engineer, Inference

Anthropic · Dublin, County Dublin, Ireland (Hybrid) · €295k – €355k Yearly

Anthropic

Dublin, County Dublin, Ireland (Hybrid)

€295k – €355k Yearly

NV2d

Senior Software Engineer, Quantized Inference

NVIDIA · Redmond, Washington, United States (On-site) · $152k – $287.5k Yearly

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

NE2w

Senior ML Engineer (Token Factory)

Nebius · Amsterdam, North Holland, Netherlands (On-site)

Nebius

Amsterdam, North Holland, Netherlands (On-site)

NV5d

Senior Compiler Engineer, AI Inference Platforms

NVIDIA · Santa Clara, California, United States (On-site) · $152k – $241.5k Yearly

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

D-2w

ML Compiler Architect, Senior Principal

d-Matrix · Toronto, Ontario, Canada (Hybrid)

d-Matrix

Toronto, Ontario, Canada (Hybrid)

OP2w

Software Engineer, Load Balancing - Inference

OpenAI · San Francisco, California, United States (On-site) · $325k – $490k Yearly

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

Inference Jobs

Senior/Staff Software Engineer, Inference

Senior Software Engineer, Inference

Machine Learning Engineer - Defense

Lead Principal Engineer, Enterprise Agentic AI Platform

Product Marketing Manager, CoreWeave Inference

Director, Software Architecture

UK Internship Program

Senior Software Engineer - ML Infrastructure

Senior HPC and AI Networking Performance Research and Analysis Engineer

Performance Engineer - Inference

Senior Research Engineer - Inference ML

Senior Software Engineer, AI Inference Systems

Research Engineer, Infrastructure, Inference

ML Engineer, Large Language Models (LLM Training & Inference Optimization)

Staff Software Engineer, Inference

Senior Software Engineer, Quantized Inference

Senior ML Engineer (Token Factory)

Senior Compiler Engineer, AI Inference Platforms

ML Compiler Architect, Senior Principal

Software Engineer, Load Balancing - Inference

Related searches