1. Home
  2. Jobs
  3. Inference Architecture

Inference Architecture jobs

Explore Inference Architecture roles on Inference Jobs and apply today.

41-60 of 863 jobs

XA4w

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

NV6d

Senior Software Developer, AI Networking

NVIDIA

Texas, United States (Remote)

$184k – $356.5k Yearly

NV2w

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)

$248k – $391k Yearly

OP2w

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)

$460k – $555k Yearly

NE1w

Senior Site Reliability Engineer — Token Factory (Inference Platform)

Nebius

Netherlands + 4 more (Remote)

HA2w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

PL2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

NV6d

Senior Manager, Engineering - Enterprise AI and Automation

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

NE5d

Chief ML Researcher, Product

Nebius

United States (Remote)

$200k – $300k Yearly

CE3w

Principal Engineer, AI Inference Reliability

Cerebras

United States + 1 more (Remote)

NV5d

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Cambridge, England, United Kingdom (Hybrid)

XA1w

AI/HPC Network Development Engineer - Networking

xAI

Palo Alto, California, United States (On-site)

CE3w

Engineering Manager, Inference Platform

Cerebras

Sunnyvale, California, United States (On-site)

NE3d

Senior ML Engineer (Token Factory)

Nebius

Netherlands + 4 more (Remote)

XA4w

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

CO3w

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)

$92k – $135k Yearly

OP2w

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

CE3w

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)

NV5d

Software Engineer, TensorRT Specialized Platforms - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $195.5k Yearly

BA2w

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)

$150k – $250k Yearly