Inference Architecture Jobs

Browse 874 Inference Architecture jobs on Inference Jobs.

2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) · $210k – $385k Yearly
2w

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) · $300k – $385k Yearly
6d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) · $455k – $555k Yearly
3w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
6d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) · $160k – $230k Yearly
2w

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site) · $200k – $391k Yearly
2w

Senior Deep Learning Performance Architect

NVIDIA

California, United States (Hybrid) · $152k – $287.5k Yearly
2w

Director, Software Architecture

NVIDIA

Yokne'am, Northern District, Israel (On-site)
2w

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2w

Software Architect, Advanced Development

NVIDIA

Yokne'am, Northern District, Israel (On-site)
4d

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site) · $152k – $241.5k Yearly
3w

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site) · $184k – $287.5k Yearly
4d

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site) · $152k – $241.5k Yearly
2w

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid) · $152k – $287.5k Yearly
2w

Solutions Architect - Kubernetes

CoreWeave

London, England, United Kingdom (Hybrid) · £98k – £130k Yearly
2w

Senior Software Research Architect, AI Networking

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
3d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) · $300k – $430k Yearly