1. Home
  2. Jobs
  3. Inference Architecture

Inference Architecture Jobs

Browse 866 Inference Architecture jobs on Inference Jobs.

81-100 of 866 jobs

2dNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
5dCR

Research Engineer

Crusoe

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
7dTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
7dNV

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
7dCO

Senior Software Engineer II, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$165k – $242k Yearly
2wBA

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)$220k – $285k Yearly
7dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
6dAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
2wCO

Product Marketing Manager, CoreWeave Inference

CoreWeave

Livingston, New Jersey, United States (Hybrid)$143k – $210k Yearly
4wNV

Product Manager - BioNeMo Inference

NVIDIA

New York, New York, United States (On-site)$168k – $258.8k Yearly
2wOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
2wNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184k – $356.5k Yearly
4dCO

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$139k – $204k Yearly
2wNE

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)
2wOP

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
7dVA

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)$160k – $320k Yearly
6dNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Toronto, Ontario, Canada (Hybrid)C$170k – C$275k Yearly
1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly