Inference Libraries Jobs

Browse 251 Inference Libraries jobs on Inference Jobs.

1w

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site) $325k – $490k Yearly
3w

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
3w

Senior Technical Program Manager, Deep Learning Libraries

NVIDIA

Santa Clara, California, United States (On-site) $168k – $322k Yearly
2w

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) $300k – $385k Yearly
5d

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site) $160k – $320k Yearly
5d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
2w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
5d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) $455k – $555k Yearly
1w

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid) $380k – $380k Yearly
5d

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
3w

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid) $92k – $135k Yearly
1w

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
5d

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site) $160k – $320k Yearly
1w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
3w

Software Engineer, Inference Deployment

Anthropic

San Francisco, California, United States (Hybrid) $320k – $485k Yearly