Inference Stacks Jobs

Browse 428 Inference Stacks jobs on Inference Jobs.

1w

Full-Stack Software Engineer, Inference

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)
2w

AI Infrastructure Engineer - Autonomy

Applied Intuition

Sunnyvale, California, United States (On-site) $153k – $222k Yearly
4w

Agentic AI Solution Engineering Intern - Summer 2026

NVIDIA

Austin, Texas, United States (On-site) $20 – $71 Hourly
2w

Software Engineer - Full Stack, Defense

Applied Intuition

Sunnyvale, California, United States (On-site) $153k – $222k Yearly
3w

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
1w

Software Engineer

Cartesia

San Francisco, California, United States (On-site) $180k – $250k Yearly
2w

Fullstack Engineer, Applied AI

LangChain

San Francisco, California, United States (On-site) $170k – $195k Yearly
2w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
2w

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) $300k – $385k Yearly
1w

Software Engineer - Core Product

Baseten

San Francisco, California, United States (On-site) $150k – $230k Yearly
5d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
1w

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid) $380k – $380k Yearly
2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
5d

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site) $152k – $241.5k Yearly
2d

Software Architect - Fallback Stack

Applied Intuition

Sunnyvale, California, United States (On-site) $145k – $245k Yearly
1w

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)
5d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) $455k – $555k Yearly
5d

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly