1. Home
  2. Jobs
  3. Speculative Decoding

Speculative Decoding Jobs

Browse 17 Speculative Decoding jobs on Inference Jobs.

17 jobs

7dTA

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)$190k – $270k Yearly
5dTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
3dCR

Research Engineer

Crusoe

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
1wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
1wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
5dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
1wOP

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid)$230k – $460k Yearly
2wMA

Research Scientist - Audio Codec

Mirelo AI

Berlin, Berlin, Germany (Hybrid)
2wTE

CPU Design Verification Technical Lead

Tenstorrent

United States (Hybrid)$100k – $500k Yearly
7dNV

DFT Verification Engineer

NVIDIA

Yokneam Ilit, Northern District, Israel (On-site)
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
5dTE

CPU Architect, Load-Store

Tenstorrent

United States (Remote)$100k – $500k Yearly