1. Home
  2. Jobs
  3. Speculative Decoding

Speculative Decoding Jobs

Explore Speculative Decoding roles on Inference Jobs and apply today.

2mo agoTA

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)$190K – $270K Yearly
2mo agoTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoHA
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
2mo agoXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180K – $440K Yearly
3w agoTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
3w agoET
3w agoTA

AI Researcher, Core ML

Together AI

San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoPL
2mo agoDE

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)$300K – $430K Yearly
3mo agoBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150K – $230K Yearly
3mo agoBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150K – $250K Yearly
3mo agoOP

Software Engineer, Real Time

OpenAI

Seattle, Washington, United States (On-site)$255K – $405K Yearly
3mo agoNV

Senior Solutions Architect, Robotics Engineering

NVIDIA

Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
2mo agoNV
2mo agoNV
3mo agoOP

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid)$230K – $460K Yearly