Speculative Decoding Jobs
Browse 17 Speculative Decoding jobs on Inference Jobs.
7d
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)
$190k – $270k Yearly
5d
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
1w
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)
$150k – $250k Yearly
1w
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)
$230k – $460k Yearly
5d
Sr. Engineer, SoC Design Verification – AI/ML Accelerator Chiplets
Tenstorrent
Toronto, Ontario, Canada (Hybrid)
2w
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)
$152k – $287.5k Yearly