1. Home
  2. Jobs
  3. Flash Attention

Flash Attention Jobs

Browse 10 Flash Attention jobs on Inference Jobs.

10 jobs

5dAN

Performance Engineer, GPU

Anthropic

San Francisco, California, United States (Hybrid)$315k – $560k Yearly
5dSC

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site)$218.4k – $273k Yearly
2wTE

Engineer, ML Models

Tenstorrent

Santa Clara, California, United States (Hybrid)$100k – $500k Yearly
4wNV

Deep Learning Software Engineer, FlashInfer - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site)$108k – $195.5k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
5dTA

AI Researcher, Core ML

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wSE

Embedded ML Engineer – Gesture Recognition

Sesame

San Francisco, California, United States (On-site)$175k – $280k Yearly