Flash Attention Jobs
Browse 10 Flash Attention jobs on Inference Jobs.
10 jobs
5dAN
Performance Engineer, GPU
Anthropic
San Francisco, California, United States (Hybrid)$315k – $560k Yearly
5dSC
ML Research Engineer, ML Systems
Scale
San Francisco, California, United States (On-site)$218.4k – $273k Yearly
2wTE
4wNV
Deep Learning Software Engineer, FlashInfer - New College Grad 2025
NVIDIA
Santa Clara, California, United States (On-site)$108k – $195.5k Yearly
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
5dTA
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
2wSE
Embedded ML Engineer – Gesture Recognition
Sesame
San Francisco, California, United States (On-site)$175k – $280k Yearly