Flash Attention Jobs
Explore Flash Attention roles on Inference Jobs and apply today.
3w agoSC
ML Research Engineer, ML Systems
Scale
San Francisco, California, United States (On-site)$218.4K – $273K Yearly
2mo agoSC
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)$252K – $315K Yearly
1w agoAN
Performance Engineer, GPU
Anthropic
San Francisco, California, United States (Hybrid)$280K – $850K Yearly
3mo agoBA
2mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
3mo agoCR
Senior Category Manager, Storage
Crusoe
San Francisco, California, United States (On-site)$177K – $214K Yearly
1mo agoCR
Senior Staff Systems Administrator
Crusoe
San Francisco, California, United States (On-site)$170K – $215K Yearly
3w agoCR
2w agoNV
Senior Deep Learning Software Engineer, Inference
NVIDIA
Netherlands + 1 more (Remote)zł 221.3K – zł 383.5K Yearly
2mo agoNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly