FlashAttention Jobs
Browse 9 FlashAttention jobs on Inference Jobs.
9 jobs
3d ago
PE
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site)$300K – $485K Yearly
2w ago
TA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
3w ago
TA
Forward Deployed Engineer (Inference & Post-Training)
Together AI
San Francisco, California, United States (On-site)$270K – $300K Yearly
2w ago
NV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
2w ago
MA
5d ago
BA
GPU Kernel Engineer
Baseten
San Francisco, California, US or Remote (United States)$180K – $360K Yearly