WIP
A tiny flash attention implement in python, rust, cuda and c for learning purpose.
- python version
- triton version
- [c version]
- [rust version]
my env: cutlass v3.4, torch 1.14, cuda 12.4
WIP
A tiny flash attention implement in python, rust, cuda and c for learning purpose.
my env: cutlass v3.4, torch 1.14, cuda 12.4