-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support sparsity, target-size and sort_by_length for hstu #62
Conversation
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
0e6c243
to
6114f86
Compare
@manman-ren has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Can we have example output from running the operator? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
@manman-ren has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
@manman-ren has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
@manman-ren merged this pull request in 45d195c. |
Copied over generate_sparse_seq_len
Example output
x_val hstu_triton_ragged_attention-latency
(256, 4, 16384, 2048, 0.8, 20, False) 146.458
(256, 4, 16384, 2048, 0.8, 20, False) 148.616
(256, 4, 16384, 2048, 0.8, 20, False) 145.135
(256, 4, 16384, 2048, 0.8, 20, False) 148.98
(256, 4, 16384, 2048, 0.8, 20, False) 147.167
(256, 4, 16384, 2048, 0.8, 20, False) 146.155
(256, 4, 16384, 2048, 0.8, 20, False) 144.787
(256, 4, 16384, 2048, 0.8, 20, False) 144.055
(256, 4, 16384, 2048, 0.8, 20, False) 144.35
(256, 4, 16384, 2048, 0.8, 20, False) 146.67