Question about two forloops in wkv6_cuda.cu #48

Seeker98 · 2024-12-22T03:02:48Z

i notice that there are two for loops in https://github.com/OpenGVLab/Vision-RWKV/blob/master/classification/mmcls_custom/models/backbones/cuda_v6/wkv6_cuda.cu line 23-57. what is the purpose of the first loop? i compared with RWKV-LM's cuda files but find no ideas.

Seeker98 · 2024-12-22T03:32:20Z

some other questions about wkv6_cuda: i observed nan errors and change all exp calls to __expf(-__expf(**)), and the model seemed to perform well (but i honestly dont know why i changed to this, perhaps intuition?).
I'm currently using rwkv with a full-length pixel sequence, and found that T_MAX must greaterequal than total pixel numbers in an image, smaller will cause cuda error: illegal memory access. but larger T_MAX leads to immediate oom, even if the model itself is really small(<3m params)... (my card: rtx 40901)
Any advices to address these? appreciate for ur help.

c1ircle · 2025-01-03T12:47:33Z

Same question about two loops

Seeker98 changed the title ~~Question about two forloops in wk~~ Question about two forloops in wkv6_cuda.cu Dec 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about two forloops in wkv6_cuda.cu #48

Question about two forloops in wkv6_cuda.cu #48

Seeker98 commented Dec 22, 2024

Seeker98 commented Dec 22, 2024

c1ircle commented Jan 3, 2025

Question about two forloops in wkv6_cuda.cu #48

Question about two forloops in wkv6_cuda.cu #48

Comments

Seeker98 commented Dec 22, 2024

Seeker98 commented Dec 22, 2024

c1ircle commented Jan 3, 2025