Skip to content

Added Masking

Compare
Choose a tag to compare
@tatp22 tatp22 released this 16 Jul 11:03
· 29 commits to master since this release

Added masking to the Linformer. However, this is still a WIP, since masking cannot be done in the traditional sense, like what is done in the attention is all you need paper, because there is an overhead of adding another (n,n) matrix, which is infeasable.