Releases: tatp22/linformer-pytorch
Latest working version
Added intermediate dim change
Added intermediate ff dimension
The model dimension can now differ in the intermediate layers. This change applies to the feed-forward (ff) module, and only in the encoder. If the `ff_intermediate` flag is not `None`, the layers will look like this:
channels -> ff_dim -> ff_intermediate (For layer 1)
ff_intermediate -> ff_dim -> ff_intermediate (For layers 2 to depth-1)
ff_intermediate -> ff_dim -> channels (For layer depth)
As opposed to
channels -> ff_dim -> channels (For all layers)
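The dimension flow above can be sketched as a small plain-Python helper. Note that `ff_intermediate` comes from this release, but the helper itself (its name and signature) is illustrative, not part of the library's API:

```python
def layer_dims(channels, ff_dim, ff_intermediate, depth):
    """Return the (input, hidden, output) widths of each encoder layer's
    feed-forward block when ff_intermediate is set. Only the first layer
    reads the original channel width, and only the last writes it back;
    every layer in between stays at the intermediate width."""
    dims = []
    for layer in range(1, depth + 1):
        in_dim = channels if layer == 1 else ff_intermediate
        out_dim = channels if layer == depth else ff_intermediate
        dims.append((in_dim, ff_dim, out_dim))
    return dims

# For example, channels=64, ff_dim=128, ff_intermediate=96, depth=3 gives:
# [(64, 128, 96), (96, 128, 96), (96, 128, 64)]
```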
Able to use convolutional nets instead of linear
The Linformer now supports convolution as a way to downsample the input, instead of relying on linear layers. This may reduce the number of parameters required.
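To see where the parameter savings could come from, compare the two downsampling schemes for a single channel. A linear projection from sequence length `n` to `k` needs a full `n × k` matrix, while a 1-D convolution with kernel size and stride `n // k` reuses one small kernel across all output positions. This is a back-of-the-envelope sketch under that assumption, not the library's exact implementation:

```python
def downsample_params(n, k):
    """Weights needed to project a length-n sequence down to length k,
    per channel: a dense (n x k) matrix for the linear projection versus
    a single shared kernel of size n // k for the strided convolution."""
    linear = n * k          # one weight per (input, output) position pair
    conv = n // k           # one kernel, shared across all k output positions
    return linear, conv

# n=512, k=128: the linear projection needs 65536 weights, the conv only 4
```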
Encoder Decoder finished, Causal attention
Finished the encoder and decoder modules. Causal attention also works when the `causal=True`
flag is set. Will update the README shortly...
Added Masking
Added masking to the Linformer. However, this is still a WIP: masking cannot be done in the traditional sense, as in the "Attention Is All You Need" paper, because that would require the overhead of another `(n, n)`
matrix, which is infeasible.
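The infeasibility is easy to quantify: Linformer projects keys and values from length `n` down to `k`, so attention scores have shape `(n, k)`, and a traditional `(n, n)` mask would reintroduce exactly the quadratic cost the model avoids. A minimal sketch of the entry counts (the function is illustrative, not from the repo):

```python
def mask_entries(n, k=None):
    """Entries in a full (n, n) attention mask versus the (n, k) score
    matrix Linformer actually materializes after projecting keys/values."""
    return n * n if k is None else n * k

n = 16384
full = mask_entries(n)          # 268435456 entries: back to O(n^2) memory
projected = mask_entries(n, 256)  # 4194304 entries: the O(n*k) shape kept
```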
Started Encoder/Decoder work
The repo now supports an encoder and a decoder.
TODO: Masking
Bug fixed
LM model
Rebase, added option to plot MHAttention heads
Rebased the code for readability, and added the option to plot the attention heads of the
`MHAttention` module as well as the Linformer module.
No weight matrices in `LinearAttentionHead`
Check out pull request #7 to see the changes