v0.1.1
Important Bug Fixes
- Image tokens were mistakenly being computed twice in command line script when using torch
- Tokenizer was not working correctly on some machines previously (e.g. windows). Files are now read with ut8-encoding.
New Features
is_expendable
argument reduces memory usage for command line script by loading then unloading encoder/decoder/detokenizer when needed- simpler 4D
attention_state
replacing 5Dkeys_values_state
and faster inference time