v0.1.1

@kuprel released this 30 Jun 11:02
· 296 commits to main since this release

Important Bug Fixes

  • Image tokens were mistakenly computed twice in the command line script when using torch.
  • The tokenizer did not work correctly on some machines (e.g. Windows). Files are now read with UTF-8 encoding.
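
To illustrate the encoding fix, here is a minimal sketch of reading a tokenizer vocabulary with an explicit encoding. The function name `load_vocab` and the file name `vocab.json` are assumptions for illustration, not necessarily the names used in this repository. Without `encoding="utf-8"`, Python's `open` falls back to the platform default, which on Windows is often a legacy code page that cannot decode non-ASCII tokens.

```python
import json
import os
import tempfile


def load_vocab(path: str) -> dict:
    # Hypothetical helper: pass the encoding explicitly so the result
    # does not depend on the platform's default text encoding.
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


# Demo with a throwaway file containing non-ASCII tokens.
with tempfile.TemporaryDirectory() as tmp:
    vocab_path = os.path.join(tmp, "vocab.json")  # assumed file name
    with open(vocab_path, "w", encoding="utf-8") as f:
        json.dump({"café": 0, "naïve": 1}, f, ensure_ascii=False)
    vocab = load_vocab(vocab_path)
    print(sorted(vocab))
```

The same argument applies to any other text file the tokenizer reads, such as a merges list.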

New Features

  • The is_expendable argument reduces memory usage for the command line script by loading the encoder, decoder, and detokenizer only when needed and unloading each one afterwards.
  • A simpler 4D attention_state replaces the 5D keys_values_state, with faster inference time.
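
The load-then-unload idea behind is_expendable can be sketched roughly as follows. This is not the repository's implementation: the `LazyStage` class and the stand-in builders are hypothetical, and plain Python callables take the place of torch modules so that the example runs anywhere.

```python
from typing import Any, Callable, Optional


class LazyStage:
    """Hypothetical wrapper that builds one pipeline stage on first use."""

    def __init__(self, build: Callable[[], Callable[[Any], Any]], is_expendable: bool):
        self.build = build
        self.is_expendable = is_expendable
        self.model: Optional[Callable[[Any], Any]] = None

    def __call__(self, x: Any) -> Any:
        if self.model is None:
            # Load only when the stage is actually needed.
            self.model = self.build()
        out = self.model(x)
        if self.is_expendable:
            # Drop the reference so the memory can be reclaimed. With torch on
            # a GPU one would typically also call torch.cuda.empty_cache().
            self.model = None
        return out


# Stand-in builders for the encoder and decoder (assumptions for the demo).
encoder = LazyStage(lambda: (lambda text: [ord(c) for c in text]), is_expendable=True)
decoder = LazyStage(lambda: (lambda tokens: [t % 16 for t in tokens]), is_expendable=True)

image_tokens = decoder(encoder("hi"))
print(image_tokens, encoder.model is None)
```

The trade-off is that an expendable stage is rebuilt on every call, so this pattern suits a one-shot command line run better than a long-lived process that generates many images.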