You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In your case specifically, you can use the original Llama 3.2 1B tokenizer.model from https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct (if the unsloth version is based off the instruct model, use the base one otherwise). If unsloth modified any of the special tokens, then you will need a new tokenizer.model.
I don't believe you can load in the tokenizer without the tokenizer.model file, because it contains the BPE encoding itself.
while
tokenizer.model
is required in yaml config, but there are many models that doesn't havetokenizer.model
(example: unsloth/Llama-3.2-1B)In these cases, how can we use
tokenizer.json
ortokenizer_config.json
that are included in almost all model instead oftokenizer.model
?The text was updated successfully, but these errors were encountered: