AnyGPT is trained only with the next-token prediction task.
Taking text-to-image as an example, is the training input made up of speech tokens, text tokens, image tokens, and music tokens together?
I would like to know the input formats for training and inference.
Training input: <sos> speech tokens <eos> text tokens <soi> image tokens <eoi> <som> music tokens
Training label: speech tokens <eos> text tokens <soi> image tokens <eoi> <som> music tokens <eom>
Is my understanding of the training input and label correct?
No, the content of a training sequence depends on the data and the task being used. For example, for text-to-image generation, the data will contain only text tokens and image tokens. This is just an example; the actual template used can be found in the paper.
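To make the input/label relationship concrete, here is a minimal sketch of how a next-token prediction pair could be built for a text-to-image sample. It is not the AnyGPT training code: the function name `build_example`, the placeholder token strings, and the placement of `<soi>`/`<eoi>` around the image tokens are assumptions taken from the notation in the question above. The actual template is the one given in the paper.

```python
def build_example(text_tokens, image_tokens, soi="<soi>", eoi="<eoi>"):
    """Build (inputs, labels) for next-token prediction.

    Hypothetical helper: the <soi>/<eoi> wrapping follows the notation
    used in the question, not a confirmed AnyGPT template.
    """
    # Only the modalities needed for the task appear in the sequence.
    sequence = list(text_tokens) + [soi] + list(image_tokens) + [eoi]
    # The label at position i is the token at position i + 1.
    inputs = sequence[:-1]
    labels = sequence[1:]
    return inputs, labels


# Placeholder tokens; real data would use tokenizer IDs.
inputs, labels = build_example(["a", "red", "cat"], ["img_17", "img_4"])
for inp, lab in zip(inputs, labels):
    print(f"{inp:>8} -> {lab}")
```

The labels are the same sequence shifted left by one position, so no speech or music tokens are involved unless the task's data contains them. At inference time, under the same assumptions, the prompt would be the text tokens alone, and the model would generate the image tokens one at a time until it emits the end-of-image token.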