Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About input formats for training and inference #25

Open
wen020 opened this issue May 27, 2024 · 2 comments
Open

About input formats for training and inference #25

wen020 opened this issue May 27, 2024 · 2 comments

Comments

@wen020
Copy link

wen020 commented May 27, 2024

Anygpt is trained only with the Next Token Prediction task.
Take text to image as an example,Is the training input speech tokens text tokens image tokens music tokens?
I want to know the input formats for training and inference.
training input :<sos> speech tokens <eos> text tokens <soi> image tokens <eoi> <som> music tokens,
training label :speech tokens <eos> text tokens <soi> image tokens <eoi> <som> music tokens <eom>. Is my understanding correct about training input and label?

@JunZhan2000
Copy link
Collaborator

No, the content of the training depends on what data and what task we use. For example, for text to image conversion, the data will only contain text tokens image tokens . This is just an example. The actual template used can be found in the paper.

@JunZhan2000
Copy link
Collaborator

Hello, we provide some training data samples and related descriptions, please refer to https://github.com/OpenMOSS/AnyGPT?tab=readme-ov-file#pretraining-and-sft

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants