Question for bird-and-animal training #68

trustme4321 · 2024-05-25T02:09:25Z

Hello, I'm interested in Cogview implemenation.
Actually I have problem in executing scripts/pretrain_single_node.sh
After 20,000 training iteration and executing ./scripts/text2image.sh, but there is no exact of shape for bird or animal in the result image. The text is 飞鹰(Flying eagle)

For training, I executed the shell script only changing the variable, NUM_GPUS_PER_WORKER from 8 to 1 since I have only one GPU instance.
I'm using Google Colab(A100 is supported).

NUM_GPUS_PER_WORKER=1
Image tokenizer number of tokens : 8192
The Number of layers : 12
The size of hidden layer : 1024
The number of attention heads : 16

mp_rank_00_model_state.pt is the result file of training and the size of pt(pytorch) file is only 2.76Giga bytes.

Of course, if I use the pretrained model, cogview-base.tar, then the result is all right.

Please check my question and I hope you can give any advise or comment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question for bird-and-animal training #68

Question for bird-and-animal training #68

trustme4321 commented May 25, 2024 •

edited

Loading

Question for bird-and-animal training #68

Question for bird-and-animal training #68

Comments

trustme4321 commented May 25, 2024 • edited Loading

trustme4321 commented May 25, 2024 •

edited

Loading