When training tta with bert-base config and sequence length 512, got NaN #5

Open
yyht opened this issue Jan 13, 2021 · 3 comments
yyht commented Jan 13, 2021

hi, I am trying to train bert-base using tta for Chinese, and it got NaN after 1000 optimization steps. I am wondering if you could give me some advice.

joongbo (Owner) commented Jan 14, 2021

hi, there was no problem when I tried to train tta with the bert-base config (for English).
did you try to train tta for English with the bert-base config and get the same problem?

anyway, for a different language with a different vocabulary, you should modify one line in modeling.py.
at line 161 in that file, 4 is the dummy token id of [MASK].
so if you change the vocabulary, you should match this number with the id of "[MASK]" in your vocabulary for dummy_ids.
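for example, a rough sketch of how you might look up that id from your vocab file (the `find_mask_id` helper and the `vocab.txt` path are just placeholders, not code from this repository):

```python
# rough sketch: find the id of "[MASK]" in a WordPiece vocab.txt
# (one token per line, line number = token id), so the hard-coded 4
# used for dummy_ids at modeling.py line 161 can be replaced when
# switching to a different vocabulary.
def find_mask_id(vocab_file, mask_token="[MASK]"):
    with open(vocab_file, encoding="utf-8") as f:
        for token_id, line in enumerate(f):
            if line.rstrip("\n") == mask_token:
                return token_id
    raise ValueError(f"{mask_token} not found in {vocab_file}")

mask_id = find_mask_id("vocab.txt")
print(mask_id)  # e.g. 103 in Google's Chinese BERT vocab
```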

lastly, in my experience, examining data (pre)processing again would be helpful.

if you have any further problems, please feel free to ask me again!

thanks :)

yyht (Author) commented Jan 14, 2021

thanks for your help. I made some mistakes with the hyperparameters, and it now runs normally.
Since you have done some experiments with the bert-base config, I am wondering whether tta could achieve better results on English data such as GLUE, and on sentence reranking for NMT and ASR?

joongbo (Owner) commented Jan 14, 2021

unfortunately, it has not been tested on any specific tasks yet. due to the lack of computing resources in my lab, I had to use a much smaller batch size (less than 10, I think) for pre-training tta with the bert-base config. so I tried, but did not complete, training tta with that config.
