Loading Pre-trained Models #6

arjung128 · 2019-06-11T21:14:03Z

I’m not entirely sure how train.py loads pre-trained models. PyTorch’s documentation recommends torch.load() together with the load_state_dict() method. I see load_state_dict() called on the optimizer, but neither call is used for the model or dp_model variables.
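For reference, the documented PyTorch pattern mentioned above looks roughly like the sketch below. The tiny `nn.Linear` model, the SGD optimizer, and the file names are hypothetical placeholders, not code taken from train.py.

```python
import os
import tempfile

import torch
import torch.nn as nn

# Hypothetical stand-ins: a tiny model and optimizer, not the ones in train.py.
model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# Placeholder checkpoint locations in a temporary directory.
ckpt_dir = tempfile.mkdtemp()
model_path = os.path.join(ckpt_dir, "model.pth")
optim_path = os.path.join(ckpt_dir, "optimizer.pth")

# Saving: serialize only the state dicts (parameters and optimizer state).
torch.save(model.state_dict(), model_path)
torch.save(optimizer.state_dict(), optim_path)

# Loading: build fresh objects first, then copy the saved state into them.
restored_model = nn.Linear(4, 2)
restored_optimizer = torch.optim.SGD(restored_model.parameters(), lr=0.1)
restored_model.load_state_dict(torch.load(model_path))
restored_optimizer.load_state_dict(torch.load(optim_path))

# Check that every restored tensor equals the original one.
weights_match = all(
    torch.equal(a, b)
    for a, b in zip(model.state_dict().values(),
                    restored_model.state_dict().values())
)
```

Note that load_state_dict() is a method on the module or optimizer instance, so a search for torch.load_state_dict would not find it; searching for .load_state_dict( across the repository may turn up where the model weights are restored.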

I also see infos = cPickle.load(f) and histories = cPickle.load(f), which resemble torch.load(), but the infos and histories variables don’t seem to influence the model or dp_model variables. How are the weights loaded into model or dp_model?
