I’m not entirely sure how `train.py` loads pre-trained models. PyTorch’s documentation recommends `torch.load()` together with `load_state_dict()`. I see `load_state_dict()` used for the optimizer, but I don’t see either call used for the main `model` or `dp_model` variables.
I also see `infos = cPickle.load(f)` and `histories = cPickle.load(f)`, which seem to resemble `torch.load()`, but the `infos` and `histories` variables don’t seem to be used to influence the `model` or `dp_model` variables. How are the weights loaded into the `model` or `dp_model` variables?
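For reference, here is a minimal sketch of the pattern I expected to find somewhere, assuming the weights are stored as a state dict. The `build_model` helper, the tiny `nn.Linear` model, and the file names `model.pth` and `infos.pkl` are all hypothetical stand-ins, not taken from `train.py`. It shows that the pickle call only restores bookkeeping data, while the weights need a separate `load_state_dict()` call.

```python
import os
import pickle
import tempfile

import torch
import torch.nn as nn


def build_model():
    # Hypothetical stand-in for the real model; not the architecture in train.py.
    return nn.Linear(4, 2)


checkpoint_dir = tempfile.mkdtemp()
model_path = os.path.join(checkpoint_dir, "model.pth")  # hypothetical file name
infos_path = os.path.join(checkpoint_dir, "infos.pkl")  # hypothetical file name

# Saving: weights go through torch.save, bookkeeping goes through pickle.
trained = build_model()
torch.save(trained.state_dict(), model_path)
with open(infos_path, "wb") as f:
    pickle.dump({"iter": 1000, "epoch": 3}, f)

# Resuming: pickle.load only restores the bookkeeping dict ...
with open(infos_path, "rb") as f:
    infos = pickle.load(f)

# ... while the weights are restored by a separate load_state_dict() call.
model = build_model()
state_dict = torch.load(model_path)
model.load_state_dict(state_dict)
weights_match = torch.equal(model.weight, trained.weight)

# DataParallel wraps the same module object, so a wrapper created from
# `model` shares the weights that were just loaded.
dp_model = nn.DataParallel(model)
same_module = dp_model.module is model
```

If something like this happens inside a helper that constructs the model, that would explain why I can’t find the call in `train.py` itself.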