You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The MAE checkpoint is together with the transformer weights included in the checkpoint.
if you want the original MAE weights you can download it from the original MAE codebase.
In the transformer weights, for mae, there are only weights for patch_embed and special_vis_token (and pos_embed), but not the blocks. And in the forward method, you indeed only use patch_embed to encode the patches.
Do you not use the full mae like in udop_dual? This simple projection carries all the information for font, line spacing, color etc etc?
in config:
"mae_checkpoint": "mae_models/mae_pretrain_vit_large_full.pth"
in udop_dual:
But I found no pretiraned weights for mae encoder. Is the pretrained mae encoder weights available now?
Thank you!
The text was updated successfully, but these errors were encountered: