
Models that can be trained from scratch #2395

Answered by rwightman
sinahmr asked this question in Q&A

@sinahmr they are all trainable from scratch with the timm training scripts, but not necessarily with the specific algorithms described in each paper.

For the distilled deit models, no: the distillation code isn't there, though it can be hacked on fairly easily (that's essentially what the official impl is, part timm training code and part their own). The non-distilled models and deit-3 should be reproducible, though.

beit, convnext-v2 (the fcmae part), mae, dino, etc. are the same story: the models will train from scratch with standard cross-entropy (xent) or binary cross-entropy (bce) loss, but I have not included reproductions of their specific unsupervised/semi-supervised learning algorithms.
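
For readers unfamiliar with the shorthand, here is a minimal sketch of the two supervised objectives mentioned above, written in plain Python for a single example. The function names are hypothetical and this is not timm's implementation; it only illustrates how softmax cross-entropy differs from per-class sigmoid binary cross-entropy on the same logits, assuming a one-hot target.

```python
import math


def softmax_cross_entropy(logits, target):
    """Softmax cross-entropy for one example; `target` is a class index."""
    # Subtract the max logit before exponentiating for numerical stability.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


def binary_cross_entropy(logits, target):
    """Mean per-class sigmoid BCE for one example against a one-hot target."""
    total = 0.0
    for i, z in enumerate(logits):
        y = 1.0 if i == target else 0.0
        # Stable form of -[y*log(sigmoid(z)) + (1-y)*log(1-sigmoid(z))].
        total += max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
    return total / len(logits)


if __name__ == "__main__":
    logits = [2.0, -1.0, 0.5]  # illustrative values only
    print(softmax_cross_entropy(logits, target=0))
    print(binary_cross_entropy(logits, target=0))
```

Cross-entropy couples the classes through the softmax, while BCE treats each class as an independent yes/no decision; either one is a purely supervised loss, which is the distinction being drawn from the papers' own pretraining algorithms.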

Answer selected by sinahmr