-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about SkipBERT #1
Comments
Thank for your interests in our work!
A: The input tri-grams need to be cached before seeing speed-up when Regarding 'plot_mode':
A: The code for training SkipBERT is under general_distillation. We use distillation to train SkipBERT, but it should be feasible to train with MLM objective or other pretraining scheme.
A: We are so sorry for the mistake. It was supposed be released before. We will upload it again quickly. |
Thank you for the information! In regards to training our own SkipBERT models, I'm assuming we are supposed to use the Additionally, is it possible to calculate OOV at inference and see what percentage of trigrams encountered are OOV? |
Hello, I was interested in experimenting with the SkipBERT architecture. I was wondering if you could help with a few questions about implementation:
Thank you for the help!
The text was updated successfully, but these errors were encountered: