You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
Thank you very much for sharing the code for this amazing work.
I have been trying to reproduce and evaluate doc cls. I noticed that you encode the class names and those represent your labels. However, this creates sequences of 2-3 tokens. How do you use it to evaluate the accuracy?
Thank you very much
The text was updated successfully, but these errors were encountered:
ofir1080
changed the title
Document Classification Mertic
Document Classification Mertic (UDOP)
Jul 31, 2023
I manually decoded and mapped the result string to class labels (which does not seems intuitive). However, the results where sometimes close but not exact, here's some examples which the model "failed":
for example, if scientificwritten results in 2-3 tokens, then the evaluation will be exact match of these tokens. the model should be able to predict all the subtokens and will evaluates to correct if all tokens match
Hello,
Thank you very much for sharing the code for this amazing work.
I have been trying to reproduce and evaluate doc cls. I noticed that you encode the class names and those represent your labels. However, this creates sequences of 2-3 tokens. How do you use it to evaluate the accuracy?
Thank you very much
The text was updated successfully, but these errors were encountered: