Benchmarking for the different algorithms #26
Comments
@assem-ch, it will be a lot of work for one person. I see many separate reports for different words, and it is not practical to follow each one manually. A stemmer will never be 100% perfect compared with a method learned manually by a human.
Btw, not all release packages mention their version.
@assem-ch If you think the Arabic stemmer is valuable enough for Alfanous and for many other Arabic projects that I should put considerable time into it, I will see whether to start a separate project for a stop-word list and a derivatives list, built through crowd-sourced verification, to get high-quality test data for the stemmer. I see only a few Arabic stop-word lists, and they are basically the efforts of a few people. It all depends on how to reach the interested and effective Arab community.
@sneetsher There is already a project for test data, https://github.com/ibnmalik/golden-corpus-arabic; it can be exposed alongside the stemmer to collect new suggestions from users. This is my PhD project and I should really focus on it... I will work on a demo/review web app for it to welcome feedback and improve visibility. For Alfanous it is not that valuable, but it fills a gap: stemming words that do not appear in the Quran in that exact form but do appear in other forms.
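As a rough illustration of the benchmarking idea in this issue, here is a minimal sketch of scoring several stemmers against the same gold test data. Everything in it is an assumption made for illustration: the `GOLD` pairs are invented examples (not taken from golden-corpus-arabic, whose actual format is not described in this thread), and `strip_article` and `identity` are toy stand-ins, not the Alfanous stemmer or any real algorithm.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical gold data: (word, expected stem) pairs, invented for illustration.
GOLD: List[Tuple[str, str]] = [
    ("الكتاب", "كتاب"),
    ("كتاب", "كتاب"),
    ("والقلم", "قلم"),
]


def identity(word: str) -> str:
    """Baseline 'stemmer' that returns the word unchanged."""
    return word


def strip_article(word: str) -> str:
    """Toy stemmer: removes only a leading definite article 'ال'."""
    if word.startswith("ال") and len(word) > 3:
        return word[2:]
    return word


def accuracy(stem: Callable[[str], str], gold: List[Tuple[str, str]]) -> float:
    """Fraction of gold words whose predicted stem matches the expected stem."""
    if not gold:
        return 0.0
    hits = sum(1 for word, expected in gold if stem(word) == expected)
    return hits / len(gold)


def benchmark(
    stemmers: Dict[str, Callable[[str], str]],
    gold: List[Tuple[str, str]],
) -> Dict[str, float]:
    """Score every stemmer on the same gold data so results are comparable."""
    return {name: accuracy(fn, gold) for name, fn in stemmers.items()}


RESULTS = benchmark({"identity": identity, "strip_article": strip_article}, GOLD)

if __name__ == "__main__":
    for name, score in RESULTS.items():
        print(f"{name}: {score:.2f}")
```

With a shared gold list like this, each newly verified word from users becomes one more test pair, and every algorithm is re-scored on the same data instead of following individual word reports by hand.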
No description provided.