-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integration with sklearn #29
Comments
I'm not against it at all. I guess we would need to evaluate on arbitrary grid points, in order to evaluate a score on the data. But FFTKDE generates equidistant grid points and throws away the original data. This would mean doing high-dimensional interpolation. I haven't looked into how hard or easy that might be, or if there is some other clever way of doing it. |
Agree that the scoring function may be non-trivial. But adding all other functionality would allow users to write their own scoring functions, which may be reasonable for some cases. |
Agreed. The sklearn API is very nice in it's own right, and a lot of people are ac customized to it and to some degree expect estimators to follow the API. I imitated it, but the APIs differ a little bit. I'll leave the issue open. Up for grabs for anyone. I might look into it when (if?) I have time in the future. |
Evaluating the density of arbitrary points would be absolutely a nice feature. |
Hi In the past I tried implementing an integration directly with
The first problem could be tolerated, but I couldn't think of a solution for the second problem, and doesn't look like |
might be of interest: scikit-learn/scikit-learn#26896 |
Thanks for this push and the amazingly fast library! I would double down on this and claim that the evaluation on arbitrary points is the only use-case (apart of pretty plots).
I fail to see what this has to do with the evaluation? We don't need this at all, the evaluation should happen on new, arbitrary points whose values are determined by the interpolation of the grid points that the KDE holds, AFAIU, right? |
Hi
Do you plan to add support for integration with sklearn, to use e.g. cross-validation for bandwidth selection? Would be great to do something like this with KDEpy.
This would need the
estimator.get_params()
, etc. functions required from sklearn.Ideally there would also be a
estimator.score()
function.The text was updated successfully, but these errors were encountered: