Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

step_poisson_matric_factorization() #205

Open
EmilHvitfeldt opened this issue Oct 1, 2023 · 3 comments
Open

step_poisson_matric_factorization() #205

EmilHvitfeldt opened this issue Oct 1, 2023 · 3 comments
Labels
feature a feature request or enhancement

Comments

@EmilHvitfeldt
Copy link
Member

https://cran.r-project.org/web/packages/poismf/index.html

@EmilHvitfeldt EmilHvitfeldt added the feature a feature request or enhancement label Oct 1, 2023
@topepo
Copy link
Member

topepo commented Oct 2, 2023

@jrosell
Copy link

jrosell commented Dec 10, 2024

I read about Gamma-Poisson factorization for single categorical columns in Patricio Cerda, Gaël Varoquaux. Encoding high-cardinality string categorical variables. 2019.
(analogous to solving the following non-negative matrix factorization (NMF) with the generalized Kullback-Leibler divergence)

The paper includes an interesting online algorithm.
Image

Does it make sense to use {reticulate} with this python implementation?
https://skrub-data.org/stable/reference/generated/skrub.GapEncoder.html

@EmilHvitfeldt
Copy link
Member Author

without having looked at the documentation, i lean on the side of translating the method to R rather than using {reticulate}. Purely on the basis of developer burden. Using {reticulate} in a package is already not the best experience, and then you have to worry about breaking changes from the python implementation etc etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature a feature request or enhancement
Projects
None yet
Development

No branches or pull requests

3 participants