Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aggregation Unique supports the list format. #12437

Closed
mjshare opened this issue Dec 26, 2022 · 2 comments
Closed

Aggregation Unique supports the list format. #12437

mjshare opened this issue Dec 26, 2022 · 2 comments
Labels
feature request New feature or request question Further information is requested

Comments

@mjshare
Copy link

mjshare commented Dec 26, 2022

Aggregation Unique supports the list format. Does the community have a plan?

@mjshare mjshare added Needs Triage Need team to review and classify feature request New feature or request labels Dec 26, 2022
@beckernick beckernick added question Further information is requested and removed Needs Triage Need team to review and classify labels Jan 3, 2023
@beckernick
Copy link
Member

Could you provide a little bit more information?

Are you working in C++ or Python? Are you looking for a way to get the unique elements per list, the unique lists across a column, or something else?

@GregoryKimball
Copy link
Contributor

Hello @mjshare thank you for your message. I believe the issue here is that we don't support groupby aggregations on List columns yet in cuDF-python.

We support unique on List columns:

>>> df = cudf.DataFrame({'a':[[1],[2],[1]], 'b':[1,2,3]})
>>> df['a'].unique()
0    [1]
1    [2]
Name: a, dtype: list

however, groupby throws a NotImplementedError

>>> df.groupby('a').min()
Traceback (most recent call last):
...
NotImplementedError: Unsupported column type passed to create an Index: <class 'cudf.core.column.lists.ListColumn'>

This issue appears to already be covered by #8039. For now I'll close this issue in favor of #8039 and the related #12037.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants