Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Early return for num_shards==0 in the Beam pipeline. #778

Merged
merged 2 commits into from
Dec 3, 2024

Conversation

marcenacp
Copy link
Contributor

No description provided.

@marcenacp marcenacp requested a review from a team as a code owner December 2, 2024 10:05
Copy link

github-actions bot commented Dec 2, 2024

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@marcenacp marcenacp force-pushed the feature/efficient-filtering-100 branch from b3040e8 to ae35cc8 Compare December 2, 2024 10:09
@@ -195,6 +195,10 @@ def execute_operations_in_beam(
enumerate(files)
)
num_shards = len(files)
if not num_shards:
raise ValueError(
f"Empty {record_set=}. No file found for filters={json.dumps(filters)}"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No files?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

@marcenacp marcenacp force-pushed the feature/efficient-filtering-100 branch from 4b606dd to a374fa9 Compare December 3, 2024 10:10
@marcenacp marcenacp merged commit 5fbea37 into main Dec 3, 2024
12 checks passed
@marcenacp marcenacp deleted the feature/efficient-filtering-100 branch December 3, 2024 10:19
@github-actions github-actions bot locked and limited conversation to collaborators Dec 3, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants