We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Query to reproduce:
from datachain import DataChain, C from datachain.lib.tar import process_tar ( DataChain.from_storage("hf://datasets/mozilla-foundation/common_voice_17_0") .settings(parallel=4, cache=True, prefetch=4) .filter(C("file.path").glob("*/en/train/*.tar")) .gen(file=process_tar) .save("common-voice-index") )
The text was updated successfully, but these errors were encountered:
#747
Related #746
Sorry, something went wrong.
@skshetry can this be closed?
Cache is still broken. The above script might work because #730 stopped prefetching for HF.
@skshetry thanks! does require an upstream fix? could you please summarize (or copy from the previous tickets / discussion).
No branches or pull requests
Query to reproduce:
The text was updated successfully, but these errors were encountered: