[BUG]: inappropriate arguments for `map_batches` in ray mode #533

HYLcool · 2025-01-08T06:40:29Z

Currently, running Data-Juicer on multiple nodes in "ray" mode, which uses `map_batches` to process datasets, might cause some hidden problems.

The `map_batches` method has two arguments, `num_gpus` and `concurrency`, which are actually cluster-level arguments. However, they are currently calculated automatically from the hardware information of a single machine. As a result, there might be resource utilization problems when running on multiple nodes for OPs whose `_accelerator` is "cuda".
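To illustrate the mismatch, here is a minimal sketch of deriving `concurrency` from cluster-wide totals instead of one machine's hardware. It is not Data-Juicer's actual code: the helper name `gpu_op_concurrency` and the example numbers are hypothetical. It assumes a resource dict shaped like the one returned by `ray.cluster_resources()`, and that `num_gpus` is the GPU share reserved per worker while `concurrency` is the total number of workers across the cluster.

```python
import math


def gpu_op_concurrency(resources, gpus_per_worker):
    """Hypothetical helper: how many GPU workers fit in the given resources.

    `resources` is a dict such as {"CPU": 64.0, "GPU": 8.0}. The result is an
    upper bound, since it ignores how GPUs are split across nodes.
    """
    if gpus_per_worker <= 0:
        raise ValueError("gpus_per_worker must be positive")
    total_gpus = resources.get("GPU", 0)
    # The small epsilon guards against float rounding for fractional GPU shares.
    return math.floor(total_gpus / gpus_per_worker + 1e-9)


# Assumed example: 4 nodes with 4 GPUs each, one GPU per worker.
single_node = {"CPU": 32.0, "GPU": 4.0}    # what one machine reports
whole_cluster = {"CPU": 128.0, "GPU": 16.0}  # what the cluster offers in total

local_estimate = gpu_op_concurrency(single_node, 1)      # sized from one node
cluster_estimate = gpu_op_concurrency(whole_cluster, 1)  # sized from the cluster
print(local_estimate, cluster_estimate)
```

With these assumed numbers, sizing from a single node would leave most of the cluster's GPUs idle. In a real job the cluster-wide value could then be passed along the lines of `ds.map_batches(op, num_gpus=1, concurrency=cluster_estimate)`, subject to how the OP is wrapped for Ray.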


HYLcool added the labels `bug` (Something isn't working) and `dj:dist` (issues/PRs about distributed data processing) Jan 8, 2025

github-project-automation bot added this to data-juicer Jan 8, 2025

github-project-automation bot moved this to Todo in data-juicer Jan 8, 2025
