Chunk size #49

hoelzer · 2023-04-06T09:05:07Z

We have a new --chunk parameter to split the ILP corpus for faster parallel computing.

However, when the chunk size is too large concerning the number of input genomes, RIBAP crashes. E.g., I tried --chunks 80 for eight input genomes: crash.

We could add a check and warning. Or even better: we automatically adjust the chunk size when the user is defining something to high in comparison to the input genomes (not sure what would be a good formula here... e.g. --chunks 200 for 167 Klebsiella was fine, ...)

The text was updated successfully, but these errors were encountered:

klamkiew · 2023-04-07T12:00:07Z

I think the formula is number of pairwise comparisons == upper limit for --chunks
E.g., 8 input genomes lead to 28 pairwise comparisons, meaning it doesn't make sense to have more than 28 chunks.
Not sure how / when to tell NF this though ;)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chunk size #49

Chunk size #49

hoelzer commented Apr 6, 2023

klamkiew commented Apr 7, 2023

Chunk size #49

Chunk size #49

Comments

hoelzer commented Apr 6, 2023

klamkiew commented Apr 7, 2023