indexing error: there is no k-mer that belongs to a list of size > 128 and <= 256 #126
Comments
ohhh — thanks for reporting this @rpolicastro! This seems like an error in the underlying construction of sshash. I'll ping @jermp here as he will likely know where this is coming from and what the fix might be upstream (once we know the fix, it will be pretty easy to propagate new versions of piscem; simpleaf itself shouldn't require an update). I believe it's related to the skew data structure in sshash, and particularly to one of the skew buckets being empty! One really simple idea is, what if you try with a smaller minimizer size — like |
Hi all, Best, |
No, but it would be trivial to expose (the underlying piscem-cpp includes it). So I can add that and in the meantime @rpolicastro can try a different minimizer size. However, I wonder if there is a way to make the skew index itself robust to possibly empty buckets. --Rob |
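To illustrate the empty-bucket situation discussed above, here is a minimal Python sketch. It is not sshash's code: the function names and the `min_log2=6` / `max_log2=12` defaults are assumptions chosen for illustration. It groups list sizes into ranges of the form (2^l, 2^(l+1)], matching the shape of the error message ("size > 128 and <= 256"), and reports which ranges received no list, so that a builder could skip them instead of aborting.

```python
from collections import defaultdict


def partition_by_list_size(list_sizes, min_log2=6, max_log2=12):
    """Group list sizes into ranges (2^l, 2^(l+1)] for l in [min_log2, max_log2).

    Hypothetical helper: sizes <= 2^min_log2 are ignored, and anything
    larger than 2^max_log2 is placed in the last range.
    """
    partitions = defaultdict(list)
    for size in list_sizes:
        if size <= 2 ** min_log2:
            continue  # small lists are not part of this illustrative skew index
        # Smallest l such that size <= 2^(l+1), clamped to the last range.
        l = min((size - 1).bit_length() - 1, max_log2 - 1)
        partitions[l].append(size)
    return partitions


def empty_ranges(list_sizes, min_log2=6, max_log2=12):
    """Return the (low, high] ranges, as (low, high) tuples, that contain no list."""
    partitions = partition_by_list_size(list_sizes, min_log2, max_log2)
    return [
        (2 ** l, 2 ** (l + 1))
        for l in range(min_log2, max_log2)
        if not partitions.get(l)
    ]


if __name__ == "__main__":
    # No list has a size in (128, 256], which mirrors the reported error.
    sizes = [70, 100, 300, 5000]
    for low, high in empty_ranges(sizes):
        print(f"no list of size > {low} and <= {high}; skipping this range")
```

A builder written this way would treat an empty range as a no-op rather than an error; whether that is feasible in the actual skew index depends on details of sshash that this sketch does not model.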
Yeah, good point. I know it is annoying that this might happen.
Well, just to keep me on track: jermp/sshash#42.
Thank you both for the quick response. I assume if I move forward with this setting for now it won't have any adverse effects on the downstream analysis? Cheers, |
Nope! In fact, the results should not be any different. The only distinction is that mapping may become a few % slower (may not even be noticeable). I am adding the --Rob |
I didn't check with the above reprex but I just wanted to confirm with the new simpleaf version (0.16.2) that setting the minimizer length back to default |
w00t! I was just going to come here to ask if you could do this if it wasn't too much trouble, but you already did it! Glad to hear that just propagating a different seed fixes it :).
Hi!
I was trying out the new version of simpleaf (0.16.1) and piscem (0.7.0) and I ran into an error during indexing.
Installing everything through conda/mamba.
Downloading the mouse genome from ENSEMBL.
Running simpleaf index.
The error.