Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

matam_db_preprocessing.py clustering #107

Open
mdsufz opened this issue May 18, 2022 · 0 comments
Open

matam_db_preprocessing.py clustering #107

mdsufz opened this issue May 18, 2022 · 0 comments

Comments

@mdsufz
Copy link

mdsufz commented May 18, 2022

I want to construct a personalized database. However, from what I understood, I can go to SILVA and download the NR99 (clustered at 99 % identity) or the Ref (not clustered).
Usually, I would just download the Ref and then use Vsearch to cluster the sequences at 95 % identity. However, the function matam_db_preprocessing.py also does some clustering to the provided sequence file. So my question is the following: if I run the above mentioned function on the clustered database will it re-cluster these sequences ? If so, can we just provide the unclustered database to MATAM and perform the user-specified identity clustering?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant