matam_db_preprocessing.py clustering #107

mdsufz · 2022-05-18T07:33:34Z

I want to build a custom reference database. As I understand it, SILVA offers two downloads: NR99 (clustered at 99% identity) and Ref (not clustered).
Normally I would download Ref and then use VSEARCH to cluster the sequences at 95% identity. However, the script matam_db_preprocessing.py also clusters the sequence file it is given. So my question is: if I run this script on an already clustered database such as NR99, will it re-cluster the sequences? If so, can we just provide the unclustered database to MATAM and have it perform the clustering at a user-specified identity?
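
For reference, this is roughly the manual clustering step I mean, as a minimal sketch. It only builds the VSEARCH command line (`--cluster_fast`, `--id`, `--centroids`); the file names and the helper function `build_vsearch_cluster_cmd` are placeholders I made up for illustration, not anything from MATAM.

```python
import shlex


def build_vsearch_cluster_cmd(input_fasta, centroids_fasta, identity=0.95):
    """Build a VSEARCH command that clusters sequences at a given identity.

    Hypothetical helper for illustration: it returns the argument list
    and does not run VSEARCH itself.
    """
    if not 0.0 < identity <= 1.0:
        raise ValueError("identity must be a fraction in (0, 1]")
    return [
        "vsearch",
        "--cluster_fast", input_fasta,    # input sequences to cluster
        "--id", str(identity),            # identity threshold as a fraction
        "--centroids", centroids_fasta,   # one representative per cluster
    ]


if __name__ == "__main__":
    # Placeholder file names; substitute the actual SILVA Ref download.
    cmd = build_vsearch_cluster_cmd("silva_ref.fasta", "silva_ref_95.fasta")
    print(shlex.join(cmd))
```

The resulting command can be pasted into a shell or passed to `subprocess.run(cmd, check=True)` on a machine where VSEARCH is installed.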
