-
Couldn't load subscription status.
- Fork 247
Open
Description
I would like to assign a taxonomic label to my protein sequences using the blast NR database and the mmseqs taxonomy command available in the docker image (quay.io/microbiome-informatics/mmseqs:2.13). I noticed that the default coverage value is zero according to the help page.
-c FLOAT List matches above this fraction of aligned (covered) residues (see --cov-mode) [0.000]
However, if I set the -c parameter to 0.8, which sounds reasonable to me, then all of my sequences are labelled as no rank unclassified.
Full command:
mmseqs taxonomy queryDB ${MMSEQS2_DATABASE_DIR} taxresults.database tmp --lca-ranks superkingdom,phylum,class,order,family,genus,species,subspecies --threads 28
My questions are:
- Doesn't it always make sense to increase the
-cparameter to reduce spurious hits? - How can I inspect the alignment of the best hit?
- Does easy-taxonomy also use the same default
lca-modeor a different one?
Metadata
Metadata
Assignees
Labels
No labels