CD-HIT protein clustering - difference length
From here
Please note that by default, cd-hit only lists matches where sequences in db2 are not longer than sequences in db1.
You may use options -S2 or -s2 to overwrite this default.
From here
Please note that by default, cd-hit only lists matches where sequences in db2 are not longer than sequences in db1.
You may use options -S2 or -s2 to overwrite this default.
changed milestone to %Analyses
added Analysis Enhancement Investigation labels
Trying -s2 -0.9
:
-s2 length difference cutoff for db1, default 1.0 by default, seqs in db1 >= seqs in db2 in a same cluster
if set to 0.9, seqs in db1 may just >= 90% seqs in db2
mentioned in commit c84c44d1
The results did not change much.
closed
Change to -s2 0.5
mentioned in commit 214f07ad
mentioned in commit 1002a9e8