MMSeqs2 parameters

Question

MMSeqs2 parameters

mhorlacher opened this issue 4 months ago · comments

In the paper it is written that sequences were split into train/eval/test via MMSeqs2 clustering with a sequence_identity threshold of 0.95 - could you provide the full set of parameters used for clustering, or were the remaining ones left to be the default? Thanks!

Tobias Hegelund Olsen · Answer 1 · Fri Mar 22 2024 20:31:20 GMT+0800 (China Standard Time)

Hi Marc, thank you for the question. Other than using "--cov-mode 1" to better handle fragmented sequences, the rest of the parameters used for clustering were the default ones.

I hope this helps!

Marc Horlacher · Answer 2 · Fri Mar 22 2024 23:36:40 GMT+0800 (China Standard Time)

Helps a lot, thanks!