Question about the splice sites prediction
AstroSign opened this issue · comments
I am trying to use the pretrained mode on my splice site data but the result is kind of random. In my case, I used the binary classifier(dnaprom) to classify if the splice site is at the middle of the sequence. My false positive samples are generated by randomly selecting not matched donors and acceptors.
I don't know if that's because the format of my data or other reason. Could you please provide the splice site data in your experiments? It will be helpful if you can provide both the 3-class(donor, acceptor and non-splice site) one and the TP splice site dataset.
I am trying to use the pretrained mode on my splice site data but the result is kind of random. In my case, I used the binary classifier(dnaprom) to classify if the splice site is at the middle of the sequence. My false positive samples are generated by randomly selecting not matched donors and acceptors.
I don't know if that's because the format of my data or other reason. Could you please provide the splice site data in your experiments? It will be helpful if you can provide both the 3-class(donor, acceptor and non-splice site) one and the TP splice site dataset.
I encountered the same question as I use the binary classifier to classify my data. The result is random when I use sequences randomly extracted from reference. However, when I use my own positive data, and the negative data from sample_data, I got a better accuracy.