mlin / PhyloCSF

Phylogenetic analysis of multi-species genome sequence alignments to identify conserved protein-coding regions

Home Page:http://compbio.mit.edu/PhyloCSF

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Is it possible to check the coding potential with only one species mRNA sequence?

nowandnow opened this issue · comments

I did three tests with the following dataset (Aldh2.mRNA.fa, Aldh2.mRNA.test.fa, Aldh2.mRNA.test2.fa)
I got the same amino acid as a result. (The max score has changed, but it is equally positive score.)

Can I check the coding potential with only the mRNA sequence now?

below is my commands and results.
PhyloCSF 29mammals Aldh2.mRNA.fa --aa --orf=ATGStop --frames=3 --removeRefGaps
-> Aldh2.mRNA.fa max_score(decibans) 2013.9264 343 1899 MLRAAL...
PhyloCSF 29mammals Aldh2.mRNA.test.fa --aa --orf=ATGStop --frames=3 --removeRefGaps
-> Aldh2.mRNA.test.fa max_score(decibans) 760.1431 343 1899 MLRAAL...
PhyloCSF 29mammals Aldh2.mRNA.test2.fa --aa --orf=ATGStop --frames=3
-> Aldh2.mRNA.test.fa max_score(decibans) 760.1431 343 1899 MLRAAL...
fasta.zip

The max score was reduced, but it was positive value.
Based on this, can I think of this mRNA as a coding sequence?
I am working on checking whether a novel rna is a coding sequence or a non-coding sequence.
I want to use PhyloCSF as one of the tools to check it.

Yours sincerely.