alexdobin / STAR

RNA-seq aligner

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

high % of reads mapped to no features gene counts

teryyoung opened this issue · comments

image
image

this is my mapping result, the percentage mapped reads looks well, but a lot of these reads assigned to no features, how do I fix this problem?

this is my command:
STAR --twopassMode Basic\ --quantMode TranscriptomeSAM GeneCounts\ --runThreadN 6\ --genomeDir ${ref_dir}\ --alignIntronMin 5\ --alignIntronMax 60000\ --outSAMtype BAM Unsorted\ --outSAMattrRGline ID:"${sample}" SM:"${sample}" PL:ILLUMINA\ --outFilterMismatchNmax 2\ --outSJfilterReads Unique --outSAMmultNmax 1\ --outFileNamePrefix ./alignment/${sample}/${sample}\ --outSAMmapqUnique 60\ --readFilesCommand zcat\ --readFilesIn "$r1" "$r2"
--outTmpDir ./tmp

samtools sort -@ 12 -o ./alignment/${sample}/${sample}.sorted.bam ./alignment/${sample}/${sample}Aligned.out.bam
`

This means that many reads in your library map to intronic or intergenic regions. If you have a set of more complete annotations, you may be able to increase the % of exonic reads, but this more likely the issue with library prep.

@alexdobin thanks for your reply, I used the <hg38.ncbiRefSeq.gtf> annotation file.
But now I find the library are commercial referencial RNA which contain many of fusion Genes. I think this is why STAR cannot assign them to gene features ? I wonder can STAR recognize fusion gene? or any other issue?

Hi @teryyoung
STAR can detect chimeric alignments, and you would need some downstream processing to detect fusions (e.g. STAR-Fusion).
However, I doubt this will explain what you see.