simras / CLAP

CLAP - A pipeline used to process CLIP-seq (specifically PAR-CLIP, HITS-CLIP and iCLIP) data.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Is it possible to use for bacteria?

tmargus opened this issue · comments

Hi,
I see that CLAP is designed to run on eukaryote genomes.
How to adapt it to run on bacteria?

I changed make_annotation.sh by adding species escherichia_coli_k_12 and adjusted ftp site and PATH accordingly. It retrieves annotation and sequence and performs all operations without error message. There are some cosmetic things like Chromosome is converted to "chrChromosome"
and some scripts assume 3 command line arguments one of which is "MT" but it doesn't seem to hurt. Is there anything else I should take into account or have to change/modify?
Cheers,
Tonu

Is it not more common for genes to overlap in bacteria?

This could present a problem in the processing of the annotation. Otherwise I cannot think of any reasons it should not work if the organism is well annotated and you check the genome and plasmid names match the sequence files you will use. In the annotation pipeline we rename chromosomes so they match ENSEMBL standard (UCSC chrMT > ENSEMBL chrM).

You can turn off mapping to splice junctions.