Is it possible to use for bacteria?
tmargus opened this issue · comments
Hi,
I see that CLAP is designed to run on eukaryote genomes.
How to adapt it to run on bacteria?
I changed make_annotation.sh
by adding species escherichia_coli_k_12 and adjusted ftp site and PATH accordingly. It retrieves annotation and sequence and performs all operations without error message. There are some cosmetic things like Chromosome is converted to "chrChromosome"
and some scripts assume 3 command line arguments one of which is "MT" but it doesn't seem to hurt. Is there anything else I should take into account or have to change/modify?
Cheers,
Tonu
Is it not more common for genes to overlap in bacteria?
This could present a problem in the processing of the annotation. Otherwise I cannot think of any reasons it should not work if the organism is well annotated and you check the genome and plasmid names match the sequence files you will use. In the annotation pipeline we rename chromosomes so they match ENSEMBL standard (UCSC chrMT > ENSEMBL chrM).
You can turn off mapping to splice junctions.