SENASICA-CNRPYC / plasmidID

PlasmidID is a mapping-based, assembly-assisted plasmid identification tool that analyzes and gives graphic solution for plasmid identification.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CircleCI Build Status License: GPL v3 CircleCi Community Scif

plasmidID Logo



#f03c15 PlasmidID is already available to download #f03c15

PlasmidID is a mapping-based, assembly-assisted plasmid identification tool that analyzes and gives graphic solution for plasmid identification.

PlasmidID is a computational pipeline implemented in BASH that maps Illumina reads over plasmid database sequences. The most covered sequences are clustered by identity to avoid redundancy and the longest are used as scaffold for plasmid reconstruction. Reads are assembled and annotated by automatic and specific annotation. All information generated from mapping, assembly, annotation and local alignment analyses is gathered and accurately represented in a circular image which allow user to determine plasmidic composition in any bacterial sample.

This image sumarizes PlasmidID pipeline, including the most important steps. For furder details, including:

workflow_small

Quick usage

Example: Clone the repo:

git clone git@github.com:BU-ISCIII/plasmidID.git
cd plasmidID
git submodule init
git submodule update

Run it with the test data using docker or singularity:

Notice that the input files MUST be in your present working directory or in any folder inside it. For example, if I execute this command in /home/smonzon, my folder with the files would be in /home/smonzon/TEST_DATA.

docker run -v $PWD:$PWD -w $PWD buisciii/plasmidid plasmidID.sh \
     -1 TEST_DATA/KPN_TEST_R1.fastq.gz  \
     -2 TEST_DATA/KPN_TEST_R2.fastq.gz \
     -d TEST_DATA/plasmids_TEST_database.fasta \
     -c TEST_DATA/contigs_KPN_TEST.fasta \
     --no-trim \
     -s KPN

Or you can use Singularity instead:

singularity exec docker://buisciii/plasmidid plasmidID.sh \
     -1 TEST_DATA/KPN_TEST_R1.fastq.gz  \
     -2 TEST_DATA/KPN_TEST_R2.fastq.gz \
     -d TEST_DATA/plasmids_TEST_database.fasta \
     -c TEST_DATA/contigs_KPN_TEST.fasta \
     --no-trim \
     -s KPN

You can also build the image in your local machine cloning the repository and building it using the Singularity recipe provided:

git clone git@github.com:BU-ISCIII/plasmidID.git
cd plasmidID
git submodule init
git submodule update
singularity build plasmidid.simg Singularity
singularity exec plasmidid.simg plasmidID.sh \
     -1 TEST_DATA/KPN_TEST_R1.fastq.gz  \
     -2 TEST_DATA/KPN_TEST_R2.fastq.gz \
     -d TEST_DATA/plasmids_TEST_database.fasta \
     -c TEST_DATA/contigs_KPN_TEST.fasta \
     --no-trim \
     -s KPN

If you want to use trimming functionality you have to supply trimmomatic-directory parameter:

docker run -v $PWD:$PWD -w $PWD buisciii/plasmidid plasmidID.sh \
     -1 file_R1.fastq.gz  \
     -2 file_R2.fastq.gz \
     -d database.fasta \
     -c contigs.fasta \
     --trimmomatic-directory /scif/apps/trimmomatic/Trimmomatic-0.38 \
     -s KPN

Note: You MUST skip trimming filtering in order to use TEST_DATA as it is already filtered!

About

PlasmidID is a mapping-based, assembly-assisted plasmid identification tool that analyzes and gives graphic solution for plasmid identification.

License:GNU General Public License v3.0


Languages

Language:Shell 98.7%Language:Dockerfile 1.3%