Leoberium / pyIPSA

Integrative Pipeline for Splicing Analysis

pyIPSA

Integrative Pipeline for Splicing Analysis

Installation & Run

To use this program you must have python environment with the following programs and libraries installed:

snakemake
pysam
pandas
numpy

Such environment may be created via conda:

conda create -n ipsa
conda activate ipsa
conda install -c conda-forge -c bioconda snakemake pysam pandas

To install the pipeline:
git clone https://github.com/Leoberium/pyIPSA.git

To run (test run if input directory is empty):

Python environment with required libraries must be active
Make directory with Snakefile active
Run snakemake command

Test run produces empty aggregated_junction_stats.tsv file in output directory.

Options for snakemake command are available in snakemake documentation.

For Arcuda users

Just load module with Python and install libraries:

module load ScriptLang/python/3.8.3
pip3 install --user --upgrade snakemake pandas pysam

After that you can run pipeline using cluster engine: snakemake --cluster qsub --j <number of jobs>

Working folders

Folder in repository:

config - the folder with config file, where you set up your pipeline
deprecated - the folder with old scripts not used in workflow
known_SJ - the folder with annotated splice junctions
workflow - the folder with working scripts of the pipeline

Additional directories created

genomes - the folder which stores all downloaded genomes
annotations - the folder which stores all downloaded annotations

About

Integrative Pipeline for Splicing Analysis

Languages

Language:Python 100.0%