Integrative Pipeline for Splicing Analysis
To use this program you must have python environment with the following programs and libraries installed:
snakemake
pysam
pandas
numpy
Such environment may be created via conda:
conda create -n ipsa
conda activate ipsa
conda install -c conda-forge -c bioconda snakemake pysam pandas
To install the pipeline:
git clone https://github.com/Leoberium/pyIPSA.git
To run (test run if input directory is empty):
- Python environment with required libraries must be active
- Make directory with Snakefile active
- Run
snakemake
command
Test run produces empty aggregated_junction_stats.tsv
file in output directory.
Options for snakemake
command are available in
snakemake documentation.
Just load module with Python and install libraries:
module load ScriptLang/python/3.8.3
pip3 install --user --upgrade snakemake pandas pysam
After that you can run pipeline using cluster engine:
snakemake --cluster qsub --j <number of jobs>
Folder in repository:
config
- the folder with config file, where you set up your pipelinedeprecated
- the folder with old scripts not used in workflowknown_SJ
- the folder with annotated splice junctionsworkflow
- the folder with working scripts of the pipeline
Additional directories created
genomes
- the folder which stores all downloaded genomesannotations
- the folder which stores all downloaded annotations