jhawe / chipseq_pipeline

A snakemake ChIP-seq pipeline allowing IDR analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

NOTE: this pipeline is still under development. The current rulegraph generated by Snakemake can be found here.

A snakemake pipeline for ChIP-sequencing data

Introduction

Workflow description

How to get started

The following command will map, merge and filter reads for all samples as well as downsample all bam files to 20e6 reads using a cluster computation (qsub). It will further call peaks for all samples using the macs2 peak caller.

NOTE: The use of '--use-conda' is mandatory if not all tools required by the analysis pipeline are installed on the host system. ALso, this ensures that all tools which are run are used in the same version as in the paper.

To obtain only peak calls for all samples, filtered for encode blacklisted peaks, replace the keyword 'all' at the end of the snakemake call with 'filtered_peaks_all'

nohup nice snakemake --use-conda -u configs/cluster.json --jobs=100 --local-cores=1 --cluster "qsub -pe smp {threads} -hard -l job_mem={resources.mem_mb}M \
      -q {cluster.q} -cwd -V -o {log} -e {log} -N {cluster.N}" all &

# SLURM test
nohup nice snakemake --use-conda -u configs/slurm.config --jobs=100 --local-cores=1 --cluster \
  "sbatch -t {cluster.time} -c {cluster.cpu} --mem-per-cpu {cluster.mem} \
      -p {cluster.partition} -o {cluster.log} -e {cluster.log}" results/current/peaks/BH1-1_peaks.narrowPeak &

About

A snakemake ChIP-seq pipeline allowing IDR analysis


Languages

Language:Python 80.2%Language:R 19.8%