YazBraimah / khmer_trinity_snakemake

A Snakemake pipeline for Illumina Transcriptome Assembly: trimmomatic + dignorm+ trinity

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

khmer_trinity_snakemake

What's this?

This is a repo containint a Snakemake workflow to assemble RNA-Seq reads.

The general workflow is as follows:

  • Read trimming with Trimmomatic

  • Read normalization with the khmer/diginorm Python package.

  • Transcriptome assembly with Trinity.

How to run it?

  1. Clone this repo

  2. Make a virtualenv with python>=3.4:

virtualenv --python=python3.5 .
  1. Activate it
source bin/activate
  1. Install locally the required software:
bash scripts/install_software.sh
  1. Test the pipeline with the sample data to see if it works (Trimmomatic, Trinity, khmer and additional pip packages):
ln -s config.yaml.example config.yaml
snakemake -j 24

How to analyse your data?

Copy the config.yaml.example into config.yaml and fill the file with your data:

  • Paths to your FASTQ files

  • Adaptors used

  • Phred Scores

  • Additional parameters for Trimmomatics, Khmer and Trinity

Have fun!

About

A Snakemake pipeline for Illumina Transcriptome Assembly: trimmomatic + dignorm+ trinity

License:MIT License


Languages

Language:Python 82.0%Language:Shell 18.0%