Nextomics / Sniffles

Structural variation caller using third generation sequencing

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sniffles

Sniffles is a structural variation caller using third generation sequencing (PacBio or Oxford Nanopore). It detects all types of SVs (10bp+) using evidence from split-read alignments, high-mismatch regions, and coverage analysis. Please note the current version of Sniffles requires sorted output from BWA-MEM (use -M and -x parameter) or NGMLR with the optional SAM attributes enabled! If you experience problems or have suggestions please contact: fritz.sedlazeck@gmail.com

Please see our github wiki for more information (https://github.com/fritzsedlazeck/Sniffles/wiki)

How to build Sniffles

wget https://github.com/fritzsedlazeck/Sniffles/archive/master.tar.gz -O Sniffles.tar.gz
tar xzvf Sniffles.tar.gz
cd Sniffles-master/
mkdir -p build/
cd build/
cmake ..
make

cd ../bin/sniffles*
./sniffles

Note Mac users often have to provide parameters to the cmake command:

cmake -D CMAKE_C_COMPILER=/opt/local/bin/gcc-mp-4.7 -D CMAKE_CXX_COMPILER=/opt/local/bin/g++-mp-4.7 .. 

NGMLR

Sniffles performs best with the mappings of NGMLR our novel long read mapping method. Please see: https://github.com/philres/ngmlr


Citation:

Please see and cite our paper: https://www.nature.com/articles/s41592-018-0001-7


Poster & Talks:

Accurate and fast detection of complex and nested structural variations using long read technologies Biological Data Science, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 26 - 29.10.2016

NGMLR: Highly accurate read mapping of third generation sequencing reads for improved structural variation analysis Genome Informatics 2016, Wellcome Genome Campus Conference Centre, Hinxton, Cambridge, UK, 19.09.-2.09.2016


Datasets used in the mansucript:

We provide the NGMLR aligned reads and the Sniffles calls for the data sets used:

Arabidopsis trio:

Genome in the Bottle trio:

NA12878:

SKBR3:

About

Structural variation caller using third generation sequencing

License:MIT License


Languages

Language:HTML 36.8%Language:C++ 28.1%Language:C 20.0%Language:Makefile 5.2%Language:Assembly 2.6%Language:Ada 1.7%Language:Shell 1.3%Language:Pascal 1.3%Language:C# 1.0%Language:DIGITAL Command Language 0.5%Language:CMake 0.4%Language:Objective-C 0.4%Language:CSS 0.2%Language:CLIPS 0.1%Language:M4 0.1%Language:Roff 0.1%Language:Perl 0.1%Language:SAS 0.0%Language:Module Management System 0.0%Language:Batchfile 0.0%