Musketeer-D / rerio

Research release basecalling models and configurations

Home Page:https://nanoporetech.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

/ONT_logo.png


Rerio

Rerio is comprised of "research release" basecalling models and configuration files. All models are compatible with Guppy (see Nanopore Community page for download/install instructions). Since research models often utilise new features, the latest version of Guppy may be required.

The research models provide cutting-edge functions, speeds and accuracies that have not been productionised or validated by Oxford Nanopore Technologies in the Guppy executable basecaller. Nevertheless, models and config files can be run with the basecalling infrastructure in Guppy executable by using the instructions available in this repository.

Models are provided for DNA and RNA, various pore types and to basecall different modified bases in a variety of contexts.

Features

  • New and advanced research models that are at the forefront of nanopore sequencing analyses (e.g. highest accuracies, quickest speeds, more modified basecalls in more contexts)
  • Barcoding comaptibility (using Guppy files)
  • Functions with Megalodon (2.0 and above) to call modified DNA and RNA bases

Note: The results generated by reserach basecalling models have not been scrunitised nor validated and Oxford Nanopore cannot support each use case (see Reserach Release disclaimer)


Getting started

Follow the instructions below to use research models with Guppy executable.

Input

  • .fast5s containing trace data

Output(s)

  • FASTQ or FASTA files - according to the basecalling model implemented
  • .hdf5 files - modified basecalling models may produce tabulated data as .hdf5 formats

Dependencies

You will require:

  • Python-compatible environment (e.g. Terminal)
  • The most recent Guppy version, available on the Nanopore Community Software downloads page.

Note: Rerio is versioned by the minimum version of Guppy required to run _all_ models in the repository. A new release of Rerio will be tagged when a model is added that requires a more recent version of Guppy.

Installation

Rerio can be downloaded by cloning from github git clone https://github.com/nanoporetech/rerio. Once Rerio has been downloaded, models can be downloaded via the download_model.py script.

# Download all models
rerio/download_model.py
# Download specific model(s)
rerio/download_model.py rerio/basecall_models/res_dna_r941_min_modbases-all-context_v001

Once desired models have been downloaded, they can be run by specifying the Guppy data path option (-d or --data_path) and selecting the desired config file (-c or --config).

./ont-guppy/bin/guppy_basecaller -i fast5s/ -s basecalled_fast5s \
    -d ./rerio/basecall_models/ \
    -c res_dna_r941_min_modbases-all-context_v001.cfg

Use and description of models

Config DNA/RNA Pore Device Modbases First Added
res_dna_r941_min_modbases-all-context_v001.cfg DNA R9.4.1 MinION/GridION 5mC, 6mA v3.5.1
res_dna_r941_min_flipflop_v001.cfg DNA R9.4.1 MinION/GridION   v3.5.1
res_dna_r941_min_rle_v001.cfg DNA R9.4.1 MinION/GridION   v3.5.1
res_dna_r103_min_flipflop_v001.cfg DNA R10.3 MinION/GridION   v3.5.1
res_rna2_r941_min_flipflop_v001.cfg RNA2 R9.4.1 MinION/GridION   v3.5.1

Megalodon Support

Rerio research models can be run within megalodon by specifying the data directory in the --guppy-params argument.

megalodon fast5s/ --guppy-params "-d ./rerio/basecall_models/" \
    --guppy-config res_dna_r941_min_modbases-all-context_v001.cfg

Barcoding Support

The Rerio github code repository includes a minimal barcoding stub to allow Guppy to run successfully. In order to enable full Guppy barcoding capabilities, all barcoding files must be transferred from the guppy data directory to the rerio data directory.

cp ont-guppy/data/barcoding/* rerio/basecall_models/barcoding/

Help

Licence and Copyright

© 2020 Oxford Nanopore Technologies Ltd.

Rerio is distributed under the terms of the Oxford Nanopore Technologies, Ltd. Public License, v. 1.0. If a copy of the License was not distributed with this file, You can obtain one at http://nanoporetech.com

Research Release

Research releases are provided as technology demonstrators to provide early access to features or stimulate Community development of tools. Support for this software will be minimal and is only provided directly by the developers. Feature requests, improvements, and discussions are welcome and can be implemented by forking and pull requests. However much as we would like to rectify every issue and piece of feedback users may have, the developers may have limited resource for support of this software. Research releases may be unstable and subject to rapid iteration by Oxford Nanopore Technologies.

About

Research release basecalling models and configurations

https://nanoporetech.com/

License:Other


Languages

Language:Python 100.0%