BenjaminSchwessinger / nanoget

Functions to extract information from Oxford Nanopore sequencing data and alignments

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

nanoget

This module provides functions to extract useful metrics from Oxford Nanopore sequencing reads and alignments.

Twitter URL install with conda Build Status

FUNCTIONS

Data can be presented in the following formats, using the following functions:

  • A sorted bam file process_bam(bamfile, threads)
  • A standard fastq file process_fastq_plain(fastqfile, 'threads')
  • A fastq file with metadata from MinKNOW or Albacore process_fastq_rich(fastqfile)
  • A sequencing_summary file generated by Albacore process_summary(sequencing_summary.txt, 'readtype')

Fastq files can be compressed using gzip, bzip2 or bgzip. The data is returned as a pandas DataFrame with standardized headernames for convenient extraction. The functions perform logging while being called and extracting data.

INSTALLATION

pip install nanoget

or
install with conda

conda install -c bioconda nanoget

About

Functions to extract information from Oxford Nanopore sequencing data and alignments

License:GNU General Public License v3.0


Languages

Language:Python 99.7%Language:Shell 0.3%