colindaven / awesome-pangenomes

A list of software for pangenomics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

awesome-pangenomes

A list of software capable of analyzing eukaryotic genomes for pangenomics

πŸš€ indicates a popular repository

Toolkits

  • vg Full featured construction, mapping and SNP calling toolkit based on multiple formats. πŸš€
  • odgi Fast toolkit based on odgi format πŸš€
  • gfatools Toolkit for GFA parsing and conversion
  • gfakluge Toolkit and c++ API for GFA manipulation
  • gaftools Toolkit for GAF (Graph Alignment Format) sorting and manipulation.
  • gretl Statistics and analysis for GFA files, written in Rust

Pangenome construction

  • Minigraph Fast method by Heng Li, produces referenceGFA (rGFA) format (not GFA or odgi) πŸš€
  • Cactus/minigraph Pangenome builder which prioritizes downstream compatibility. Produces GFA and odgi. πŸš€
  • PGGB Pangenome Graph Builder, calculates SNPs as part of the pipeline. Produces GFA and odgi. πŸš€
  • Pantools v3+ Fully featured construction of pangenome graphs
  • PSVCP
  • PHG Practical Haplotype Graph
  • PATO R package for pangenome construction
  • Chrom_mini_graph Generate and map reads onto a coloured minimizer pangenome graph
  • GET_PANGENES Perl scripts used by the Ensembl Plants team for pangenomics
  • impg Create an implicit pangenome graph for a homologous target region, then use output bed files to extract sequences for PGGB etc.
  • MGRgraph An Algorithm to Build a Multi-genome Reference (warning - last updated 2018)
  • pangene Pangene constructs a pangenome gene graph from one protein set and many genomes and includes simple but effective visualization πŸš€
  • MEMO MEMO constructs a pangenome and index and allows kmer based conservation analyses and visualization

Short read alignment to a pangenome graph

  • vg map Original vg mapper (superseded by vg giraffe)
  • vg giraffe Faster and more modern alternative to vg map πŸš€
  • Hisat2
  • Minigraph Construct graphs or align short or long reads to graphs
  • Chrom_mini_graph Generate and map reads onto a coloured minimizer pangenome graph

Long read alignment to a pangenome graph

SNP callers and genotypers

  • vg call SNP caller for pangenomes, with gam or GAF output πŸš€
  • Paragraph A suite of graph-based genotyping tools for short read data
  • Pangenie kmer-based SV genotyping using short reads. Intended for human only (in 2023).
  • vg surject surject to linear reference, then use linear SNP caller like Freebayes, Deepvariant etc πŸš€
  • Deepvariant Case study of deep variant SNP calling on vg giraffe aligned bam files

Structural Variation (SV) callers and genotypers

  • vg call Call and genotype structural variants on a graph using long and short reads. πŸš€
  • GraphTyper A graph SV genotyper (does not call SVs)
  • Pangenie kmer-based SV genotyping using short reads. Intended for human only (in 2023).
  • SVarp Use long reads to detect structural variants in a GFA format pangenome.

Bubble (SV) detection

  • bubblegun A tool for detecting Bubbles and Superbubbles

Pangenome viewers -interactive

  • Panagram
  • Bandage Visualize GFA files in an interactive standalone app πŸš€
  • SeqTubemap Elegant path visualization for smaller regions of a pangenome from the vg team πŸš€
  • VAG Visualization of short sequence alignments in a pangenome
  • Panache View linearized pangenomes
  • Waragraph
  • PanGraphViewer
  • Wally View GFA (Work in progress 2023)
  • VRPG View rGFA or GFA, written in python and html
  • Pantograph is a commercial pangenome graph viewer option
  • PGV A web based viewer similar to SeqTubeMap
  • Pancat Scripts to filter and visualize GFA files
  • gfaestus GFA visualizer, GPU-accelerated using Vulkan
  • gfaviz Graphical interactive tool for the visualization of sequence graphs in GFA format
  • AGB Interactive assembly graph browser
  • MoMI-G Genome graph browser for SVs visualization. User can filter and visualize annotations and inspect SVs with read alignments over the genome graph. πŸš€
  • pangene Pangene can visualize one protein set mapped to x genomes to check synteny and presence absence. πŸš€

Pangenome viewers -static

  • vg view - generates static images
  • odgi - generates static images

Pangenome pipelines

  • pangenome. A comprehensive nf-core Nextflow pipeline using PGGB and odgi components. Better parallelized than PGGB alone. πŸš€
  • pangepop A snakemake pipeline to create a pangenome with minigraph-cactus and align reads against it with vg giraffe

Graph validation tools

File formats

Miscellaneous tools

  • gfainject Map short alignments in BAM format to a GFA (seems it is not a real aligner but a conversion tool). Output in GAF format.
  • GRAFIMO GRAph-based Finding of Individual Motif Occurrences using vg

kmer based approaches

Libraries to explore pangenomes

  • gfapy implements GFA1 and GFA2 parsing and scalable exploration of graphs in Python
  • gfagraphs implements rGFA and GFA1 parsing and editing of graphs in Python
  • graphanalyzer a python package to read and analyze the PAF and the GFA files for the graphs.

Other lists of pangenome tools

Contributions

Is something missing? Contributions are welcome, please make PRs to main or write an issue with a link.

About

A list of software for pangenomics

License:MIT License