arangrhie / T2T-Polish

Evaluation and polishing workflows for T2T genome assemblies

T2T-Polish

This repository contains up-to-date evaluation and polishing workflows that can be adapted to general genome assembly projects. Most of the ideas were developed and described in the paper listed under Citation.

For the exact command lines and workflows used to generate the T2T-CHM13v1.0 and T2T-CHM13v1.1 assemblies, please refer to the Methods section in the CHM13-Issues repo. Note that some of the tools have been updated since then; those updates are tracked in this repo.

Contents

Related external links

Variant calling, refinement, and formatting (also see Error Detection)

Repeat-aware alignments

Automated polishing

  • Racon: Liftover branch for outputting edits in .vcf
  • Merfin: Latest stable code-base

Base-level QV estimation
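As a rough illustration of what a base-level QV expresses, the sketch below computes a Phred-scaled quality value from k-mer counts, using the widely used k-mer formulation in which assembly k-mers absent from the read set are treated as errors. This is an assumption made for illustration only: the function name `kmer_qv` and its parameters are hypothetical, and the scripts in this repo may compute QV differently.

```python
import math


def kmer_qv(asm_only_kmers: int, total_asm_kmers: int, k: int) -> float:
    """Phred-scaled consensus quality estimated from k-mer counts.

    asm_only_kmers  : k-mers found in the assembly but not in the reads
                      (hypothetical input; assumed to mark erroneous bases)
    total_asm_kmers : all k-mers counted in the assembly
    k               : k-mer size used for counting
    """
    if k <= 0 or total_asm_kmers <= 0:
        raise ValueError("k and total_asm_kmers must be positive")
    if not 0 <= asm_only_kmers <= total_asm_kmers:
        raise ValueError("asm_only_kmers must lie in [0, total_asm_kmers]")
    if asm_only_kmers == 0:
        # No unsupported k-mers: the estimated error rate is zero.
        return math.inf

    # Probability that a whole k-mer is error-free.
    p_kmer_ok = 1.0 - asm_only_kmers / total_asm_kmers
    # Per-base error rate, assuming independent errors along the k-mer.
    error_rate = 1.0 - p_kmer_ok ** (1.0 / k)
    # Phred scaling: QV = -10 * log10(error rate).
    return -10.0 * math.log10(error_rate)
```

A call such as `kmer_qv(1000, 1_000_000_000, 21)` returns the estimated QV for an assembly in which 1,000 of one billion 21-mers lack read support. Under this formulation, a higher QV means fewer unsupported k-mers per base.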

Citation

Please cite the following paper if you used any of the code shared in this repo:

Mc Cartney AM, Shafin K, Alonge M et al. Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies. Nat Methods (2022) doi: https://doi.org/10.1038/s41592-022-01440-3

License: Other


Languages

  • C: 39.2%
  • Shell: 36.3%
  • Jupyter Notebook: 8.1%
  • Python: 7.2%
  • R: 6.9%
  • Java: 2.3%
  • Makefile: 0.1%