thackl / cr-genomes

Supplementary code and data for the genomes of four C. roenbergensis strains: E4-10P, BVI, Cflag, and RCC970-E3

Home Page: https://doi.org/10.1101/751586

Generic scripts used in data analysis

https://github.com/thackl/sam-scripts

  • bam-coverage
  • bam-junctions

https://github.com/thackl/seq-scripts

  • seq-circ-restart
  • seq-circ-trim
  • seq-comp

https://github.com/thackl/phylo-scripts

  • tax-resolve

K-mer-based genome size estimation for strain E4-10P

genome-size-estimation/genome-size-estimation.R

genome-size-estimation/CrE410P-kmer-spectrum.png
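For readers unfamiliar with the approach, here is a minimal Python sketch of the general idea behind k-mer-based genome size estimation. It is not a port of genome-size-estimation.R, whose exact method is not described here. It assumes a k-mer histogram (multiplicity mapped to the number of distinct k-mers) is already available; the function name `estimate_genome_size` and the simple valley/peak heuristic are illustrative assumptions.

```python
def estimate_genome_size(histogram):
    """Estimate genome size from a k-mer spectrum.

    histogram: dict mapping k-mer multiplicity -> number of distinct k-mers.
    Returns (estimated_size, peak_multiplicity).
    Assumption: low-multiplicity k-mers left of the first valley are
    sequencing errors and are excluded from the estimate.
    """
    mults = sorted(histogram)
    # Walk down the error peak until counts start rising again;
    # the last point before the rise is treated as the valley.
    for prev, cur in zip(mults, mults[1:]):
        if histogram[cur] > histogram[prev]:
            valley = prev
            break
    else:
        raise ValueError("no valley found in k-mer spectrum")

    solid = [m for m in mults if m >= valley]
    # The highest point right of the valley approximates the k-mer coverage.
    peak = max(solid, key=lambda m: histogram[m])
    # Total number of non-error k-mer occurrences divided by the coverage.
    total_kmers = sum(m * histogram[m] for m in solid)
    return total_kmers / peak, peak


if __name__ == "__main__":
    # Toy spectrum: an error peak at multiplicity 1 and a main peak at 5.
    toy = {1: 1000, 2: 200, 3: 50, 4: 100, 5: 300, 6: 100, 7: 20}
    size, coverage = estimate_genome_size(toy)
    print(f"coverage peak: {coverage}, estimated size: {size:.0f}")
```

Real spectra are noisier than this toy example, so a practical analysis would typically smooth the histogram or fit a model to the main peak rather than rely on the first local minimum.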

C. roenbergensis draft assemblies

The assemblies have been processed with Redundans, but they are unpolished and still contain bacterial contamination and potential misassemblies.

| file | assembler | primary_data | num_seqs | sum_len | min_len | max_len | N50 |
|---|---|---|---:|---:|---:|---:|---:|
| CrBV-c0b.fa | Canu 1.8 | pacbio | 343 | 39107625 | 5118 | 1533637 | 429009 |
| CrBV-c1b.fa | Canu 1.8 | pacbio | 394 | 40117058 | 5308 | 975077 | 270687 |
| CrBV-f1b.fa | Flye 2.3.7 | pacbio | 213 | 37884398 | 5129 | 1475468 | 479063 |
| CrBV-w5b.fa | Wtdbg 2.1 | pacbio | 268 | 38356196 | 5098 | 1458748 | 435830 |
| CrCf-c0b.fa | Canu 1.8 | pacbio | 272 | 34561700 | 5589 | 1022239 | 229877 |
| CrCf-c1b.fa | Canu 1.8 | pacbio | 311 | 36551481 | 5490 | 1011470 | 270089 |
| CrCf-f1b.fa | Flye 2.3.7 | pacbio | 216 | 34137805 | 5245 | 946461 | 276367 |
| CrCf-w5b.fa | Wtdbg 2.1 | pacbio | 187 | 33253511 | 5670 | 938012 | 321504 |
| CrEa-c0b.fa | Canu 1.8 | pacbio | 265 | 35731872 | 5057 | 1514847 | 402275 |
| CrEa-c1b.fa | Canu 1.8 | pacbio | 293 | 36283366 | 5128 | 1087465 | 282182 |
| CrEa-f1b.fa | Flye 2.3.7 | pacbio | 190 | 34509388 | 5337 | 1335512 | 348875 |
| CrEa-gen-dp-4.5.fa | SPAdes 3.6.1 | miseq + corr. pacbio | 315 | 29710364 | 1002 | 910477 | 228292 |
| CrEa-w5b.fa | Wtdbg 2.1 | pacbio | 376 | 36791972 | 5002 | 1475054 | 432668 |
| CrEa-w6b.fa | Wtdbg 2.1 | pacbio | 371 | 36429848 | 5057 | 947144 | 405606 |
| CrEa-w7b.fa | Wtdbg 2.1 | pacbio | 333 | 36790428 | 5080 | 1467234 | 546423 |
| CrEa-w8b.fa | Wtdbg 2.1 | pacbio | 385 | 36725772 | 5002 | 1259187 | 383112 |
| CrRC-c0b.fa | Canu 1.8 | pacbio | 513 | 30455408 | 5151 | 563941 | 99385 |
| CrRC-c1b.fa | Canu 1.8 | pacbio | 392 | 34047446 | 6587 | 635481 | 155376 |
| CrRC-f1b.fa | Flye 2.3.7 | pacbio | 396 | 33792790 | 5193 | 720955 | 131225 |
| CrRC-w5b.fa | Wtdbg 2.1 | pacbio | 296 | 31845612 | 5053 | 701287 | 178453 |
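
The summary columns in the table above (num_seqs, sum_len, min_len, max_len, N50) can be recomputed from any of the FASTA files. The tool used to produce the table is not stated here, so the following is only a minimal Python sketch under the standard definition of N50: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total length. The function names `read_fasta_lengths` and `assembly_stats` are illustrative, not part of this repository.

```python
def read_fasta_lengths(path):
    """Return the length of every sequence in a FASTA file."""
    lengths = []
    current = None  # length of the record being read; None before the first header
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if current is not None:
                    lengths.append(current)
                current = 0
            elif current is not None:
                current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def assembly_stats(lengths):
    """Compute the table's summary columns from a list of sequence lengths."""
    if not lengths:
        raise ValueError("no sequences given")
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # N50: first length at which the cumulative sum reaches half the total.
        if running * 2 >= total:
            n50 = length
            break
    return {
        "num_seqs": len(ordered),
        "sum_len": total,
        "min_len": ordered[-1],
        "max_len": ordered[0],
        "N50": n50,
    }


if __name__ == "__main__":
    import sys

    # Usage: python assembly_stats.py CrBV-c0b.fa [more.fa ...]
    for fasta in sys.argv[1:]:
        print(fasta, assembly_stats(read_fasta_lengths(fasta)))
```

Working on a plain list of lengths keeps the N50 logic independent of file parsing, so it can be checked on small hand-made inputs before being run on a full assembly.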
