sashaaero / genetio

Library for I/O and storage of genetic data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

genetio

Library for I/O and storage of genetic data

Supported input formats:

  • PLINK binary ped (.bed/.bim/.fam)
  • Eigenstrat
  • Packed Ancestrymap

Supported output formats for phased haplotype data:

  • Eigenstrat (.phgeno/.phsnp/.phind)
  • Gzipped Eigenstrat
  • Impute2
  • Gzipped Imput2

Output for unphased genotypes available in Eigenstrat format

Internal storage formats:

  • Each genotype stored in three bits, implemented in PersonBits---enables rapid processing of multiple SNPs at a time using 64-bit unsigned ints
  • A bulk storage format with support for families, with each genotype stored in 2 bits (the same as PLINK bed format), implemented in PersonBulk

Coming soon: support for VCF input

About

Library for I/O and storage of genetic data

License:GNU General Public License v3.0


Languages

Language:C++ 98.2%Language:C 1.2%Language:Makefile 0.6%