mbarakatt / ALL_PCA_1000G

PCA for every single pair of populations in the 1000 Genomes Populations

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PCA for every single pair of populations in the 1000 Genomes Populations

30-second explanation on how these were made

  • I used the the .vcf file from the affy_6 chip affy_6
  • Converted to plink format
  • Removed SNPs with maf < 0.01
  • Created one set of plink files for all pairs of populations
  • LD based pruning using plink with arguments 50 5 0.8
  • Ran PCA using plink
  • Used homemade python script to generate the pdfs
  • The big pdf, with all the populations, was made using pdfjam
  • More questions? Simply go look at the code

Examples

Low quality png render of a huge pdf with all pcas on it (I recommend you download it!) all pca png

Example of pca for a pair of populations. All of the pairs can be found in the figures folder. demo pair pop pca

About

PCA for every single pair of populations in the 1000 Genomes Populations


Languages

Language:Shell 53.9%Language:Python 46.1%