Best practices for converting plink bed bim fam to geno snp ind
geneanalyst opened this issue · comments
Is there currently a best practice for adding plink bed bim fam samples to geno snp ind datasets.
I've always 1st converted geno snp ind to plink using convertf , merged all plink data in plink, then converted bed bim fam to ped map pedind and back to geno snp ind using convertf.
Sometimes I feel this is not the best way to do it though.
Also, FYI plink doesn't preserve allele order and I wasn't sure if allele order in snp files is important to AF calculations, so I have as a matter of practice always re-aligned allele order in snp files (messed up by conversions from plink) to match dbSNP and ensure that col 5 is REF and col 6 is ALT in snp file
Thanks. Do you have a par file I can use for mergit? I didn't see any commands for it.
Ok, I was concerned that that the code looks to col 6 of snp for ALT and to calculate AF. As a matter of practice I use a script to re-import REF and ALT into col 5 and 6 of snp from Hg Reference Genome