cyverseuk / GWASSER

Dockerized Agave app for GWASSER, an R script for simple gwassing.

Home Page:http://cyverseuk.org/applications/gwasser/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

geno and pheno input file headers

teatree1212 opened this issue · comments

Based on the R script I understand that after merging the genotype and phenotype files need to have a Genotype header, a phenotype header and content, and as they are merged on "ID" , both have to have had an ID header.. But are there any rules on the header names?
-> Do you have any documentation about the minimal requirements of the input files and appropriate spelling, if this is necessary?
-> Could I have some more columns containing e.g. Gene names despite the header name being unknown?

If I remember correctly you can have as many columns as you like in the phenotype file (as long as you have ID, a phenotype column and at least a SNP one), you will select the one to analyze with the --phenos argument. The header names are case sensitive, I'm not used enough to R to know if there are other spelling requirements.
@Ward9250 maybe you could correct me and write some lines in the README.md to describe the input files formatting? that would probably be useful to any user who doesn't wish to go through the code