josephvalencia / FairGenomicPCA

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

FairGenomicPCA

CS 599 Convex Optimization @ Oregon State December 2020. This is an implementation of several dimensionality reduction methods to compare the performance of PCA-like algorithms on group and individual fairness objectives. The variants are:

I compared these algorithms on the Adult and Credit datasets and replicate experiments from Price et al 2006

The central dependencies for this project include pandas, numpy, scipy, jax, scikit-learn, and cvxpy. I have included both a pip-freeze.txt and a conda list if you have issues with libraries. (I used conda)

To test, please run:

python genome_trials.py

This is a small test on only the 500 SNPs with maximum variance. If you would like to test more, you can increase the KEEP_TOP_SNPs constant.

Also run

python adult_trials.py

You will see similar output, but the Pareto-PCA method will run and never finish.

About


Languages

Language:Jupyter Notebook 74.2%Language:Python 25.8%