maolopez / human_genome_projects

Manipulating txt, csv files with human genome information.

# human_genome_projects

Manipulating txt, csv files with human genome information.

General description: A CSV file containing a human DNA genome has 2395592 words. By comparison, a text file of the Bible has only 824383 words. The CSV file has 4 columns and 598912 rows. The columns are rsid, chromosome, position and genotype. Percentages are calculated along with the frequencies.
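As a minimal sketch of the four-column layout described above, the following reads such a file with Python's standard `csv` module. It assumes the file is comma-separated and has no header row; the function name `read_genome_rows` and the sample values are hypothetical, and the repository's scripts may parse the file differently.

```python
import csv
import io

# Column names as listed in the description (assumed order, assumed no header row).
COLUMNS = ["rsid", "chromosome", "position", "genotype"]

def read_genome_rows(handle):
    """Yield one dict per CSV row, keyed by the four column names."""
    reader = csv.DictReader(handle, fieldnames=COLUMNS)
    for row in reader:
        yield row

# Hypothetical sample data standing in for a real genome CSV.
sample = io.StringIO(
    "rs0000001,1,1000,AA\n"
    "rs0000002,1,2000,AG\n"
    "rs0000003,2,1500,GG\n"
)
rows = list(read_genome_rows(sample))
print(len(rows), "rows; first genotype:", rows[0]["genotype"])
```

For a real file, `open("file.csv", newline="")` would take the place of the `io.StringIO` object.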

| Script | Description | Usage |
| --- | --- | --- |
| csv_2_list_count_genotypes.py | calculate the frequency of each genotype | python3 csv_2_list_count_genotypes.py file.csv |
| csv_2_list_count_chromosomes.py | calculate the frequency of each chromosome | python3 csv_2_list_count_chromosomes.py file.csv |
| csv_2_list_count_genotypes_by_chromosome.py | the frequency of each chromosome:genotype pair | same logic |
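
The counting these three scripts describe can be sketched with `collections.Counter`. This is an illustration under stated assumptions, not the repository's code: the helper name `frequencies`, the rounding of percentages to two decimals, and the sample rows are all hypothetical.

```python
from collections import Counter

def frequencies(values):
    """Return {value: (count, percentage)} for an iterable of hashable values."""
    counts = Counter(values)
    total = sum(counts.values())
    return {
        value: (count, round(100.0 * count / total, 2))
        for value, count in counts.items()
    }

# Hypothetical (chromosome, genotype) rows standing in for a parsed CSV.
rows = [("1", "AA"), ("1", "AG"), ("2", "AA"), ("2", "AA")]

# One call per script: genotypes, chromosomes, and chromosome:genotype pairs.
by_genotype = frequencies(genotype for _, genotype in rows)
by_chromosome = frequencies(chromosome for chromosome, _ in rows)
by_pair = frequencies(f"{chromosome}:{genotype}" for chromosome, genotype in rows)

print(by_genotype)
print(by_chromosome)
print(by_pair)
```

Keeping the count and the percentage together in one tuple means each category is tallied in a single pass over the rows.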

Wheel Installation

pip install dist/human_genome_projects-0.1.0-py36-none-any.whl

OR

pip3.6 install --user https://s3.amazonaws.com/human_genome_projects-0.1.0-py36-none-any.whl

Docker Usage

docker run IMAGE_ID

License: Creative Commons Attribution Share Alike 4.0 International


Languages

Python 95.0%, Dockerfile 5.0%