$ pip install tqdm sqlite3 pandas
$ git clone https://github.com/oleuml/biodatabase.git
$ mkdir res
$ cd res/
$ wget hgdownload.cse.ucsc.edu/goldenpath/hg38/chromosomes/chr<NUM>.fa.gz
$ wget ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_27/gencode.v27.annotation.gtf.gz
$ gunzip *.gz
Also needed SNPs
from http://genome.ucsc.edu/cgi-bin/hgTables with following configuartion:
- genome: Human
- group: Variation
- track: Common SNPs
- region: genome
- output format: selected fields from primary and related tables
- output file:
chr.snp
And columns/fields (if you are clicking "get output"
):
- chrom
- chromStart
- chromEnd
- name
- strand
- refNCBI
- observed
- molType
- class
And store the output file
in the resource directory
.
$ python3 run.py -c NUM -a
$ yaourt -S sqlitebrowser
$ brew cask install db-browser-for-sqlite