ilarsf / MapDbSNP

Add positions to data that only contains dbSNP IDs

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MapDbSNP

  • UNIX commands and Rscripts will add positions to data that only contains dbSNP IDs
  • Will download dbSNP151 from UCSC genome browser and prepare files for future runs (~90 GB hard disk space)

Required R packages

  • data.table
  • optparse
  • parallel
  • here

Usage

Rscript ./script/positionsFromDBSNP.r [options]

Options:

--input=path to file with dbSNP IDs, e.g. summary statistics
--ID=column name with dbSNP IDs
--build=Genome Build: hg19 or hg38
--outdir=Path of output directory
--prefix=Prefix for output file name without path or extension
--cpus=Number of available CPUs for parallel runs (uses up to 64 for hg19 or 68 for hg38)
--skip=Skip lines of input file

Example command line:

Rscript ./script/positionsFromDBSNP.r \
--input=./example/example_input.txt \
--ID=ID \
--build=hg19 \
--outdir=./example \
--prefix=example \
--cpus=16

About

Add positions to data that only contains dbSNP IDs

License:GNU General Public License v3.0


Languages

Language:R 91.4%Language:Awk 6.5%Language:Shell 2.2%