Converts VCF indel notation from the dash form (A -> "-") to proper VCF notation, which includes the leading reference base (GA -> G).
For info please contact me: or.yaacov@mail.huji.ac.il
Requires: R (3+), Bioconductor (packages: BSgenome, BSgenome.Hsapiens.UCSC.hg19, Biostrings)
Takes a 5-column TSV file (chr, pos, name, ref, alt):
chr1 20996757 NULL T -
chr1 20996257 NULL TT -
chr1 20996457 NULL - TT
chr1 20996457 NULL - T
chr1 20996457 NULL A G
Converts to:
chr1 20996756 NULL AT A
chr1 20996256 NULL GTT G
chr1 20996456 NULL A ATT
chr1 20996456 NULL A AT
chr1 20996457 NULL A G
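
The conversion rule implied by the examples above can be sketched in R as follows. This is a minimal illustration, not the script's actual code: the function name to_vcf_indel and the ref_base_at lookup argument are hypothetical, and the toy lookup table only holds the three anchor bases that appear in the example output. With the listed Bioconductor packages, the lookup would presumably wrap a getSeq() call on BSgenome.Hsapiens.UCSC.hg19 instead.

```r
# Hypothetical helper (not from the original script): convert one record from
# dash notation to VCF notation by prepending the base before the event.
# `ref_base_at` is an assumed lookup function: (chr, pos) -> single base.
to_vcf_indel <- function(chr, pos, ref, alt, ref_base_at) {
  if (ref != "-" && alt != "-") {
    # Not an indel in dash notation (e.g. an SNV): pass through unchanged.
    return(list(chr = chr, pos = pos, ref = ref, alt = alt))
  }
  anchor_pos <- pos - 1L                      # position of the leading base
  anchor <- ref_base_at(chr, anchor_pos)      # reference base at that position
  if (alt == "-") {
    # Deletion: T -> "-" becomes AT -> A
    list(chr = chr, pos = anchor_pos, ref = paste0(anchor, ref), alt = anchor)
  } else {
    # Insertion: "-" -> TT becomes A -> ATT
    list(chr = chr, pos = anchor_pos, ref = anchor, alt = paste0(anchor, alt))
  }
}

# Toy lookup holding only the anchor bases shown in the example output.
# Assumption: with BSgenome this could instead be something like
#   as.character(getSeq(BSgenome.Hsapiens.UCSC.hg19, chr, start = pos, end = pos))
toy_bases <- c("chr1:20996756" = "A",
               "chr1:20996256" = "G",
               "chr1:20996456" = "A")
toy_lookup <- function(chr, pos) unname(toy_bases[paste0(chr, ":", pos)])

del <- to_vcf_indel("chr1", 20996757L, "T", "-", toy_lookup)
ins <- to_vcf_indel("chr1", 20996457L, "-", "TT", toy_lookup)
snv <- to_vcf_indel("chr1", 20996457L, "A", "G", toy_lookup)
```

Passing the lookup in as an argument keeps the rule itself independent of the genome build, so the same function can be checked against a small table like the one above before being pointed at hg19.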