This repository contains a loose collection of scripts that I use to work with sequencing data. Most of them are probably only useful for myself, but I've made them public, because there is no reason to keep them private. Majority of the scripts are wrappers around commonly used tools. Below is a short description of some the scripts that might be more broadly useful.
Scripts for processing BAM files.
bamToFragmentBed.py
- Use bedtools to convert paired-end BAM file into a BED file of fragments.
Scripts for processing VCF files.
liftoverVcfGenotypes.py
- Uses CrossMap.py to lift over VCF file from hg19 coordinates to GRCh38 coordinates.
Scripts for processing GTF files.
gtf2database.py
- Converts GTF/GFF file into a database used by gffutilsfilterGFF.py
- Select a subset of genes and transcripts from a GFF file.