edawson / presig

Convert MAF files to formats used as input to mutational signature programs including SigProfiler

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

presig: Transform a MAF file into the inputs of signet / sparsesigs

Eric T Dawson
Nov 2019

Introduction

presig generates count matrices of single-base (SBS) and indel (ID) features for input to SigProfiler, SignatureAnalyzer, and signet/sparsesigs. It takes a MAF file as input, as well as a corresponding references FASTA file.

Requirements

presig requires pyfaidx for parsing FASTA files and pycotap for testing.

Installation

git clone --recursive https://github.com/edawson/presig
cd presig/
pip install -r requirements.txt -e .

Basic usage

presig is primarily designed to be used at the command line, though you can also import its individual functions into a python environment if desired.

Convert a MAF file to feature counts

python presig/presig.py -m <MAF> -f <FASTA>

This will generate two files (<MAF>.SBS96.tsv and <MAF>.ID83.tsv) in the current directory.

Convert a MAF file to the SigProfiler simple text format

python presig/presig.py -m <MAF> -f <FASTA> -s -u > <outputfile>.txt

Run SigProfiler (including installing SigProfilerHelper)

To run SBS96 signatures using the TSV counts matrix generated using presig:

git clone --recursive https://github.com/edawson/sigprofilerhelper
python sigprofilerhelper/run_sigprofiler.py -c 16 -i 1000 -s 1 -e 7 -t <MAF>.SBS96.tsv

Questions and bug reports

Please post an issue on the GitHub if you have a question or find a bug.

About

Convert MAF files to formats used as input to mutational signature programs including SigProfiler

License:MIT License


Languages

Language:Python 100.0%