MutScape is a Python tool for preprocessing variant call format(VCF) and mutation annotation format(MAF) files and
pip install termcolor
pip install tqdm
pip install numpy
pip install pandas
pip install vcfpy
pip install seaborn
pip install oncodriveclust
pip install comut
Preprocessing VCF files and MAF files
python3 vcf_maf_process.py ...
Required input files(Required)
python3 vcf_maf_process.py \
-f [tsv file path]
- for VCF files: a 9 columns TSV file
NORMAL |
TUMOR |
MuSe |
Mutect2 |
SomaticSniper |
Strelka2 |
VarScan2 |
At Least # CALLS |
At Most # REJECT |
... |
... |
... |
... |
... |
... |
... |
... |
... |
- for MAF files: a 1 column TSV file
python3 vcf_maf_process.py \
...
-vf GI "[1:3,5]" CI "15,15,0,0,0,0.05,8,8" P 1 FFPE 0.9 \
...
python3 vcf_maf_process.py \
...
-c \
...
python3 vcf_maf_process.py \
...
-v2m 48 \
...
python3 vcf_maf_process.py \
...
-mf GI "[1:3,5]"
...
Output files and Meta files
python3 vcf_maf_process.py \
...
-o [output files path] \
-m [meta files path] \
...
Data Analysis and Visualization
- Preprocessing: Make all MAFs into one MAF file
python3 vcf_maf_process.py \
-f examples/Tissue_samples/ms_maf/maf.tsv \
-m examples/Tissue_samples/ms_maf \
-o examples/Tissue_samples/ms_maf
- Estimation
python3 maf_analysis.py \
-f examples/Tissue_samples/ms_maf/maf_combination.maf \
-ms 1 "[2,9,10]" \
-o examples/Tissue_samples/outputs \
-p examples/Tissue_samples/pictures
- Analysis and Visualization
python3 maf_analysis.py \
-f examples/Tissue_samples/ms_maf/maf_combination.maf \
-ms 2 "[3]" \
-o examples/Tissue_samples/outputs \
-p examples/Tissue_samples/pictures
- Outputs
![](https://camo.githubusercontent.com/b667e65c6deab7bcdfeb2373234e664ec8751af200860bae33be4952b459e80a/68747470733a2f2f692e696d6775722e636f6d2f6d685779576a662e706e67)
HRD Score, Whole-Genome doubling(WGD) and Chromosome instability(CIN)
- HRD Score
python3 maf_analysis.py \
-f examples/CBCP_samples/CBCP_oncokb_maf.txt \
-hrd examples/CBCP_samples/HRD_input.tsv grch37 \
-o examples/CBCP_samples/outputs \
-p examples/CBCP_samples/pictures
- WGD_CIN
python3 maf_analysis.py \
-f examples/CBCP_samples/CBCP_oncokb_maf.txt \
-wgdcin examples/CBCP_samples/HRD_input.tsv \
-o examples/CBCP_samples/outputs \
-p examples/CBCP_samples/pictures
- ONCOKB
python3 maf_analysis.py \
-f examples/TCGA/TCGA_Mutect_v10_white_rerun.maf \
-oncokb ../oncokb-annotator/ ca398551-c549-49bd-80f0-0e68d9ca033c 4 examples/TCGA/clinical_input.txt \
-o examples/TCGA/outputs \
-p examples/TCGA/pictures
- Outputs examples
![](https://camo.githubusercontent.com/3e49cdeea83e6cf5fc6d4879839b91fb5599a9353662fb7f2242120dffee6001/68747470733a2f2f692e696d6775722e636f6d2f6853763356734d2e706e67)