MCorentin

Molitor Corentin's repositories

vargen

VarGen is an R package designed to get a list of variants related to a disease. It just need an OMIM morbid ID as input and optionally a list of tissues / gwas traits of interest to complete the results. You can also use your own customised list of genes. VarGen is capable of annotating the variants to help you identify the most impactful ones.

Language:RMIT13 5 5

Solanum_sitiens_assembly

Instructions to reproduce the de novo assembly of Solanum sitiens (accession LA1974).

MIT400

plot_transcripts_filtering.py

Script to plot the number of transcripts left after filtering by low expression

Language:PythonMIT300

plotReadLengths

Python Script to create a histogram of the sequences lengths in a Fasta file (useful to get the distribution of Pacbio Reads for example)

Language:PythonGPL-3.0200

MOAR

MOAR (Mummer On Assembly against a Reference)

Language:ShellMIT1 20

book

Language:TeX000

ChromosomesOverview

R code to create bar graph of chromosomes and add on them the positions of transcripts/genes

Language:RMIT000

efficientR

Efficient R programming: a book

Language:TeXNOASSERTION000

Solanum_chilense_assembly

Scripts and files used to perform the de novo assembly of Solanum chilense (LA1972)

Language:PerlMIT000

sra-cleaning

Python script to automatically parse a "Contamination.txt" file from the Sequence Reads Archive (SRA) and correct the assembly FASTA file and annotation GTF file.

Language:PythonMIT000

tutorial-kmer-spectra

R markdown explaining k-mer spectra, and how sequencing errors and heterozygosity are affecting them.

MIT000

GBS2LK

From GBS to Linkage Map (using Tassel)

Language:PythonMIT020

GeneToCN

Gene copy number prediction from k-mer frequencies

Language:PythonGPL-3.0000

Pichis

000

PRS-Tutorial

A tutorial on how to run basic polygenic risk score analysis

MIT000

run_pilon_batches.sh

Script to run pilon by batches of contigs (to avoid out of memory issues)

Language:ShellMIT000

SAIGE

Development for SAIGE and SAIGE-GENE(+)

GPL-3.0000

SIFT4G_Create_Genomic_DB

Create genomic databases with SIFT predictions. Input is an organism's genomic DNA (.fa) file and the gene annotation file (.gtf). Output will be a database that can be used with SIFT4G_Annotator.jar to annotate VCF files.

GPL-3.0000

Test_Geno

000