Thierry Gosselin's starred repositories

LightGBM

A fast, distributed, high-performance gradient boosting (GBT, GBDT, GBRT, GBM, or MART) framework based on decision tree algorithms, used for ranking, classification, and many other machine learning tasks.

future

:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone

ranger

A Fast Implementation of Random Forests

forcats

🐈🐈🐈🐈: tools for working with categorical variables (factors)

Language: R, License: NOASSERTION, Stargazers: 538, Issues: 21, Issues: 231

SNPRelate

R package: parallel computing toolset for relatedness and principal component analysis of SNP data (Development version only)

ohana

Mixture classification, constraint optimization, outlier detection, population structure, admixture history, and selection detection.

missRanger

R package "missRanger" for fast imputation of missing values by random forests.

Language: R, License: GPL-2.0, Stargazers: 58, Issues: 9, Issues: 34

stacks_workflow

RADseq workflow built around STACKS

Language: R, License: GPL-3.0, Stargazers: 56, Issues: 9, Issues: 12

terastructure

TeraStructure is a new algorithm to fit Bayesian models of genetic variation in human populations on tera-sample-sized data sets (10^12 observed genotypes, i.e., 1M individuals at 1M SNPs). This package provides a scalable, multi-threaded C++ implementation that can be run on a single computer.

Language: C++, License: GPL-3.0, Stargazers: 47, Issues: 13, Issues: 14

readDepth

R package for inferring copy number from read depth

Language: R, License: NOASSERTION, Stargazers: 30, Issues: 0, Issues: 0

eca-bioinf-handbook

Lecture notes/book in progress on computing for conservation genomics

Language: TeX, License: MIT, Stargazers: 27, Issues: 6, Issues: 0

scat

SCAT software for "Smoothed and Continuous Assignment Tests"

strataG

strataG is a toolkit for haploid sequence and multilocus genetic data summaries, and analyses of population structure.

genepopedit

Simple and flexible manipulation of genomic data.

MavericK

Source code for the program MavericK, described fully at www.bobverity.com/maverick

Language: C++, Stargazers: 11, Issues: 0, Issues: 0

whoa

Where's my Heterozygotes at? Observations on genotyping Accuracy

Language: HTML, Stargazers: 6, Issues: 5, Issues: 0

Language: Roff, License: GPL-3.0, Stargazers: 3, Issues: 12, Issues: 9

Lobster-assigner

Assigner tutorial for Benestan et al. erratum

rmetasim

rmetasim population genetic simulation engine

ebvSim

GEO-BON Genetics Working Group simulation-based evaluations of Essential Biodiversity Variables (EBVs)