nicholas-owen / scrna2019

supporting code for the multinomial single cell RNA-Seq paper

Feature Selection and Dimension Reduction for Single Cell RNA-Seq based on a Multinomial Model

This repository contains supporting code to facilitate reproducible analysis. For details, see the bioRxiv preprint. If you find a bug, please open a GitHub issue. GLM-PCA is also available as a standalone R package.

Coauthors

Will Townes, Stephanie Hicks, Martin Aryee, and Rafa Irizarry

Description of Repository Contents

algs

Implementations of dimension reduction algorithms

  • existing.R - wrapper functions for existing methods such as PCA, t-SNE, and ZINB-WaVE.
  • glmpca.R - implementation of GLM-PCA, a generalization of PCA to generalized linear model likelihoods. The paper highlights this method as suitable for single cell RNA-Seq data.

real

Analyses of several real scRNA-Seq datasets. The R Markdown files can be used to reproduce the figures in the manuscript.

real_benchmarking

Systematic assessment of clustering performance of a variety of normalization, feature selection, and dimension reduction algorithms using ground-truth datasets.

Downloadable table of results from assessments
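
As an illustration of how clustering performance can be scored against ground-truth labels, the sketch below computes the adjusted Rand index (ARI) in base R. It is a minimal example, not code from this repository: the function name `adjusted_rand_index` and the toy label vectors are hypothetical, and whether the benchmarking scripts use exactly this metric is an assumption.

```r
# Adjusted Rand index between two label vectors (base R only).
# Hypothetical helper for illustration; not taken from this repository.
adjusted_rand_index <- function(truth, pred) {
  stopifnot(length(truth) == length(pred))
  n_pairs <- function(x) sum(x * (x - 1) / 2)   # number of unordered pairs
  tab <- table(truth, pred)                     # contingency table of labels
  sum_cells <- n_pairs(tab)                     # pairs grouped together in both
  sum_rows <- n_pairs(rowSums(tab))             # pairs grouped together in truth
  sum_cols <- n_pairs(colSums(tab))             # pairs grouped together in pred
  expected <- sum_rows * sum_cols / n_pairs(length(truth))
  max_index <- (sum_rows + sum_cols) / 2
  # Undefined (NaN) when both labelings put every cell in a single cluster
  (sum_cells - expected) / (max_index - expected)
}

# Toy usage: cluster names differ but the partitions agree
truth <- rep(c("typeA", "typeB", "typeC"), each = 4)
pred <- rep(c(2, 3, 1), each = 4)
ari <- adjusted_rand_index(truth, pred)
```

The index depends only on how cells are partitioned, not on the cluster names, so it can compare the output of any clustering method with known cell identities.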

util

Utility functions.

  • clustering.R - wrappers for Seurat clustering, model-based clustering, and k-means.
  • functions.R - Poisson and binomial deviance and residual functions, plus a function for loading 10x read counts from molecule information files.
  • functions_genefilter.R - convenience functions for gene filtering (feature selection) based on highly variable genes, highly expressed genes, and deviance.
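
To make the deviance-based feature selection concrete, here is a minimal base R sketch. It assumes a dense genes-by-cells count matrix and a null model in which each gene has a constant relative abundance across cells; genes that fit this null poorly receive a high deviance and are kept. The function names (`x_log_ratio`, `null_means`, `poisson_deviance`, `binomial_deviance`, `top_deviance_genes`) are hypothetical and are not the functions defined in functions.R or functions_genefilter.R.

```r
# x * log(x / mu), using the convention that 0 * log(0) = 0
x_log_ratio <- function(x, mu) ifelse(x > 0, x * log(x / mu), 0)

# Expected counts under the null: gene relative abundance times cell total
null_means <- function(counts) {
  outer(rowSums(counts) / sum(counts), colSums(counts))
}

# Per-gene Poisson deviance against the constant-abundance null
poisson_deviance <- function(counts) {
  mu <- null_means(counts)
  2 * rowSums(x_log_ratio(counts, mu) - (counts - mu))
}

# Per-gene binomial deviance against the same null
binomial_deviance <- function(counts) {
  mu <- null_means(counts)
  n <- matrix(colSums(counts), nrow(counts), ncol(counts), byrow = TRUE)
  2 * rowSums(x_log_ratio(counts, mu) + x_log_ratio(n - counts, n - mu))
}

# Indices of the k genes with the largest deviance
top_deviance_genes <- function(counts, k, deviance_fn = binomial_deviance) {
  head(order(deviance_fn(counts), decreasing = TRUE), k)
}

# Toy data: gene C is expressed only in the last two cells
counts <- rbind(A = c(10, 10, 10, 10),
                B = c(10, 10, 10, 10),
                C = c(0, 0, 20, 20))
selected <- rownames(counts)[top_deviance_genes(counts, k = 1)]
```

This sketch builds dense matrices the size of the input, so a real dataset stored in a sparse format would need a more memory-efficient implementation.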
