FinucaneLab / gene_features

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Repository of gene features used for gene prioritization

This repository contains gene features (gene x feature matrices) used in the Finucane lab, primarily for gene prioritization methods.

Raw data

Most raw data is in the form of (cell x gene) or (tissue x gene) count or TPM matrices. Experiments were performed either at the bulk or single cell level, and in some cases are single cells are merged before we derive features. Raw data will in many cases not be available on the public repository due to either total size or because it is currently unpublished and we do not have permissions to share. Please update this list everytime a new dataset is added.

Feature types

We derive features underlying tisse type / cellular processes using:

  • Features that explain the most variance across (and within) cell populations (gene loadings on top X PCs, gene loading on top X ICs, within cluster PCs and ICs)
  • Features that represent genes that define predefined cell populations or identified clusters (one vs. all differential gene expression -- t-stat, DE genes)
  • Features that represent expression programs shared across cell types (TBD, co-expression gene modules)

Features can be found here.

Getting setup

(Need to update) Run install.R to install all necessary packages. Working code for each type of data is named similarly and lives in code.

Analysis

Derived features

Current output features:

Next steps:

  • Gene modules (WCGNA)
  • Co-expression analysis

About

License:GNU General Public License v3.0


Languages

Language:R 100.0%Language:Shell 0.0%