Song Lab @ Cal's repositories
tape-neurips2019
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)
factored-attention
Code for reproducing the results in our paper "Interpreting Potts and Transformer Protein Models Through the Lens of Simplified Attention".
MultiCluster
Software for three-way clustering of multi-tissue multi-individual gene expression data using semi-nonnegative tensor decomposition
SGDP_IGHV_TRBV
Clustered and filtered reads after mapping for the Simons Genome Diversity Project.
epigenome_editing_2023
Code for training and analyzing models from the epigenome editing paper by Batra et al.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
forward-equivalent-trees
Efficiently simulating multi-type birth-death processes via forward-equivalent parameter mapping
slip
SLIP is a sandbox environment for engineering protein sequences with synthetic fitness functions.