LI ZHENG's starred repositories
Font-Awesome
The iconic SVG, font, and CSS toolkit
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
latexify_py
A library to generate LaTeX expression from Python code.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
GraphEmbedding
Implementation and experiments of graph embedding algorithms.
PyTorch-BigGraph
Generate embeddings from large-scale graph-structured data.
ClusterGCN
A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).
shaDow_GNN
NeurIPS 2021: Improve the GNN expressivity and scalability by decoupling the depth and receptive field of state-of-the-art GNN architectures
SparseLinear
A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently
stopwords-ja
Japanese stopwords collection
multilingual_keyphrase_generation
[NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training"