davidchern's starred repositories
MemoryMosaics
Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
Convolutional-KANs
This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learnable non linear activations in each pixel.
efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Score-Entropy-Discrete-Diffusion
[ICML 2024 Oral] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.
YuLan-Chat
YuLan-Chat: An Open-Source Bilingual Chatbot
Bloom-Lora
Finetune Bloom big language model with Lora method
ColossalAI
Making large AI models cheaper, faster and more accessible