davidchern

followers

following

stars

davidchern's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.038520 383 1645

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT14295 110 343

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonMIT3774 33 36

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Language:PythonApache-2.02383 17 163

drl-zh

Deep Reinforcement Learning: Zero to Hero!

Language:Jupyter NotebookMIT1987 11 3

MAP-NEO

Language:Python816 10 34

Convolutional-KANs

This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learnable non linear activations in each pixel.

Language:Jupyter NotebookMIT683 13 11

FourierKAN

Language:PythonMIT671 7 6

LLMBox

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Language:PythonMIT543 6 9

YuLan-Chat

YuLan: An Open-Source Large Language Model

Language:PythonMIT521 5 11

Score-Entropy-Discrete-Diffusion

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Language:PythonMIT333 6 10

ChebyKAN

Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.

Language:Jupyter Notebook324 8 8

MoRA

MoRA: High-Rank Updating for Parameter-Efﬁcient Fine-Tuning

Language:PythonApache-2.0319 3 11

fast-kan

FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)

Language:Jupyter NotebookApache-2.0318 2 13

AGI-survey

kanrl

Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments

Language:Python245 4 7

DCFormer

Language:PythonMIT158 6 1

Steel-LLM

Train a Chinese LLM From 0 by Personal

Language:Jupyter Notebook132 4 1

LKAN

Variations of Kolmogorov-Arnold Networks

Language:PythonMIT110 3 4

FCN-KAN

Kolmogorov–Arnold Networks with modified activation (using fully connected network to represent the activation)

Language:PythonMIT97 1 1

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonApache-2.083 1 4

infini-mini-transformer

This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.

Language:Python51 2 2

Hrrformer

Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)

Language:Python44 3 1

nanoXLSTM

The simplest, fastest repository for training/finetuning medium-sized xLSTMs.

Language:PythonMIT38 10

MemoryMosaics

Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.

Language:PythonApache-2.030 5 3

Bloom-Lora

Finetune Bloom big language model with Lora method

Language:Python28 1 9

ferns

Fast Exact Retrieval for Nearest-neighbor Search

Language:Jupyter Notebook1100

NCHL

Neuron centric Hebbian Learning

Language:Jupyter Notebook200

LARS-VSA

Language:Jupyter NotebookMIT100