davidchern's starred repositories

MemoryMosaics

Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.

Language:PythonLicense:Apache-2.0Stargazers:24Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

MoRA

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Language:PythonStargazers:266Issues:0Issues:0

Hrrformer

Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)

Language:PythonStargazers:39Issues:0Issues:0
Language:PythonLicense:MITStargazers:110Issues:0Issues:0

Convolutional-KANs

This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learnable non linear activations in each pixel.

Language:Jupyter NotebookLicense:MITStargazers:491Issues:0Issues:0
License:MITStargazers:255Issues:0Issues:0

nanoXLSTM

The simplest, fastest repository for training/finetuning medium-sized xLSTMs.

Language:PythonLicense:MITStargazers:36Issues:0Issues:0
Language:PythonStargazers:663Issues:0Issues:0

fast-kan

FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:233Issues:0Issues:0

LKAN

Variations of Kolmogorov-Arnold Networks

Language:PythonLicense:MITStargazers:100Issues:0Issues:0

ferns

Fast Exact Retrieval for Nearest-neighbor Search

Language:Jupyter NotebookStargazers:11Issues:0Issues:0

ChebyKAN

Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.

Language:Jupyter NotebookStargazers:276Issues:0Issues:0

FCN-KAN

Kolmogorov–Arnold Networks with modified activation (using fully connected network to represent the activation)

Language:PythonLicense:MITStargazers:87Issues:0Issues:0

drl-zh

Deep Reinforcement Learning: Zero to Hero!

Language:Jupyter NotebookLicense:MITStargazers:1931Issues:0Issues:0

kanrl

Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments

Language:PythonStargazers:213Issues:0Issues:0

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonLicense:MITStargazers:3201Issues:0Issues:0
Language:PythonLicense:MITStargazers:605Issues:0Issues:0

NCHL

Neuron centric Hebbian Learning

Language:Jupyter NotebookStargazers:2Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:12972Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3111Issues:0Issues:0

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonLicense:Apache-2.0Stargazers:31Issues:0Issues:0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1614Issues:0Issues:0

Score-Entropy-Discrete-Diffusion

[ICML 2024 Oral] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Language:PythonLicense:MITStargazers:208Issues:0Issues:0

infini-mini-transformer

This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.

Language:PythonStargazers:48Issues:0Issues:0

YuLan-Chat

YuLan-Chat: An Open-Source Bilingual Chatbot

Language:PythonLicense:MITStargazers:475Issues:0Issues:0

LLMBox

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Language:PythonLicense:MITStargazers:382Issues:0Issues:0

Bloom-Lora

Finetune Bloom big language model with Lora method

Language:PythonStargazers:28Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38127Issues:0Issues:0

Steel-LLM

Train a Chinese LLM From 0 by Personal

Language:Jupyter NotebookStargazers:97Issues:0Issues:0