PENG Bo (BlinkDL)

BlinkDL

User data from Github https://github.com/BlinkDL

Company:http://withablink.com

Home Page:https://rwkv.com/

GitHub:@BlinkDL

Twitter:@BlinkDL_AI

PENG Bo's repositories

RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:14110Issues:137Issues:261

ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Language:PythonLicense:Apache-2.0Stargazers:9509Issues:92Issues:121

AI-Writer

AI 写小说,生成玄幻和言情网文等等。中文预训练生成模型。采用我的 RWKV 模型,类似 GPT-2 。AI写作。RWKV for Chinese novel generation.

Language:PythonLicense:Apache-2.0Stargazers:3380Issues:50Issues:26

Hua

Hua is an AI image editor with Stable Diffusion (and more).

RWKV-CUDA

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

nanoRWKV

RWKV in nanoGPT style

Language:PythonLicense:MITStargazers:193Issues:5Issues:0

minGPT-tuned

A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:Jupyter NotebookLicense:MITStargazers:117Issues:7Issues:3

modded-nanogpt-rwkv

RWKV-7: Surpassing GPT

Language:PythonLicense:MITStargazers:98Issues:1Issues:0

fast.c

Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.

Language:CLicense:Apache-2.0Stargazers:73Issues:5Issues:1

RWKV-v2-RNN-Pile

RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.

Language:PythonLicense:Apache-2.0Stargazers:67Issues:4Issues:1

LinearAttentionArena

Here we will test various linear attention designs.

Language:PythonLicense:Apache-2.0Stargazers:62Issues:9Issues:0

SmallInitEmb

LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence

Language:PythonStargazers:59Issues:3Issues:0

BookCNN

《深度卷积网络:原理与实践》现已在淘宝天猫京东当当发售. 这里是其中的代码下载.

Language:Jupyter NotebookStargazers:56Issues:4Issues:4

WorldModel

Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business / finance / governance, and can align agents with human too.

License:Apache-2.0Stargazers:39Issues:10Issues:0

Albatross

efficient RWKV inference engine

Language:PythonLicense:Apache-2.0Stargazers:38Issues:0Issues:1

LM-Trick-Questions

Here we collect trick questions and failed tasks for open source LLMs to improve them.

Basis

The Basis Programming Language

Language:PythonStargazers:26Issues:3Issues:0
Language:JavaScriptStargazers:13Issues:2Issues:0

AntiAging

List of Anti-aging Research

License:MITStargazers:11Issues:5Issues:0

zoology

Understand and test language model architectures on synthetic tasks.

Language:PythonLicense:Apache-2.0Stargazers:10Issues:0Issues:0

Nala

The Nala markup, to turn a "Natural Language" sentence into a code-like statement. Nala 标注,将自然语言变为编程语言。

License:MITStargazers:9Issues:4Issues:0

BlinkColorTheme

A colorful theme for HTML+JS+CSS.

Language:CSSStargazers:4Issues:1Issues:0

Model_Leaderboard

Leaderboard of AI models.

Language:HTMLLicense:Apache-2.0Stargazers:4Issues:2Issues:0

FastChat

An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

MathBook

一个较为系统的数学笔记(graduate level)

BasisLang.com

BasisLang.com

Language:HTMLStargazers:1Issues:2Issues:0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

libai

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

ProjectIvory

Project Ivory is a simple forum written a few years ago.

License:MITStargazers:0Issues:2Issues:0