Yufei (Benny) Chen's starred repositories
llama-mistral
Inference code for Mistral and Mixtral, hacked into the original Llama implementation
cumulative-reasoning
Official implementation of the paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)
AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.
tensorflow-on-raspberry-pi
TensorFlow for Raspberry Pi
physical-web
The Physical Web: walk up and use anything
vim-fugitive
fugitive.vim: A Git wrapper so awesome, it should be illegal
handsontable
JavaScript data grid with a spreadsheet look & feel. Works with React, Angular, and Vue. Supported by the Handsontable team ⚡
elasticsearch-river-mongodb
MongoDB River Plugin for ElasticSearch
NoClickJSLibrary
Turns your website into a non-click one