Songchen Li's starred repositories

dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

Stargazers:3474Issues:0Issues:0

DPR

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Language:PythonLicense:NOASSERTIONStargazers:1706Issues:0Issues:0

recommenders

Best Practices on Recommendation Systems

Language:PythonLicense:MITStargazers:18929Issues:0Issues:0

YabyerChessEngine

Chess Engine + GUI

Language:CStargazers:2Issues:0Issues:0

MarkdownSharp

Open source C# implementation of Markdown processor, used by Stack Overflow.

Language:C#License:MITStargazers:249Issues:0Issues:0
Language:JavaScriptStargazers:10Issues:0Issues:0

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Language:PythonStargazers:371Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26564Issues:0Issues:0

RDMI

This is the repo for remote direct memory introspection.

Language:C++License:MITStargazers:19Issues:0Issues:0

KIVI

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Language:PythonLicense:MITStargazers:220Issues:0Issues:0
Language:TeXStargazers:207Issues:0Issues:0

ebooks

收藏的一些经典的历史、政治、心理、哲学、数学、计算机方面电子书(约10万本)

Language:JavaScriptStargazers:4072Issues:0Issues:0

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:342Issues:0Issues:0

LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:634Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8886Issues:0Issues:0

chronos-forecasting

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Language:PythonLicense:Apache-2.0Stargazers:2407Issues:0Issues:0

GP-GAN

Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

Language:PythonLicense:MITStargazers:460Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:28088Issues:0Issues:0

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6418Issues:0Issues:0

COMP646_Project

COMP 646 project at Rice in spring 2024

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

Time-LLM

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:1302Issues:0Issues:0

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Language:PythonStargazers:830Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:13697Issues:0Issues:0

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonLicense:MITStargazers:600Issues:0Issues:0

system-design

A resource to help you pass system design interview and become good at work 👇

License:NOASSERTIONStargazers:12400Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11309Issues:0Issues:0

LatexToMathML

Latex转Word格式的公式

Language:HTMLStargazers:57Issues:0Issues:0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers:978Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:9088Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:677Issues:0Issues:0