Yuhong Li (leeyeehoo)

leeyeehoo

Geek Repo

Company:Burger Shot

Location:Los Santos

Home Page:http://leeyeehoo.github.io/

Twitter:@yli3521

Github PK Tool:Github PK Tool

Yuhong Li's starred repositories

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8052Issues:0Issues:0
Language:PythonStargazers:118Issues:0Issues:0

InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Language:PythonLicense:MITStargazers:212Issues:0Issues:0

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language:PythonLicense:Apache-2.0Stargazers:929Issues:0Issues:0

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:1179Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:164Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1107Issues:0Issues:0

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Language:PythonStargazers:358Issues:0Issues:0

LEval

[ACL'24] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language:PythonLicense:GPL-3.0Stargazers:282Issues:0Issues:0

fstattention

Memory bandwidth efficient sparse tree attention

Language:PythonStargazers:2Issues:0Issues:0

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language:PythonStargazers:205Issues:0Issues:0

axlearn

An Extensible Deep Learning Library

Language:PythonLicense:Apache-2.0Stargazers:926Issues:0Issues:0

EasyKV

Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)

Language:PythonStargazers:49Issues:0Issues:0

LongLM

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonLicense:MITStargazers:499Issues:0Issues:0

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language:CudaLicense:Apache-2.0Stargazers:667Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:28209Issues:0Issues:0

LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

Language:SwiftLicense:MITStargazers:952Issues:0Issues:0

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language:TypeScriptLicense:Apache-2.0Stargazers:7105Issues:0Issues:0
Language:Jupyter NotebookStargazers:406Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:6804Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3723Issues:0Issues:0

ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

Language:C++License:NOASSERTIONStargazers:3357Issues:0Issues:0

magicoder

Magicoder: Source Code Is All You Need

Language:PythonLicense:MITStargazers:1904Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:722Issues:0Issues:0

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5173Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:4568Issues:0Issues:0
Language:PythonStargazers:1627Issues:0Issues:0

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonLicense:MITStargazers:1499Issues:0Issues:0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:2311Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5257Issues:0Issues:0