Pumpkin's starred repositories

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8746Issues:0Issues:0

awesome-model-transferability-estimation

A collection of model transferability estimation methods.

Stargazers:17Issues:0Issues:0

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookLicense:MITStargazers:6448Issues:0Issues:0
Stargazers:6Issues:0Issues:0

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonLicense:MITStargazers:559Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29262Issues:0Issues:0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3317Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:16172Issues:0Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5370Issues:0Issues:0

Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

License:Apache-2.0Stargazers:584Issues:0Issues:0

gptstore-prompts

Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.

License:CC0-1.0Stargazers:467Issues:0Issues:0

InfoBatch

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Language:PythonStargazers:301Issues:0Issues:0

buildVpn

图文教程搭建一个vpn翻墙

Stargazers:1716Issues:0Issues:0

run

润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新**人的核心宗教,核心信念。

License:CC-BY-SA-4.0Stargazers:31121Issues:0Issues:0

TrustLLM

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Language:PythonLicense:MITStargazers:361Issues:0Issues:0

factool

FacTool: Factuality Detection in Generative AI

Language:PythonLicense:Apache-2.0Stargazers:782Issues:0Issues:0

depyf

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Language:PythonLicense:MITStargazers:390Issues:0Issues:0

jiant

jiant is an nlp toolkit

Language:PythonLicense:MITStargazers:1626Issues:0Issues:0

multidiffusion-upscaler-for-automatic1111

Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0

Language:PythonLicense:NOASSERTIONStargazers:4607Issues:0Issues:0
Language:PythonStargazers:75Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34235Issues:0Issues:0

mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Language:PythonLicense:GPL-3.0Stargazers:902Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:16824Issues:0Issues:0

dreamgaussian4d

[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting

Language:PythonLicense:MITStargazers:475Issues:0Issues:0

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:28484Issues:0Issues:0

supervision

We write your reusable computer vision tools. 💜

Language:PythonLicense:MITStargazers:17872Issues:0Issues:0

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language:PythonLicense:AGPL-3.0Stargazers:4537Issues:0Issues:0

longformer

Longformer: The Long-Document Transformer

Language:PythonLicense:Apache-2.0Stargazers:2001Issues:0Issues:0

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonLicense:MITStargazers:1108Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3279Issues:0Issues:0