huwenxing's starred repositories

Language: Python | License: Apache-2.0 | Stargazers: 54 | Issues: 0

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

License: Apache-2.0 | Stargazers: 4134 | Issues: 0

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Language: Jupyter Notebook | License: MIT | Stargazers: 81 | Issues: 0
Language: Python | License: MIT | Stargazers: 47 | Issues: 0

Shadowrocket-ADBlock-Rules-Forever

Provides multiple Shadowrocket rule sets with powerful ad filtering. The rules are rebuilt daily at 8:00.

License: NOASSERTION | Stargazers: 12373 | Issues: 0

FILM

Official repo for "Make Your LLM Fully Utilize the Context"

Language: Python | License: MIT | Stargazers: 239 | Issues: 0

XVERSE-MoE-A4.2B

XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.

Language: Python | License: Apache-2.0 | Stargazers: 35 | Issues: 0

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 32019 | Issues: 0

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | Stargazers: 39927 | Issues: 0

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4612 | Issues: 0

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language: Python | License: NOASSERTION | Stargazers: 2498 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49464 | Issues: 0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | Stargazers: 1213 | Issues: 0
Language: Jupyter Notebook | Stargazers: 87 | Issues: 0

ScalingAlignment

The official implementation of "Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment".

Language: Python | License: GPL-3.0 | Stargazers: 7 | Issues: 0
Language: Python | License: MIT | Stargazers: 4020 | Issues: 0

Awesome-LLMs-Datasets

A summary of existing representative LLM text datasets.

License: Apache-2.0 | Stargazers: 844 | Issues: 0

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6987 | Issues: 0

Long-Context-Data-Engineering

Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"

Language: Python | Stargazers: 425 | Issues: 0

Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 8710 | Issues: 0

TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Language: Python | License: MIT | Stargazers: 359 | Issues: 0

LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

Language: Python | License: Apache-2.0 | Stargazers: 470 | Issues: 0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2238 | Issues: 0

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Language: Python | License: Apache-2.0 | Stargazers: 862 | Issues: 0

EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Language: Python | License: Apache-2.0 | Stargazers: 786 | Issues: 0

RoleEval

A Bilingual Role Evaluation Benchmark for Large Language Models

Stargazers: 33 | Issues: 0

Chat-Haruhi-Suzumiya

Chat-Haruhi-Suzumiya (Chat凉宫春日): an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1787 | Issues: 0

sodaverse

🥤🧑🏻‍🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"

Language: Python | License: MIT | Stargazers: 218 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 2643 | Issues: 0

NLP-Movie_Scripts

Trying to predict a movie's success based on the script (before filming)

Language: Jupyter Notebook | Stargazers: 33 | Issues: 0