zyds

Beijing

你可是处女座啊's starred repositories

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python, License: Apache-2.0, 44400

llama3

The official Meta Llama 3 GitHub site

Language: Python, License: NOASSERTION, 2122500

ReAlign

Reformatted Alignment

Language: JavaScript, 9200

Simple-Trl-Training

Fine-tune large language models with the DPO algorithm; simple and easy to get started.

Language: Python, 1400

GitHub-Chinese-Top-Charts

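:cn: GitHub Chinese Top Charts, with separate "Software | Resources" rankings for each language, to pinpoint good Chinese-language projects. Take what you need and learn efficiently.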

Language: Java, License: NOASSERTION, 9144800

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python, License: Apache-2.0, 83000

NeurIPS-WANT-submission-efficient-parallelization-layouts

Language: Python, License: NOASSERTION, 2100

EmoLLM

Mental health large language model (LLM), The Big Model of Mental Health, Finetune, InternLM2, Qwen, ChatGLM, Baichuan, DeepSeek, Mixtral, LLama3

Language: Python, License: MIT, 42600

BCEmbedding

Netease Youdao's open-source embedding and reranker models for RAG products.

Language: Python, License: Apache-2.0, 99100

llamafia.github

License: Apache-2.0, 28600

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language: Python, License: Apache-2.0, 37300

awesome-llm-interpretability

A curated list of Large Language Model (LLM) Interpretability resources.

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language: Python, License: Apache-2.0, 505400

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python, License: Apache-2.0, 191200

HuggingFace-Download-Accelerator

Use HuggingFace's official download tool for high-speed downloads from a mirror site.

Language: Python, 56400

mamba

Language: Python, License: Apache-2.0, 1017000

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language: Python, License: MIT, 49700

punica

Serving multiple LoRA finetuned LLM as one

Language: Python, License: Apache-2.0, 85600

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language: Python, License: Apache-2.0, 153800

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language: Python, License: Apache-2.0, 160100

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language: Python, License: GPL-3.0, 277100

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook, License: Apache-2.0, 1378200

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Language: Python, License: Apache-2.0, 1262400

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language: Python, License: MIT, 87300

awesome_LLMs_interview_notes

LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers.

License: MIT, 106000

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python, License: Apache-2.0, 158500

MOSS-RLHF

MOSS-RLHF

Language: Python, License: Apache-2.0, 118800

NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Language: Python, License: MIT, 33600

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python, License: Apache-2.0, 400500

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python, License: MIT, 628600