MXuer's starred repositories

hello-algo

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Language:JavaLicense:NOASSERTIONStargazers:79951Issues:459Issues:188

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:33416Issues:236Issues:4419

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:25912Issues:175Issues:4177

awesome-cto

A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9703Issues:84Issues:246

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

pydub

Manipulate audio with a simple and easy high level interface

Language:PythonLicense:MITStargazers:8581Issues:135Issues:567

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7711Issues:108Issues:439

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7312Issues:110Issues:150

starcoder

Home of StarCoder: fine-tuning & inference!

Language:PythonLicense:Apache-2.0Stargazers:7204Issues:69Issues:141

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

insanely-fast-whisper

Incredibly fast Whisper-large-v3

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1818Issues:14Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1233Issues:17Issues:82

notebooks

code for deep learning courses

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1062Issues:28Issues:1

hands-on-rl

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Language:Jupyter NotebookLicense:MITStargazers:1014Issues:19Issues:3

VSCode-Zhihu

Zhihu extension built on vscode.

Language:TypeScriptLicense:MITStargazers:849Issues:8Issues:153

energy-forecasting

🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 2.5 𝘩𝘰𝘶𝘳𝘴 𝘰𝘧 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 & 𝘷𝘪𝘥𝘦𝘰 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴

Language:PythonLicense:MITStargazers:816Issues:14Issues:23

the-book-of-modern-cpp

The Book of Modern C++

ClashPro

windows, iOS, MacOS released

X-LLM

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Language:PythonLicense:Apache-2.0Stargazers:296Issues:10Issues:15

tinyllama

A tiny x86 retro computer

Language:AssemblyLicense:GPL-3.0Stargazers:275Issues:10Issues:2

coedit

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)

optimized_transducer

Memory efficient transducer loss computation

Language:CMakeLicense:NOASSERTIONStargazers:68Issues:8Issues:7