Li Lai's starred repositories

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 7214 | Issues: 0

streamv2v

Official PyTorch implementation of StreamV2V.

Language: Python | License: NOASSERTION | Stargazers: 399 | Issues: 0

sharik

Sharik is an open-source, cross-platform solution for sharing files via Wi-Fi or Mobile Hotspot

Language: Dart | License: MIT | Stargazers: 1132 | Issues: 0

stb

stb single-file public domain libraries for C/C++

Language: C | License: NOASSERTION | Stargazers: 25871 | Issues: 0

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language: Python | License: MIT | Stargazers: 798 | Issues: 0

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4317 | Issues: 0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python | License: Apache-2.0 | Stargazers: 1790 | Issues: 0
Language: Python | Stargazers: 760 | Issues: 0

fastllm

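A pure C++, cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile phones.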

Language: C++ | License: Apache-2.0 | Stargazers: 3215 | Issues: 0

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | Stargazers: 5913 | Issues: 0

lightning-whisper-mlx

An extremely fast implementation of whisper optimized for Apple Silicon using MLX.

Language: Python | Stargazers: 473 | Issues: 0

DouyinLiveRecorder

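Live-stream recording software that supports looped, unattended monitoring and recording of multiple streamers. Supports recording from Douyin, TikTok, Kuaishou, Huya, Douyu, Bilibili, Xiaohongshu, pandatv, afreecatv, flextv, popkontv, twitcasting, winktv, Baidu, Weibo, Kugou, Huajiao, Liuxing, Twitch, and other platforms.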

Language: Python | License: MIT | Stargazers: 3849 | Issues: 0

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

License: Apache-2.0 | Stargazers: 737 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 12341 | Issues: 0

SCB-dataset

Student Classroom Behavior dataset

Language: Python | Stargazers: 117 | Issues: 0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python | License: Apache-2.0 | Stargazers: 3365 | Issues: 0

Steel-LLM

Train a Chinese LLM from scratch as an individual

Language: Jupyter Notebook | Stargazers: 114 | Issues: 0

kungfu

Kungfu Trader

Language: C++ | License: Apache-2.0 | Stargazers: 3328 | Issues: 0

data-science-competition

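This repository records the source code of the author's award-winning solutions from major data science competitions, along with original baselines for some new competitions. It mainly covers Kaggle, Alibaba Tianchi, the Huawei Cloud competition (campus track), Baidu AI Studio, the Heywhale community, DataFountain, and others.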

Language: Python | Stargazers: 1291 | Issues: 0

baby-llama2-chinese

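A repository for pretraining from scratch and then SFT-tuning a small-parameter Chinese LLaMA 2. A single 24 GB GPU is enough to obtain a chat-llama2 with basic Chinese question-answering ability.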

Language: Python | License: MIT | Stargazers: 2343 | Issues: 0

labelU

A data annotation toolbox that supports image, audio, and video data.

Language: Python | Stargazers: 271 | Issues: 0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language: Python | License: MIT | Stargazers: 4015 | Issues: 0

CLIP-Chinese

Chinese CLIP pretrained model

Language: Python | Stargazers: 369 | Issues: 0

Open-GroundingDino

This is a third-party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Language: Python | License: MIT | Stargazers: 318 | Issues: 0

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4230 | Issues: 0

TinyRAG

TinyRAG

Language: Python | Stargazers: 194 | Issues: 0

MFTCoder

A high-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.

Language: Python | License: NOASSERTION | Stargazers: 597 | Issues: 0

MNBVC

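MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.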

License: MIT | Stargazers: 3239 | Issues: 0

ChatGPTBook

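"ChatGPT Principles and Practice: Algorithms, Techniques, and Private Deployment of Large Language Models" (《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》)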

Language: Python | License: Apache-2.0 | Stargazers: 306 | Issues: 0

bytepiece

A purer tokenizer with a higher compression rate

Language: Python | License: Apache-2.0 | Stargazers: 429 | Issues: 0