wuxiaobo's repositories
myflashtext
快速的中文字符串匹配小工具
albert-text-classification
albert for text classification with tf2
Alpaca-family-library
Summarize all low-cost replication methods for Chatgpt. It is believed that with the improvement of data and model fine-tuning techniques, small models suitable for various segmented fields will continue to emerge and have better performance.
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
ChatGPTX-Uni
实现一种单/多Lora-Fusion权值交叉融合+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为Chatglm6B底座模型,LLM-X是LLAMA增强模型。该方案简易高效,目标是使此类语言模型能够低能耗广泛部署,并最终在小模型的基座上发生“智能涌现”,力图最小计算代价达成ChatGPT、GPT4、ChatRWKV等人类友好亲和效果。后期将以此为中心模型大脑Agent,进一步融入并指挥CV目标检测、文本图像生成、语音命令交互等执行模型。当前可以满足总结、提问、问答、摘要、改写、评论、扮演等各种需求。
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地部署 (Chinese LLaMA & Alpaca LLMs)
ColossalAI
Making large AI models cheaper, faster and more accessible
DeepLearningSystem
Deep Learning System core principles introduction.
hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程,支持 Python, C++, Java, C#, Go, Swift, JS, TS, Dart, Rust, C, Zig 等语言。English edition ongoing
InstructDS
EMNLP 2023: Instructive Dialogue Summarization with Query Aggregations
LKY_OfficeTools
一键自动化 下载、安装、激活 Office 的利器。
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
llm-inference-benchmark
LLM Inference benchmark
lobe-chat
🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
OpenRLHF
A Ray-based High-performance RLHF framework (Support 70B+ full tuning & LoRA & Mixtral)
parler-tts
Inference and training library for high-quality TTS models.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
surya
Accurate line-level text detection and recognition (OCR) in any language
tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
vector-search
The definitive guide to using Vector Search to solve your semantic search production workload needs.