俞航's repositories
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Awesome-AI-KnowWhys
Fundamental Researchers/Discoveries on How/Why AI works!
axolotl
Go ahead and axolotl questions
ColossalAI
Making large AI models cheaper, faster and more accessible
CrystalNodes
Beautify the Unreal Editor graph nodes.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
draw-a-ui
Draw a mockup and generate html for it
exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
FastGPT
FastGPT is a knowledge-based question answering system built on LLMs. It offers out-of-the-box data processing and model invocation capabilities. It also supports workflow orchestration through Flow visualization, enabling complex question-and-answer scenarios.
gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
hqq
Official implementation of Half-Quadratic Quantization (HQQ)
keras
Deep Learning for humans
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
LocalAI
Self-hosted, community-driven, local OpenAI-compatible API. Drop-in replacement for OpenAI running LLMs on consumer-grade hardware. Free, open-source OpenAI alternative. No GPU required. Runs ggml, gguf, GPTQ, onnx, and TF compatible models: llama, llama2, gpt4all, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others.
manim
Animation engine for explanatory math videos
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
stable-diffusion-webui
Stable Diffusion web UI
StableVideoP13
Fork of an older Stable Video Diffusion version that includes the P13 file
ToolBench
An open platform for training, serving, and evaluating large language models for tool learning.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
vditor
♏ An in-browser Markdown editor that supports WYSIWYG (rich text), instant rendering (Typora-like), and split-view modes.
web_search
Web search extension for text-generation-webui
XAgent
An Autonomous LLM Agent for Complex Task Solving
yet-another-retnet
A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (https://arxiv.org/pdf/2307.08621.pdf)