FoolishMao's starred repositories
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
PP-Vehicle-onnxrun-cpp-py
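Deploys Baidu PaddlePaddle's open-source PP-Vehicle vehicle analysis with ONNXRuntime, covering five functions: vehicle detection, vehicle type recognition, vehicle color recognition, license plate detection, and license plate recognition. Runs without depending on PaddlePaddle and includes both C++ and Python versions.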
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
GuoFeng-Webnovel
Multilingual Corpus of Web Fiction
yolov5-face
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931, ECCV Workshops 2022)
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
SadTalker-Video-Lip-Sync
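This project implements Wav2lip-style video lip synthesis based on SadTalker. It generates speech-driven lip shapes from a video file and applies a configurable enhancement to the synthesized lip (face) region to improve the clarity of the generated lips. It uses the DAIN deep learning frame interpolation algorithm to add frames to the generated video, filling in the motion transitions of the synthesized lips between frames so that they look smoother, more realistic, and more natural.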
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
RWKV-Runner
An RWKV management and startup tool, fully automated and only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.