shaohua.zhang's repositories
Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image (uncensored)
Streamer-Sales
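Streamer-Sales (销冠, "Top Seller"): a sales-livestreamer LLM 🛒🎁 that presents a product based on its given features, with the aim of sparking viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 It also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️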
PyTorch-Tutorial-2nd
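《Pytorch实用教程》 (Practical PyTorch Tutorial), 2nd edition. Whether you are starting from scratch, applying PyTorch to CV, NLP, or LLM projects, or moving on to engineering and production deployment, it is all covered here. With this book's help, readers should be able to master PyTorch with ease and become excellent deep learning engineers.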
llm_babyCare
Parenting handbook
VIMER
Repository of visual pre-trained foundation models
awesome-digital-human
A collection of resources on digital humans, including clothed people digitalization, virtual try-on, and other related directions.
paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
cat-catch
猫抓 (cat-catch): a Chrome extension for sniffing media resources
SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Topic-on-Table-Recognition
This is a survey on the topic of table recognition
Pix2Text
Pix In, Latex & Text Out. Recognize Chinese, English Texts, and Math Formulas from Images.
sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
CVprojects
Computer vision projects | fun AI projects related to computer vision (Python, C++)
WenxinWorkshop-Python-SDK
A third-party Python SDK for the Wenxin Qianfan (WenxinWorkshop) platform.
inst-inpaint
A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.
gpt-researcher
GPT based autonomous agent that does online comprehensive research on any given topic
DeepFaceLive
Real-time face swap for PC streaming or video calls
RingRWKV
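Fixes RWKV compatibility issues in the official Transformer library, so that all RWKV model series, once converted, can be deployed and fine-tuned through the RingRWKV library as simply and conveniently as other transformer models.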
Serving
A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)
DAVAR-Lab-OCR
Implementations of some works from Davar-Lab. Currently we have the code of Text Perceptron (AAAI 2020). Code for other works will be published soon, including YORO (ACMMM 2019), TRIE (ACMMM 2020), FREE (TIP 2020), SPIN (AAAI 2021), MANGO (AAAI 2021), etc.
char-detection
🔥 Character (single-character) detection based on CRNN
Code-LMs
Guide to using pre-trained large language models of source code
OMML
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
CenterNet
Object detection, 3D detection, and pose estimation using center point detection
TableGeneration
Generates table images by rendering them in a browser
danbooru-diffusion-prompt-builder
Danbooru / NovelAI tag supermarket
AlphX-Code-For-DAR
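Source code of team Alphx for the Ancient Book Document Image Recognition and Analysis Algorithm Competition, part of the Guangdong-Hong Kong-Macao Greater Bay Area (Huangpu) International Algorithm Case Competition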