shaohua.zhang's starred repositories
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
DeepFaceLive
Real-time face swap for PC streaming or video calls
everyone-can-use-english
人人都能用英语
FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use.
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
ComfyUI-Workflows-ZHO
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
poster-design
一款漂亮且功能强大的在线海报设计器,图片编辑器,仿稿定设计,适用于多种场景:海报生成、电商产品图、文章长图、视频/公众号封面等。A beautiful online image designer, suitable for various scenarios like generate posters, making design easier!
llm-universe
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
mlc-MiniCPM
MiniCPM on Android platform.
language-models
pre-trained Language Models
ComfyUI-DynamiCrafterWrapper
Wrapper to use DynamiCrafter models in ComfyUI
FineControlNet
Official Pytorch Implementation of "FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection", 2023