DouzWang's starred repositories
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
facefusion
Industry leading face manipulation platform
yoloexplorer
YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds
llm-reasoners
A library for advanced large language model reasoning
Easy-Graph
EasyGraph is an open-source network analysis library designed to cover advanced network processing methods. It includes functionalities for detecting structural hole spanners, network embedding, and various classic network analysis techniques.
fashion-assistant
Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from images. We pass the prompt, along with the extracted features, to LLM, allowing for advanced image dataset queries.
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
RealChar
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
xxim-server
惺惺 —— 属于你的社交地盘!惺惺是一个100%开源社交平台,每个人都可以搭建自己的服务器,掌握数据的所有权。此APP非盈利项目!