MA YING's starred repositories
LivePortrait
Bring portraits to life!
fish-speech
Brand new TTS solution
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
LMM_caption
An attempt at dataset labeling with Large Multimodal Models
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
PySide6-Code-Tutorial
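Possibly the best Chinese-language PySide6 tutorial! Explains PySide6 through code examples, with high-quality demos, icon libraries, QSS skins, related articles, and more.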
CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
webrtc-stream
Simple Python WebRTC streaming demo
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.