MA YING (mayingwuhu)

0 followers | 0 following

Company: Communication University of China

MA YING's starred repositories

LivePortrait

Bring portraits to life!

Language: Python | License: MIT | Stargazers: 7005 | Issues: 0

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | Stargazers: 42588 | Issues: 0

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | Stargazers: 5828 | Issues: 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 3992 | Issues: 0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | License: Apache-2.0 | Stargazers: 2710 | Issues: 0

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language: Python | License: MIT | Stargazers: 797 | Issues: 0

LMM_caption

An attempt at dataset labeling with Large Multimodal Models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1 | Issues: 0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 4369 | Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript | License: Apache-2.0 | Stargazers: 17530 | Issues: 0

unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Language: Python | License: MIT | Stargazers: 268 | Issues: 0

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language: Python | License: Apache-2.0 | Stargazers: 1139 | Issues: 0

Language: Python | License: MIT | Stargazers: 2216 | Issues: 0

PySide6-Code-Tutorial

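Possibly the best Chinese-language PySide6 tutorial! Explains PySide6 through code examples, and shares high-quality demos, icon libraries, QSS skins, related articles, and more.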

Language: Python | License: GPL-3.0 | Stargazers: 898 | Issues: 0

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

Stargazers: 1090 | Issues: 0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers: 5825 | Issues: 0

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language: Python | License: Apache-2.0 | Stargazers: 4032 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9250 | Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4487 | Issues: 0

webrtc-stream

Simple python webrtc streaming demo

Language: Python | Stargazers: 57 | Issues: 0

ALPRO

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Language: Python | License: BSD-3-Clause | Stargazers: 185 | Issues: 0

X-CLIP

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

Language: Python | License: MIT | Stargazers: 123 | Issues: 0

XPretrain

Multi-modality pre-training

Language: Python | License: NOASSERTION | Stargazers: 454 | Issues: 0

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language: Python | License: Apache-2.0 | Stargazers: 3087 | Issues: 0

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language: Shell | License: Apache-2.0 | Stargazers: 24536 | Issues: 0

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Language: Python | License: AGPL-3.0 | Stargazers: 48661 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 129471 | Issues: 0

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language: Python | License: Apache-2.0 | Stargazers: 11888 | Issues: 0

awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

Stargazers: 565 | Issues: 0

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language: Python | License: MIT | Stargazers: 3231 | Issues: 0

XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Language: Python | License: MIT | Stargazers: 1667 | Issues: 0