mdswyz's starred repositories

Language: HTML | License: MIT | Stargazers: 40 | Issues: 0

awesome-flow-matching

A summary of related work on flow matching and stochastic interpolants

Stargazers: 150 | Issues: 0

cv

Geoff Boeing's academic CV in LaTeX

Language: TeX | License: MIT | Stargazers: 288 | Issues: 0

run

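The globally designated official GitHub of "Run studies" (润学, the study of emigrating). It compiles the aims, principles, theory, and real-world examples of "running"; addresses the three major questions of why to run, where to run, and how to run; and aims to become the core religion and core belief of the new ** people.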

License: CC-BY-SA-4.0 | Stargazers: 30995 | Issues: 0

IMDer

An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)

Language: Python | License: MIT | Stargazers: 21 | Issues: 0

DMD

An official implementation of "Decoupled Multimodal Distilling for Emotion Recognition" in PyTorch. (CVPR 2023 highlight)

Language: Python | License: MIT | Stargazers: 73 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 0

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language: Python | License: Apache-2.0 | Stargazers: 3477 | Issues: 0

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 6890 | Issues: 0

Awesome-Video-Diffusion-Models

[arXiv] A Survey on Video Diffusion Models

Stargazers: 1436 | Issues: 0
Language: Python | License: MIT | Stargazers: 577 | Issues: 0

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language: Python | License: Apache-2.0 | Stargazers: 1345 | Issues: 0

DiCMoR

An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6306 | Issues: 0

loveu-tgve-2023

Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.

Language: Python | License: Apache-2.0 | Stargazers: 66 | Issues: 0

chatgpt-prompts-for-academic-writing

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

Stargazers: 2547 | Issues: 0

DARKFFHQ

A benchmark for face hallucination in low-light scenarios (part of "Learning to Hallucinate Face in the Dark", IEEE TMM 2023)

Stargazers: 2 | Issues: 0

controlvideo

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Language: Python | License: Apache-2.0 | Stargazers: 208 | Issues: 0

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook | License: MIT | Stargazers: 1064 | Issues: 0

vid2vid-zero

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Language: Python | Stargazers: 326 | Issues: 0

video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to video generation, in PyTorch

Language: Python | License: MIT | Stargazers: 1156 | Issues: 0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers: 2638 | Issues: 0

webvid

Large-scale text-video dataset. 10 million captioned short videos.

Language: Python | Stargazers: 528 | Issues: 0

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 4123 | Issues: 0

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.

Language: Python | License: MIT | Stargazers: 3371 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 1645 | Issues: 0

X-LLM

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Language: Python | License: Apache-2.0 | Stargazers: 290 | Issues: 0

MERTools

Toolkits for Multimodal Emotion Recognition

Language: Python | Stargazers: 128 | Issues: 0

Transformer-MM-Explainability

[ICCV 2021 Oral] Official PyTorch implementation of Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Includes examples for DETR and VQA.

Language: Jupyter Notebook | License: MIT | Stargazers: 728 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 44950 | Issues: 0