qpmnh's repositories

acoustic-images-self-supervision

Code for the paper "Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning" ECCV 2020

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

-Audio-visualization

使用快速傅里叶变换(FFT)实现的音频文件的可视化

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

AudioDVP

AudioDVP:Photorealistic Audio-driven Video Portraits

Language:PythonStargazers:0Issues:1Issues:0

AVID-CMA

Audio Visual Instance Discrimination with Cross-Modal Agreement

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

AVVP-ECCV20

Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)

Language:PythonStargazers:0Issues:0Issues:0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers:0Issues:1Issues:0

awesome-computer-vision

A curated list of awesome computer vision resources

Stargazers:0Issues:0Issues:0

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

Stargazers:0Issues:0Issues:0

CM-ACC

Cross-model active contrastive coding

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deep_avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepClustering

Methods and Implements of Deep Clustering

Stargazers:0Issues:0Issues:0

KWS-Net

Seeing Wake Words: Audio-visual Keyword Spotting

License:MITStargazers:0Issues:0Issues:0

Localizing-Visual-Sounds-the-Hard-Way

Localizing Visual Sounds the Hard Way

License:Apache-2.0Stargazers:0Issues:0Issues:0

Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

OpenSelfSup

Self-Supervised Learning Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PSOL

code repository of “Rethinking the Route Towards Weakly Supervised Object Localization” in CVPR 2020

Stargazers:0Issues:0Issues:0

PSP_CVPR_2021

A PyTorch implementation of the CVPR-2021 paper: Positive Sample Propagation along the Audio-Visual Event Line.

Stargazers:0Issues:0Issues:0

qpmnh.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

Separating-Sounds-from-a-Single-Image

PyTorch implementation of "Separating Sounds from a Single Image"

License:MITStargazers:0Issues:0Issues:0

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:SCSSStargazers:0Issues:0Issues:0

Talking-Face_PC-AVS

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

VGGSound

VGGSound: A Large-scale Audio-Visual Dataset

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0