qpmnh's repositories
acoustic-images-self-supervision
Code for the paper "Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning" ECCV 2020
solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
-Audio-visualization
Visualization of audio files implemented with the Fast Fourier Transform (FFT)
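The FFT-based visualization described above boils down to computing a magnitude spectrum. A minimal sketch in NumPy (function and variable names are illustrative, not taken from the repository):

```python
import numpy as np

def magnitude_spectrum(signal, sample_rate):
    """Return (frequencies, magnitudes) for a real-valued mono signal."""
    # rfft keeps only the non-negative frequency bins of a real signal
    magnitudes = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    return freqs, magnitudes

# Example: one second of a 440 Hz sine sampled at 8 kHz;
# the strongest bin lands at 440 Hz.
sr = 8000
t = np.arange(sr) / sr
freqs, mags = magnitude_spectrum(np.sin(2 * np.pi * 440 * t), sr)
peak_hz = freqs[np.argmax(mags)]
```

Plotting `mags` against `freqs` (e.g. with matplotlib) yields the familiar spectrum view; a spectrogram repeats this over short windows.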
AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
awesome-computer-vision
A curated list of awesome computer vision resources
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
CM-ACC
Cross-modal active contrastive coding
deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
DeepClustering
Methods and implementations of deep clustering
KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
Localizing-Visual-Sounds-the-Hard-Way
Localizing Visual Sounds the Hard Way
Mead
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]
moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
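MoCo's key idea is a slowly moving "key" encoder updated as an exponential moving average of the "query" encoder. A minimal sketch of that momentum update with NumPy arrays standing in for parameter tensors (illustrative only, not the repository's code; `m = 0.999` is the paper's default):

```python
import numpy as np

def momentum_update(key_params, query_params, m=0.999):
    """theta_k <- m * theta_k + (1 - m) * theta_q, per parameter tensor."""
    return [m * k + (1.0 - m) * q for k, q in zip(key_params, query_params)]

# Toy example with a single 3-element "parameter tensor" and m = 0.9
key = [np.zeros(3)]
query = [np.ones(3)]
key = momentum_update(key, query, m=0.9)
```

Because `m` is close to 1, the key encoder drifts slowly toward the query encoder, which keeps the dictionary of encoded keys consistent across training steps.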
OpenSelfSup
Self-Supervised Learning Toolbox and Benchmark
PSOL
Code repository of "Rethinking the Route Towards Weakly Supervised Object Localization" (CVPR 2020)
PSP_CVPR_2021
A PyTorch implementation of the CVPR-2021 paper: Positive Sample Propagation along the Audio-Visual Event Line.
qpmnh.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Separating-Sounds-from-a-Single-Image
PyTorch implementation of "Separating Sounds from a Single Image"
sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency