Beast code in Giters

qpmnh's repositories

mvda

Discriminant Analysis Algorithms

Language:Python100

2.5D-Visual-Sound

2.5D visual sound

Language:PythonCC-BY-4.0000

3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language:PythonMIT000

ADL

Attention-based Dropout Layer for Weakly Supervised Object Localization (CVPR 2019 Oral)

Language:PythonMIT000

avsd

[CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog

000

awesome-lane-detection

A paper list of lane detection.

000

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT000

Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

MIT000

co-separation

Co-Separating Sounds of Visual Objects (ICCV 2019)

CC-BY-4.0000

CSC

Category-Aware Spatial Constraint for Weakly Supervised Detection

NOASSERTION000

DANet

DANet: Divergent Activation for Weakly Supervised Object Localization，in ICCV 2019

000

Deep-Co-Clustering

Deep Co-Clustering (SDM'19)

000

Deep-multimodal-subspace-clustering-networks

Tensorflow implementation of "Deep Multimodal Subspace Clustering Networks"

Language:Python000

fair-sslime

FAIR Self-Supervised Learning Integrated Multi-modal Environment (SSLIME)

NOASSERTION000

faster-rcnn.pytorch

A faster pytorch implementation of faster r-cnn

MIT000

hiddenlayer

Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.

MIT000

Machine_based_understanding_audiovisual

Deep Learning based audiovisual data analysis

Language:Python000

moments_models

The pretrained models trained on Moments in Time Dataset

Language:PythonBSD-2-Clause000

multisensory

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Apache-2.0000

mws

Code for paper in CVPR2019, 'Multi-source weak supervision for saliency detection'

000

pvse

Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)

MIT000

pyDML

Distance Metric Learning Algorithms for Python

Language:PythonGPL-3.0000

pytorch_MELM

The pytorch implementation of the Min-Entropy Latent Model for Weakly Supervised Object Detection

000

Simplified_DMC

A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)

MIT000

Sound-Source-Localization-using-ConvLSTM

ConvLSTM is used to localize sound sources from Short Time Fourier Transform of Audio

Language:Python000

Survey_of_Deep_Metric_Learning

A comprehensive survey of deep metric learning and related works

000

Talking-Face-Generation-DAVS

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

MIT000

weakly-supervised-detection

Weakly Supervised Object Detection In Practice

000

Weakly-Supervised-Object-Localization

Weakly Supervised Object Localization Papers

000

wsod

Weakly Supervised Object Detection

MIT000