qpmnh's repositories
2.5D-Visual-Sound
2.5D visual sound
3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
ADL
Attention-based Dropout Layer for Weakly Supervised Object Localization (CVPR 2019 Oral)
avsd
[CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog
awesome-lane-detection
A paper list of lane detection.
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Awsome-Deep-Learning-for-Video-Analysis
Papers, code and datasets about deep learning and multi-modal learning for video analysis
co-separation
Co-Separating Sounds of Visual Objects (ICCV 2019)
CSC
Category-Aware Spatial Constraint for Weakly Supervised Detection
DANet
DANet: Divergent Activation for Weakly Supervised Object Localization,in ICCV 2019
Deep-Co-Clustering
Deep Co-Clustering (SDM'19)
Deep-multimodal-subspace-clustering-networks
Tensorflow implementation of "Deep Multimodal Subspace Clustering Networks"
fair-sslime
FAIR Self-Supervised Learning Integrated Multi-modal Environment (SSLIME)
faster-rcnn.pytorch
A faster pytorch implementation of faster r-cnn
hiddenlayer
Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.
Machine_based_understanding_audiovisual
Deep Learning based audiovisual data analysis
moments_models
The pretrained models trained on Moments in Time Dataset
multisensory
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
mws
Code for paper in CVPR2019, 'Multi-source weak supervision for saliency detection'
pvse
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
pyDML
Distance Metric Learning Algorithms for Python
pytorch_MELM
The pytorch implementation of the Min-Entropy Latent Model for Weakly Supervised Object Detection
Simplified_DMC
A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)
Sound-Source-Localization-using-ConvLSTM
ConvLSTM is used to localize sound sources from Short Time Fourier Transform of Audio
Survey_of_Deep_Metric_Learning
A comprehensive survey of deep metric learning and related works
Talking-Face-Generation-DAVS
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)
weakly-supervised-detection
Weakly Supervised Object Detection In Practice
Weakly-Supervised-Object-Localization
Weakly Supervised Object Localization Papers
wsod
Weakly Supervised Object Detection