maodong2056's repositories
activity_recognition
Activity Recognition Model Based On PyTorch
activitynet-2016-cvprw
Tools to participate in the ActivityNet Challenge 2016
AI-Challenger-Caption-Competition
AI CHALLENGER 全球AI挑战赛 图像中文描述
AI_Challenger_2017
AI Challenger, a platform for open datasets and programming competitions to artificial intelligence (AI) talents around the world.
AI_challenger_Chinese_Caption
Repository for image caption for Chinese
CBR
Cascaded Boundary Regression For Temporal Action Detection
CompactBilinearPooling-Pytorch
A Pytorch Implementation for Compact Bilinear Pooling.
CVPR2018_attention
Context Encoding for Semantic Segmentation MegaDepth: Learning Single-View Depth Prediction from Internet Photos LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume On the Robustness of Semantic Segmentation Models to Adversarial Attacks SPLATNet: Sparse Lattice Networks for Point Cloud Processing Left-Right Comparative Recurrent Model for Stereo Matching Enhancing the Spatial Resolution of Stereo Images using a Parallax Prior Unsupervised CCA Discovering Point Lights with Intensity Distance Fields CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation Learning a Discriminative Feature Network for Semantic Segmentation Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi- Supervised Semantic Segmentation Unsupervised Deep Generative Adversarial Hashing Network Monocular Relative Depth Perception with Web Stereo Data Supervision Single Image Reflection Separation with Perceptual Losses Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains EPINET: A Fully-Convolutional Neural Network for Light Field Depth Estimation by Using Epipolar Geometry FoldingNet: Interpretable Unsupervised Learning on 3D Point Clouds Decorrelated Batch Normalization Unsupervised Learning of Depth and Egomotion from Monocular Video Using 3D Geometric Constraints PU-Net: Point Cloud Upsampling Network Real-Time Monocular Depth Estimation using Synthetic Data with Domain Adaptation via Image Style Transfer Tell Me Where To Look: Guided Attention Inference Network Residual Dense Network for Image Super-Resolution Reflection Removal for Large-Scale 3D Point Clouds PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image Fully Convolutional Adaptation Networks for Semantic Segmentation CRRN: Multi-Scale Guided Concurrent Reflection Removal Network DenseASPP: Densely Connected Networks for Semantic Segmentation SGAN: An Alternative Training of Generative Adversarial Networks Multi-Agent Diverse Generative Adversarial Networks Robust Depth Estimation from Auto Bracketed Images AdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation DeepMVS: Learning Multi-View Stereopsis GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation Single-Image Depth Estimation Based on Fourier Domain Analysis Single View Stereo Matching Pyramid Stereo Matching Network A Unifying Contrast Maximization Framework for Event Cameras, with Applications to Motion, Depth, and Optical Flow Estimation Image Correction via Deep Reciprocating HDR Transformation Occlusion Aware Unsupervised Learning of Optical Flow PAD-Net: Multi-Tasks Guided Prediciton-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing Surface Networks Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation TextureGAN: Controlling Deep Image Synthesis with Texture Patches Aperture Supervision for Monocular Depth Estimation Two-Stream Convolutional Networks for Dynamic Texture Synthesis Unsupervised Learning of Single View Depth Estimation and Visual Odometry with Deep Feature Reconstruction Left/Right Asymmetric Layer Skippable Networks Learning to See in the Dark
flownet2
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
fluent-cap
code for fluency-guided cross-lingual image captioning
GestureRecognition-PyTorch
Action recognition network -- CNN + LSTM.
Hidden-Two-Stream
Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"
jpeg4py
Python cffi libjpeg-turbo bindings and helper classes
kinetics-i3d
Convolutional neural network model for video classification trained on the Kinetics dataset.
neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
py-denseflow
Extract TVL1 optical flows in python (multi-process && multi-server)
scnn
Segment-CNN: A Framework for Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
spatial-temporal-panel
Pytorch implementation of (CDC) Convoluton-De-Convolution network.
Squeeze-Excitation-PyTorch
Squeeze-and-Excitation module for convolutional neural networks written in PyTorch
SST-Tensorflow
Tensorflow Implementation of the Paper "SST: Single-Stream Temporal Action Proposals" in CVPR 2017.
train_cnn-rnn-attention
cnn+rnn+attention: vgg(vgg16,vgg19)+rnn(LSTM, GRU)+attention, resnet(resnet_v2_50,resnet_v2_101,resnet_v2_152)+rnnrnn(LSTM, GRU)+attention, inception_v4+rnn(LSTM, GRU)+attention, inception_resnet_v2+rnn(LSTM, GRU)+attention,.....
tsne-viz
Python Code For t-SNE Visualization
TURN-TAP
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
two-stream-action-recognition
Using two stream architecture to implement a classic action recognition method on UCF101 dataset
two-stream-fcan
The repository to contain codes and models for paper "Two-stream Flow-guided Convolutional Attention Networks for Action Recognition".
two-stream-pytorch
PyTorch implementation of two-stream networks for video action recognition
TwoStreamFusion-1
just run the model.py to train