| Bridging the Gap Between Value and Policy Based Reinforcement Learning | NIPS | code | 46593 |
| REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models | NIPS | code | 46593 |
| Focal Loss for Dense Object Detection | ICCV | code | 18356 |
| Mask R-CNN | ICCV | code | 9493 |
| Deep Photo Style Transfer | CVPR | code | 8655 |
| LightGBM: A Highly Efficient Gradient Boosting Decision Tree | NIPS | code | 7536 |
| Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | NIPS | code | 6449 |
| Attention is All you Need | NIPS | code | 6288 |
| Large Pose 3D Face Reconstruction From a Single Image via Direct Volumetric CNN Regression | ICCV | code | 3354 |
| Densely Connected Convolutional Networks | CVPR | code | 3130 |
| A Unified Approach to Interpreting Model Predictions | NIPS | code | 3122 |
| Deformable Convolutional Networks | ICCV | code | 2165 |
| ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games | NIPS | code | 1823 |
| PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation | CVPR | code | 1523 |
| Improved Training of Wasserstein GANs | NIPS | code | 1405 |
| Fully Convolutional Instance-Aware Semantic Segmentation | CVPR | code | 1395 |
| Aggregated Residual Transformations for Deep Neural Networks | CVPR | code | 1361 |
| Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | CVPR | code | 1301 |
| Unsupervised Image-to-Image Translation Networks | NIPS | code | 1205 |
| Photographic Image Synthesis With Cascaded Refinement Networks | ICCV | code | 1142 |
| High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis | CVPR | code | 1072 |
| SphereFace: Deep Hypersphere Embedding for Face Recognition | CVPR | code | 1048 |
| Deep Feature Flow for Video Recognition | CVPR | code | 966 |
| Bayesian GAN | NIPS | code | 942 |
| Pyramid Scene Parsing Network | CVPR | code | 934 |
| Efficient Modeling of Latent Information in Supervised Learning using Gaussian Processes | NIPS | code | 906 |
| Finding Tiny Faces | CVPR | code | 856 |
| Toward Multimodal Image-to-Image Translation | NIPS | code | 794 |
| Learning to Discover Cross-Domain Relations with Generative Adversarial Networks | ICML | code | 784 |
| YOLO9000: Better, Faster, Stronger | CVPR | code | 773 |
| PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space | NIPS | code | 772 |
| Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks | ICML | code | 729 |
| FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks | CVPR | code | 720 |
| Channel Pruning for Accelerating Very Deep Neural Networks | ICCV | code | 649 |
| Dilated Residual Networks | CVPR | code | 640 |
| Inferring and Executing Programs for Visual Reasoning | ICCV | code | 636 |
| DSOD: Learning Deeply Supervised Object Detectors From Scratch | ICCV | code | 582 |
| Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization | ICCV | code | 572 |
| Accelerating Eulerian Fluid Simulation With Convolutional Networks | ICML | code | 570 |
| Learning Disentangled Representations with Semi-Supervised Deep Generative Models | NIPS | code | 556 |
| Inductive Representation Learning on Large Graphs | NIPS | code | 552 |
| Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network | CVPR | code | 537 |
| How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks) | ICCV | code | 526 |
| SSH: Single Stage Headless Face Detector | ICCV | code | 515 |
| Learning From Simulated and Unsupervised Images Through Adversarial Training | CVPR | code | 492 |
| Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space | CVPR | code | 487 |
| Video Frame Interpolation via Adaptive Convolution | CVPR | code | 482 |
| Video Frame Interpolation via Adaptive Separable Convolution | ICCV | code | 482 |
| GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence | CVPR | code | 460 |
| Joint Detection and Identification Feature Learning for Person Search | CVPR | code | 459 |
| Dual Path Networks | NIPS | code | 451 |
| Flow-Guided Feature Aggregation for Video Object Detection | ICCV | code | 436 |
| Deep Image Matting | CVPR | code | 434 |
| Richer Convolutional Features for Edge Detection | CVPR | code | 399 |
| Annotating Object Instances With a Polygon-RNN | CVPR | code | 397 |
| Recurrent Highway Networks | ICML | code | 397 |
| Detect to Track and Track to Detect | ICCV | code | 387 |
| RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation | CVPR | code | 379 |
| Detecting Oriented Text in Natural Images by Linking Segments | CVPR | code | 364 |
| Deep Lattice Networks and Partial Monotonic Functions | NIPS | code | 349 |
| Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results | NIPS | code | 347 |
| RON: Reverse Connection With Objectness Prior Networks for Object Detection | CVPR | code | 345 |
| Universal Style Transfer via Feature Transforms | NIPS | code | 344 |
| Residual Attention Network for Image Classification | CVPR | code | 329 |
| One-Shot Video Object Segmentation | CVPR | code | 316 |
| Accurate Single Stage Detector Using Recurrent Rolling Convolution | CVPR | code | 314 |
| Feature Pyramid Networks for Object Detection | CVPR | code | 310 |
| Efficient softmax approximation for GPUs | ICML | code | 304 |
| OctNet: Learning Deep 3D Representations at High Resolutions | CVPR | code | 302 |
| Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution | CVPR | code | 301 |
| Pixel Recursive Super Resolution | ICCV | code | 301 |
| Self-Critical Sequence Training for Image Captioning | CVPR | code | 299 |
| Age Progression/Regression by Conditional Adversarial Autoencoder | CVPR | code | 297 |
| Style Transfer from Non-Parallel Text by Cross-Alignment | NIPS | code | 296 |
| Dilated Recurrent Neural Networks | NIPS | code | 285 |
| Lifting From the Deep: Convolutional 3D Pose Estimation From a Single Image | CVPR | code | 280 |
| DeepBach: a Steerable Model for Bach Chorales Generation | ICML | code | 276 |
| The Predictron: End-To-End Learning and Planning | ICML | code | 274 |
| Convolutional Sequence to Sequence Learning | ICML | code | 258 |
| OptNet: Differentiable Optimization as a Layer in Neural Networks | ICML | code | 245 |
| Prototypical Networks for Few-shot Learning | NIPS | code | 244 |
| Deep Voice: Real-time Neural Text-to-Speech | ICML | code | 242 |
| Reinforcement Learning with Deep Energy-Based Policies | ICML | code | 233 |
| Learning Deep CNN Denoiser Prior for Image Restoration | CVPR | code | 231 |
| GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium | NIPS | code | 229 |
| A Point Set Generation Network for 3D Object Reconstruction From a Single Image | CVPR | code | 228 |
| Deeply Supervised Salient Object Detection With Short Connections | CVPR | code | 228 |
| BlitzNet: A Real-Time Deep Network for Scene Understanding | ICCV | code | 227 |
| Language Modeling with Gated Convolutional Networks | ICML | code | 221 |
| Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro | ICCV | code | 215 |
| Stacked Generative Adversarial Networks | CVPR | code | 215 |
| RMPE: Regional Multi-Person Pose Estimation | ICCV | code | 215 |
| Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning | CVPR | code | 214 |
| Generative Face Completion | CVPR | code | 212 |
| VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition | ICCV | code | 210 |
| The Reversible Residual Network: Backpropagation Without Storing Activations | NIPS | code | 210 |
| Recurrent Scale Approximation for Object Detection in CNN | ICCV | code | 209 |
| Learning From Synthetic Humans | CVPR | code | 207 |
| Spatially Adaptive Computation Time for Residual Networks | CVPR | code | 203 |
| Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis | ICCV | code | 202 |
| 3D Bounding Box Estimation Using Deep Learning and Geometry | CVPR | code | 200 |
| Multi-View 3D Object Detection Network for Autonomous Driving | CVPR | code | 199 |
| Visual Dialog | CVPR | code | 199 |
| Interpretable Explanations of Black Boxes by Meaningful Perturbation | ICCV | code | 192 |
| Inverse Compositional Spatial Transformer Networks | CVPR | code | 189 |
| FastMask: Segment Multi-Scale Object Candidates in One Shot | CVPR | code | 189 |
| OnACID: Online Analysis of Calcium Imaging Data in Real Time | NIPS | code | 189 |
| Semantic Scene Completion From a Single Depth Image | CVPR | code | 188 |
| Learning Efficient Convolutional Networks Through Network Slimming | ICCV | code | 186 |
| Learning Feature Pyramids for Human Pose Estimation | ICCV | code | 185 |
| Be Your Own Prada: Fashion Synthesis With Structural Coherence | ICCV | code | 183 |
| Scene Graph Generation by Iterative Message Passing | CVPR | code | 182 |
| Fast Image Processing With Fully-Convolutional Networks | ICCV | code | 180 |
| Learning Multiple Tasks with Multilinear Relationship Networks | NIPS | code | 178 |
| Learning to Reason: End-To-End Module Networks for Visual Question Answering | ICCV | code | 178 |
| Single Shot Text Detector With Regional Attention | ICCV | code | 176 |
| Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources | ICCV | code | 175 |
| Deep Feature Interpolation for Image Content Changes | CVPR | code | 170 |
| On Human Motion Prediction Using Recurrent Neural Networks | CVPR | code | 167 |
| Image Super-Resolution via Deep Recursive Residual Network | CVPR | code | 163 |
| Learning Cross-Modal Embeddings for Cooking Recipes and Food Images | CVPR | code | 160 |
| Input Convex Neural Networks | ICML | code | 159 |
| Simple Does It: Weakly Supervised Instance and Semantic Segmentation | CVPR | code | 159 |
| Low-Shot Visual Recognition by Shrinking and Hallucinating Features | ICCV | code | 158 |
| Oriented Response Networks | CVPR | code | 157 |
| Soft Proposal Networks for Weakly Supervised Object Localization | ICCV | code | 154 |
| Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks | ICML | code | 147 |
| Axiomatic Attribution for Deep Networks | ICML | code | 146 |
| Gradient Episodic Memory for Continual Learning | NIPS | code | 146 |
| DSAC - Differentiable RANSAC for Camera Localization | CVPR | code | 144 |
| Attend to You: Personalized Image Captioning With Context Sequence Memory Networks | CVPR | code | 143 |
| Conditional Similarity Networks | CVPR | code | 142 |
| Language Modeling with Recurrent Highway Hypernetworks | NIPS | code | 141 |
| Triple Generative Adversarial Nets | NIPS | code | 138 |
| Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning | NIPS | code | 138 |
| One-Sided Unsupervised Domain Mapping | NIPS | code | 137 |
| Detecting Visual Relationships With Deep Relational Networks | CVPR | code | 137 |
| Attentive Recurrent Comparators | ICML | code | 136 |
| Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach | ICCV | code | 136 |
| Learning a Multi-View Stereo Machine | NIPS | code | 135 |
| Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model | NIPS | code | 134 |
| Multi-Context Attention for Human Pose Estimation | CVPR | code | 131 |
| Controlling Perceptual Factors in Neural Style Transfer | CVPR | code | 130 |
| Bayesian Compression for Deep Learning | NIPS | code | 130 |
| Adversarial Discriminative Domain Adaptation | CVPR | code | 129 |
| Working hard to know your neighbor's margins: Local descriptor learning loss | NIPS | code | 128 |
| Concrete Dropout | NIPS | code | 127 |
| SegFlow: Joint Learning for Video Object Segmentation and Optical Flow | ICCV | code | 127 |
| Segmentation-Aware Convolutional Networks Using Local Attention Masks | ICCV | code | 126 |
| Detail-Revealing Deep Video Super-Resolution | ICCV | code | 126 |
| CREST: Convolutional Residual Learning for Visual Tracking | ICCV | code | 126 |
| Discriminative Correlation Filter With Channel and Spatial Reliability | CVPR | code | 124 |
| SVDNet for Pedestrian Retrieval | ICCV | code | 121 |
| Semantic Image Synthesis via Adversarial Learning | ICCV | code | 121 |
| Spatiotemporal Multiplier Networks for Video Action Recognition | CVPR | code | 121 |
| PoseTrack: Joint Multi-Person Pose Estimation and Tracking | CVPR | code | 121 |
| Hierarchical Attentive Recurrent Tracking | NIPS | code | 121 |
| Good Semi-supervised Learning That Requires a Bad GAN | NIPS | code | 120 |
| Deep Watershed Transform for Instance Segmentation | CVPR | code | 120 |
| Associative Domain Adaptation | ICCV | code | 119 |
| Learning by Association -- A Versatile Semi-Supervised Training Method for Neural Networks | CVPR | code | 119 |
| Value Prediction Network | NIPS | code | 119 |
| Unrestricted Facial Geometry Reconstruction Using Image-To-Image Translation | ICCV | code | 119 |
| MemNet: A Persistent Memory Network for Image Restoration | ICCV | code | 119 |
| Bayesian Optimization with Gradients | NIPS | code | 117 |
| TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning | NIPS | code | 117 |
| Compressed Sensing using Generative Models | ICML | code | 116 |
| Switching Convolutional Neural Network for Crowd Counting | CVPR | code | 116 |
| WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation | CVPR | code | 116 |
| Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner | ICCV | code | 115 |
| Video Frame Synthesis Using Deep Voxel Flow | ICCV | code | 114 |
| Multiple Instance Detection Network With Online Instance Classifier Refinement | CVPR | code | 113 |
| Deep Pyramidal Residual Networks | CVPR | code | 112 |
| Train longer, generalize better: closing the generalization gap in large batch training of neural networks | NIPS | code | 112 |
| Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction | CVPR | code | 110 |
| Unite the People: Closing the Loop Between 3D and 2D Human Representations | CVPR | code | 110 |
| Learning Combinatorial Optimization Algorithms over Graphs | NIPS | code | 109 |
| FeUdal Networks for Hierarchical Reinforcement Learning | ICML | code | 107 |
| ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression | ICCV | code | 105 |
| Learning a Deep Embedding Model for Zero-Shot Learning | CVPR | code | 104 |
| ECO: Efficient Convolution Operators for Tracking | CVPR | code | 103 |
| SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning | CVPR | code | 102 |
| Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency | CVPR | code | 100 |
| Task-based End-to-end Model Learning in Stochastic Optimization | NIPS | code | 100 |
| Learning to Compose Domain-Specific Transformations for Data Augmentation | NIPS | code | 97 |
| Genetic CNN | ICCV | code | 97 |
| HashNet: Deep Learning to Hash by Continuation | ICCV | code | 97 |
| Interleaved Group Convolutions | ICCV | code | 95 |
| Deeply-Learned Part-Aligned Representations for Person Re-Identification | ICCV | code | 95 |
| Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model | NIPS | code | 94 |
| Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation | CVPR | code | 93 |
| Octree Generating Networks: Efficient Convolutional Architectures for High-Resolution 3D Outputs | ICCV | code | 92 |
| Semantic Autoencoder for Zero-Shot Learning | CVPR | code | 92 |
| Deep Hyperspherical Learning | NIPS | code | 92 |
| Decoupled Neural Interfaces using Synthetic Gradients | ICML | code | 90 |
| Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks | NIPS | code | 90 |
| Practical Bayesian Optimization for Model Fitting with Bayesian Adaptive Direct Search | NIPS | code | 90 |
| Optical Flow Estimation Using a Spatial Pyramid Network | CVPR | code | 90 |
| AMC: Attention guided Multi-modal Correlation Learning for Image Search | CVPR | code | 90 |
| Deep Video Deblurring for Hand-Held Cameras | CVPR | code | 89 |
| Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data | NIPS | code | 88 |
| Causal Effect Inference with Deep Latent-Variable Models | NIPS | code | 87 |
| GANs for Biological Image Synthesis | ICCV | code | 85 |
| MMD GAN: Towards Deeper Understanding of Moment Matching Network | NIPS | code | 84 |
| Representation Learning by Learning to Count | ICCV | code | 84 |
| Optical Flow in Mostly Rigid Scenes | CVPR | code | 83 |
| Fast-Slow Recurrent Neural Networks | NIPS | code | 82 |
| Unsupervised Video Summarization With Adversarial LSTM Networks | CVPR | code | 82 |
| Constrained Policy Optimization | ICML | code | 81 |
| A-NICE-MC: Adversarial Training for MCMC | NIPS | code | 80 |
| Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose | CVPR | code | 80 |
| End-To-End Instance Segmentation With Recurrent Attention | CVPR | code | 78 |
| DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data | CVPR | code | 78 |
| Learning Shape Abstractions by Assembling Volumetric Primitives | CVPR | code | 77 |
| Local Binary Convolutional Neural Networks | CVPR | code | 77 |
| Raster-To-Vector: Revisiting Floorplan Transformation | ICCV | code | 76 |
| Positive-Unlabeled Learning with Non-Negative Risk Estimator | NIPS | code | 76 |
| Hard-Aware Deeply Cascaded Embedding | ICCV | code | 75 |
| Deep Image Harmonization | CVPR | code | 73 |
| Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis | CVPR | code | 73 |
| Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade | CVPR | code | 73 |
| Improved Stereo Matching With Constant Highway Networks and Reflective Confidence Learning | CVPR | code | 72 |
| Query-Guided Regression Network With Context Policy for Phrase Grounding | ICCV | code | 72 |
| Top-Down Visual Saliency Guided by Captions | CVPR | code | 72 |
| Feedback Networks | CVPR | code | 72 |
| What Actions Are Needed for Understanding Human Actions in Videos? | ICCV | code | 71 |
| Xception: Deep Learning With Depthwise Separable Convolutions | CVPR | code | 71 |
| Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning | CVPR | code | 71 |
| Video Propagation Networks | CVPR | code | 70 |
| Image-To-Image Translation With Conditional Adversarial Networks | CVPR | code | 70 |
| Quality Aware Network for Set to Set Recognition | CVPR | code | 69 |
| Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces | CVPR | code | 69 |
| Deep Subspace Clustering Networks | NIPS | code | 68 |
| Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models | ICCV | code | 68 |
| A Distributional Perspective on Reinforcement Learning | ICML | code | 68 |
| Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks | CVPR | code | 67 |
| Deep Transfer Learning with Joint Adaptation Networks | ICML | code | 67 |
| Training Deep Networks without Learning Rates Through Coin Betting | NIPS | code | 66 |
| Full Resolution Image Compression With Recurrent Neural Networks | CVPR | code | 66 |
| SurfaceNet: An End-To-End 3D Neural Network for Multiview Stereopsis | ICCV | code | 66 |
| Doubly Stochastic Variational Inference for Deep Gaussian Processes | NIPS | code | 66 |
| TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals | ICCV | code | 66 |
| Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification | ICCV | code | 65 |
| Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks | CVPR | code | 65 |
| Dance Dance Convolution | ICML | code | 65 |
| Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning | CVPR | code | 64 |
| Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes | ICCV | code | 64 |
| Toward Controlled Generation of Text | ICML | code | 63 |
| Person Re-Identification in the Wild | CVPR | code | 63 |
| ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching | NIPS | code | 63 |
| Differentiable Learning of Logical Rules for Knowledge Base Reasoning | NIPS | code | 62 |
| Person Search With Natural Language Description | CVPR | code | 61 |
| Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising | ICCV | code | 61 |
| Playing for Benchmarks | ICCV | code | 61 |
| Unsupervised Learning by Predicting Noise | ICML | code | 60 |
| Localizing Moments in Video With Natural Language | ICCV | code | 60 |
| End-To-End 3D Face Reconstruction With Deep Neural Networks | CVPR | code | 60 |
| CoupleNet: Coupling Global Structure With Local Parts for Object Detection | ICCV | code | 59 |
| AdaGAN: Boosting Generative Models | NIPS | code | 59 |
| Convolutional Gaussian Processes | NIPS | code | 57 |
| A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection | CVPR | code | 57 |
| Modeling Relationships in Referential Expressions With Compositional Modular Networks | CVPR | code | 57 |
| Curiosity-driven Exploration by Self-supervised Prediction | ICML | code | 56 |
| Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution | ICCV | code | 56 |
| The Neural Hawkes Process: A Neurally Self-Modulating Multivariate Point Process | NIPS | code | 56 |
| Online and Linear-Time Attention by Enforcing Monotonic Alignments | ICML | code | 56 |
| Neural Expectation Maximization | NIPS | code | 56 |
| Dense-Captioning Events in Videos | ICCV | code | 55 |
| Factorized Bilinear Models for Image Recognition | ICCV | code | 55 |
| Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee | NIPS | code | 54 |
| On-the-fly Operation Batching in Dynamic Computation Graphs | NIPS | code | 54 |
| Visual Translation Embedding Network for Visual Relation Detection | CVPR | code | 54 |
| Learning Blind Motion Deblurring | ICCV | code | 54 |
| A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning | NIPS | code | 53 |
| Towards Diverse and Natural Image Descriptions via a Conditional GAN | ICCV | code | 53 |
| CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos | CVPR | code | 53 |
| A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing | ICCV | code | 52 |
| Deep IV: A Flexible Approach for Counterfactual Prediction | ICML | code | 52 |
| Triangle Generative Adversarial Networks | NIPS | code | 51 |
| EAST: An Efficient and Accurate Scene Text Detector | CVPR | code | 51 |
| SST: Single-Stream Temporal Action Proposals | CVPR | code | 51 |
| Predicting Deeper Into the Future of Semantic Segmentation | ICCV | code | 51 |
| L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space | CVPR | code | 51 |
| TALL: Temporal Activity Localization via Language Query | ICCV | code | 50 |
| Hybrid Reward Architecture for Reinforcement Learning | NIPS | code | 50 |
| Fast Fourier Color Constancy | CVPR | code | 49 |
| Modulating early visual processing by language | NIPS | code | 49 |
| Adversarial Examples for Semantic Segmentation and Object Detection | ICCV | code | 49 |
| Learning Discrete Representations via Information Maximizing Self-Augmented Training | ICML | code | 49 |
| Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations | CVPR | code | 48 |
| Real Time Image Saliency for Black Box Classifiers | NIPS | code | 48 |
| FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling | CVPR | code | 47 |
| Multiple People Tracking by Lifted Multicut and Person Re-Identification | CVPR | code | 47 |
| Learned D-AMP: Principled Neural Network based Compressive Image Recovery | NIPS | code | 47 |
| GP CaKe: Effective brain connectivity with causal kernels | NIPS | code | 46 |
| Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network | NIPS | code | 46 |
| Semantic Video CNNs Through Representation Warping | ICCV | code | 46 |
| Grammar Variational Autoencoder | ICML | code | 46 |
| EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis | ICCV | code | 46 |
| Safe Model-based Reinforcement Learning with Stability Guarantees | NIPS | code | 45 |
| Deep Spectral Clustering Learning | ICML | code | 45 |
| Semantic Compositional Networks for Visual Captioning | CVPR | code | 45 |
| On-Demand Learning for Deep Image Restoration | ICCV | code | 45 |
| Video Pixel Networks | ICML | code | 45 |
| Stabilizing Training of Generative Adversarial Networks through Regularization | NIPS | code | 45 |
| Structured Bayesian Pruning via Log-Normal Multiplicative Noise | NIPS | code | 44 |
| Deriving Neural Architectures from Sequence and Graph Kernels | ICML | code | 44 |
| Masked Autoregressive Flow for Density Estimation | NIPS | code | 44 |
| Unsupervised Adaptation for Deep Stereo | ICCV | code | 44 |
| Learning Residual Images for Face Attribute Manipulation | CVPR | code | 43 |
| Learning to Generate Long-term Future via Hierarchical Prediction | ICML | code | 43 |
| Accurate Optical Flow via Direct Cost Volume Processing | CVPR | code | 42 |
| Generalized Orderless Pooling Performs Implicit Salient Matching | ICCV | code | 42 |
| Comparative Evaluation of Hand-Crafted and Learned Local Features | CVPR | code | 42 |
| SchNet: A continuous-filter convolutional neural network for modeling quantum interactions | NIPS | code | 41 |
| Temporal Generative Adversarial Nets With Singular Value Clipping | ICCV | code | 41 |
| Multiplicative Normalizing Flows for Variational Bayesian Neural Networks | ICML | code | 41 |
| Neural Scene De-Rendering | CVPR | code | 40 |
| Semantic Image Inpainting With Deep Generative Models | CVPR | code | 40 |
| A Linear-Time Kernel Goodness-of-Fit Test | NIPS | code | 40 |
| Least Squares Generative Adversarial Networks | ICCV | code | 39 |
| Diversified Texture Synthesis With Feed-Forward Networks | CVPR | code | 39 |
| No Fuss Distance Metric Learning Using Proxies | ICCV | code | 38 |
| Template Matching With Deformable Diversity Similarity | CVPR | code | 38 |
| What's in a Question: Using Visual Questions as a Form of Supervision | CVPR | code | 38 |
| Face Normals "In-The-Wild" Using Fully Convolutional Networks | CVPR | code | 38 |
| Conditional Image Synthesis with Auxiliary Classifier GANs | ICML | code | 37 |
| Neural Episodic Control | ICML | code | 37 |
| 3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks | ICCV | code | 37 |
| Structured Embedding Models for Grouped Data | NIPS | code | 36 |
| Learning Active Learning from Data | NIPS | code | 36 |
| Unified Deep Supervised Domain Adaptation and Generalization | ICCV | code | 35 |
| Transformation-Grounded Image Generation Network for Novel 3D View Synthesis | CVPR | code | 35 |
| Structured Attentions for Visual Question Answering | ICCV | code | 34 |
| Geometric Loss Functions for Camera Pose Regression With Deep Learning | CVPR | code | 34 |
| VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization | CVPR | code | 34 |
| QMDP-Net: Deep Learning for Planning under Partial Observability | NIPS | code | 34 |
| Using Ranking-CNN for Age Estimation | CVPR | code | 33 |
| Hierarchical Boundary-Aware Neural Encoder for Video Captioning | CVPR | code | 33 |
| Unsupervised Learning of Disentangled Representations from Video | NIPS | code | 32 |
| Deep Learning on Lie Groups for Skeleton-Based Action Recognition | CVPR | code | 32 |
| Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection | CVPR | code | 32 |
| 3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder | CVPR | code | 32 |
| StyleNet: Generating Attractive Visual Captions With Styles | CVPR | code | 32 |
| Dynamic Word Embeddings | ICML | code | 32 |
| Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon | NIPS | code | 31 |
| Continual Learning Through Synaptic Intelligence | ICML | code | 31 |
| Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes | CVPR | code | 31 |
| Learning Detection With Diverse Proposals | CVPR | code | 31 |
| LCNN: Lookup-Based Convolutional Neural Network | CVPR | code | 31 |
| Towards Accurate Multi-Person Pose Estimation in the Wild | CVPR | code | 30 |
| Real-Time Neural Style Transfer for Videos | CVPR | code | 30 |
| Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training | ICCV | code | 30 |
| Deep Co-Occurrence Feature Learning for Visual Object Recognition | CVPR | code | 29 |
| Joint distribution optimal transportation for domain adaptation | NIPS | code | 29 |
| Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields | CVPR | code | 29 |
| SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization | ICML | code | 29 |
| The Statistical Recurrent Unit | ICML | code | 29 |
| A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation | CVPR | code | 28 |
| Learning Spread-Out Local Feature Descriptors | ICCV | code | 28 |
| Event-Based Visual Inertial Odometry | CVPR | code | 27 |
| DropoutNet: Addressing Cold Start in Recommender Systems | NIPS | code | 27 |
| Phrase Localization and Visual Relationship Detection With Comprehensive Image-Language Cues | ICCV | code | 27 |
| Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations | CVPR | code | 27 |
| Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos | CVPR | code | 27 |
| Neural Message Passing for Quantum Chemistry | ICML | code | 27 |
| State-Frequency Memory Recurrent Neural Networks | ICML | code | 27 |
| DeepCD: Learning Deep Complementary Descriptors for Patch Representations | ICCV | code | 26 |
| Contrastive Learning for Image Captioning | NIPS | code | 26 |
| Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite Sum Structure | NIPS | code | 26 |
| Learning High Dynamic Range From Outdoor Panoramas | ICCV | code | 26 |
| Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors | CVPR | code | 26 |
| Learning to Detect Salient Objects With Image-Level Supervision | CVPR | code | 26 |
| Improved Variational Autoencoders for Text Modeling using Dilated Convolutions | ICML | code | 26 |
| Interspecies Knowledge Transfer for Facial Keypoint Detection | CVPR | code | 25 |
| YASS: Yet Another Spike Sorter | NIPS | code | 25 |
| Open Set Domain Adaptation | ICCV | code | 25 |
| Domain-Adaptive Deep Network Compression | ICCV | code | 24 |
| Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization | ICCV | code | 24 |
| Temporal Context Network for Activity Localization in Videos | ICCV | code | 24 |
| Incremental Learning of Object Detectors Without Catastrophic Forgetting | ICCV | code | 24 |
| Dense Captioning With Joint Inference and Visual Context | CVPR | code | 24 |
| Universal Adversarial Perturbations | CVPR | code | 24 |
| Asymmetric Tri-training for Unsupervised Domain Adaptation | ICML | code | 24 |
| Reducing Reparameterization Gradient Variance | NIPS | code | 24 |
| Exploiting Saliency for Object Segmentation From Image Level Labels | CVPR | code | 24 |
| A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering | NIPS | code | 24 |
| Shading Annotations in the Wild | CVPR | code | 24 |
| Straight to Shapes: Real-Time Detection of Encoded Shapes | CVPR | code | 23 |
| Dual Discriminator Generative Adversarial Nets | NIPS | code | 23 |
| Zero-Order Reverse Filtering | ICCV | code | 23 |
| Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net | NIPS | code | 23 |
| Learning Spherical Convolution for Fast Features from 360° Imagery | NIPS | code | 22 |
| Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier | ICML | code | 22 |
| Deep Cross-Modal Hashing | CVPR | code | 22 |
| When Unsupervised Domain Adaptation Meets Tensor Representations | ICCV | code | 22 |
| Image Super-Resolution Using Dense Skip Connections | ICCV | code | 22 |
| Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer | CVPR | code | 22 |
| STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling | CVPR | code | 22 |
| Learning Continuous Semantic Representations of Symbolic Expressions | ICML | code | 22 |
| Deep Growing Learning | ICCV | code | 21 |
| Combined Group and Exclusive Sparsity for Deep Neural Networks | ICML | code | 21 |
| Hash Embeddings for Efficient Word Representations | NIPS | code | 21 |
| Accuracy First: Selecting a Differential Privacy Level for Accuracy Constrained ERM | NIPS | code | 21 |
| Disentangled Representation Learning GAN for Pose-Invariant Face Recognition | CVPR | code | 21 |
| Learning to Pivot with Adversarial Networks | NIPS | code | 21 |
| Learning Dynamic Siamese Network for Visual Object Tracking | ICCV | code | 21 |
| POSEidon: Face-From-Depth for Driver Pose Estimation | CVPR | code | 20 |
| Deep Metric Learning via Facility Location | CVPR | code | 20 |
| Automatic Spatially-Aware Fashion Concept Discovery |
ICCV |
code |
20 |
| The Numerics of GANs |
NIPS |
code |
20 |
| From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur |
CVPR |
code |
20 |
| Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks |
ICCV |
code |
20 |
| Zero-Inflated Exponential Family Embeddings |
ICML |
code |
20 |
| InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations |
NIPS |
code |
20 |
| Weakly-Supervised Learning of Visual Relations |
ICCV |
code |
20 |
| Multi-Label Image Recognition by Recurrently Discovering Attentional Regions |
ICCV |
code |
20 |
| Scene Parsing With Global Context Embedding |
ICCV |
code |
20 |
| Context Selection for Embedding Models |
NIPS |
code |
20 |
| Deep Mean-Shift Priors for Image Restoration |
NIPS |
code |
20 |
| Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition |
CVPR |
code |
20 |
| Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification |
CVPR |
code |
19 |
| Learning Compact Geometric Features |
ICCV |
code |
19 |
| Structured Generative Adversarial Networks |
NIPS |
code |
19 |
| Joint Gap Detection and Inpainting of Line Drawings |
CVPR |
code |
19 |
| Chained Multi-Stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection |
ICCV |
code |
19 |
| Adversarial Feature Matching for Text Generation |
ICML |
code |
18 |
| BIER - Boosting Independent Embeddings Robustly |
ICCV |
code |
18 |
| Predictive-Corrective Networks for Action Detection |
CVPR |
code |
18 |
| Stochastic Generative Hashing |
ICML |
code |
18 |
| A Bayesian Data Augmentation Approach for Learning Deep Models |
NIPS |
code |
18 |
| Attentive Semantic Video Generation Using Captions |
ICCV |
code |
18 |
| MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network |
CVPR |
code |
18 |
| Deep Unsupervised Similarity Learning Using Partially Ordered Sets |
CVPR |
code |
17 |
| DualNet: Learn Complementary Features for Image Recognition |
ICCV |
code |
17 |
| Neural system identification for large populations separating “what” and “where” |
NIPS |
code |
17 |
| FALKON: An Optimal Large Scale Kernel Method |
NIPS |
code |
17 |
| Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks |
CVPR |
code |
17 |
| Deep Learning with Topological Signatures |
NIPS |
code |
17 |
| Streaming Sparse Gaussian Process Approximations |
NIPS |
code |
17 |
| RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos |
ICCV |
code |
17 |
| Awesome Typography: Statistics-Based Text Effects Transfer |
CVPR |
code |
17 |
| RoomNet: End-To-End Room Layout Estimation |
ICCV |
code |
17 |
| Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval |
ICCV |
code |
16 |
| Deep Supervised Discrete Hashing |
NIPS |
code |
16 |
| Few-Shot Learning Through an Information Retrieval Lens |
NIPS |
code |
16 |
| Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach |
NIPS |
code |
16 |
| Learning to Push the Limits of Efficient FFT-Based Image Deconvolution |
ICCV |
code |
16 |
| Federated Multi-Task Learning |
NIPS |
code |
16 |
| Label Distribution Learning Forests |
NIPS |
code |
16 |
| Deep Multitask Architecture for Integrated 2D and 3D Human Sensing |
CVPR |
code |
16 |
| Estimating Mutual Information for Discrete-Continuous Mixtures |
NIPS |
code |
16 |
| Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes |
CVPR |
code |
16 |
| StyleBank: An Explicit Representation for Neural Image Style Transfer |
CVPR |
code |
16 |
| Surface Normals in the Wild |
ICCV |
code |
15 |
| Automatic Discovery of the Statistical Types of Variables in a Dataset |
ICML |
code |
15 |
| Learning Diverse Image Colorization |
CVPR |
code |
15 |
| Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems |
ICCV |
code |
15 |
| Non-Local Deep Features for Salient Object Detection |
CVPR |
code |
15 |
| Structure-Measure: A New Way to Evaluate Foreground Maps |
ICCV |
code |
15 |
| Shallow Updates for Deep Reinforcement Learning |
NIPS |
code |
15 |
| Wasserstein Generative Adversarial Networks |
ICML |
code |
15 |
| Recurrent 3D Pose Sequence Machines |
CVPR |
code |
15 |
| Variational Dropout Sparsifies Deep Neural Networks |
ICML |
code |
15 |
| Captioning Images With Diverse Objects |
CVPR |
code |
15 |
| Off-policy evaluation for slate recommendation |
NIPS |
code |
15 |
| Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning |
ICCV |
code |
14 |
| Benchmarking Denoising Algorithms With Real Photographs |
CVPR |
code |
14 |
| Neural Aggregation Network for Video Face Recognition |
CVPR |
code |
14 |
| Learned Contextual Feature Reweighting for Image Geo-Localization |
CVPR |
code |
14 |
| Streaming Weak Submodularity: Interpreting Neural Networks on the Fly |
NIPS |
code |
14 |
| CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training |
ICCV |
code |
14 |
| VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation |
ICCV |
code |
14 |
| Spherical convolutions and their application in molecular modelling |
NIPS |
code |
14 |
| Multi-Information Source Optimization |
NIPS |
code |
14 |
| Convolutional Neural Network Architecture for Geometric Matching |
CVPR |
code |
14 |
| Neural Face Editing With Intrinsic Image Disentangling |
CVPR |
code |
14 |
| Realistic Dynamic Facial Textures From a Single Image Using GANs |
ICCV |
code |
14 |
| Predictive State Recurrent Neural Networks |
NIPS |
code |
13 |
| Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework |
ICCV |
code |
13 |
| ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events |
NIPS |
code |
13 |
| Hunt For The Unique, Stable, Sparse And Fast Feature Learning On Graphs |
NIPS |
code |
13 |
| Consensus Convolutional Sparse Coding |
ICCV |
code |
13 |
| Weakly Supervised Affordance Detection |
CVPR |
code |
13 |
| Joint Learning of Object and Action Detectors |
ICCV |
code |
13 |
| Light Field Blind Motion Deblurring |
CVPR |
code |
13 |
| Asynchronous Stochastic Gradient Descent with Delay Compensation |
ICML |
code |
13 |
| Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations |
ICCV |
code |
12 |
| Maximizing Subset Accuracy with Recurrent Neural Networks in Multi-label Classification |
NIPS |
code |
12 |
| Self-Organized Text Detection With Minimal Post-Processing via Border Learning |
ICCV |
code |
12 |
| Coordinated Multi-Agent Imitation Learning |
ICML |
code |
12 |
| Gradient descent GAN optimization is locally stable |
NIPS |
code |
12 |
| Removing Rain From Single Images via a Deep Detail Network |
CVPR |
code |
12 |
| Convexified Convolutional Neural Networks |
ICML |
code |
12 |
| Multigrid Neural Architectures |
CVPR |
code |
12 |
| VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization |
ICCV |
code |
12 |
| Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin |
NIPS |
code |
12 |
| Differential Angular Imaging for Material Recognition |
CVPR |
code |
12 |
| A Multilayer-Based Framework for Online Background Subtraction With Freely Moving Cameras |
ICCV |
code |
11 |
| Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation |
NIPS |
code |
11 |
| Max-value Entropy Search for Efficient Bayesian Optimization |
ICML |
code |
11 |
| Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization |
ICCV |
code |
11 |
| Generalized Deep Image to Image Regression |
CVPR |
code |
11 |
| Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective |
ICCV |
code |
11 |
| Predicting Human Activities Using Stochastic Grammar |
ICCV |
code |
11 |
| DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents |
CVPR |
code |
11 |
| Fisher GAN |
NIPS |
code |
11 |
| High-Order Attention Models for Visual Question Answering |
NIPS |
code |
11 |
| IM2CAD |
CVPR |
code |
11 |
| On Fairness and Calibration |
NIPS |
code |
11 |
| DeepPermNet: Visual Permutation Learning |
CVPR |
code |
10 |
| f-GANs in an Information Geometric Nutshell |
NIPS |
code |
10 |
| Revisiting IM2GPS in the Deep Learning Era |
ICCV |
code |
10 |
| Attentional Correlation Filter Network for Adaptive Visual Tracking |
CVPR |
code |
10 |
| Learning Cross-Modal Deep Representations for Robust Pedestrian Detection |
CVPR |
code |
10 |
| Confident Multiple Choice Learning |
ICML |
code |
10 |
| Curriculum Dropout |
ICCV |
code |
9 |
| Cognitive Mapping and Planning for Visual Navigation |
CVPR |
code |
9 |
| Optimized Pre-Processing for Discrimination Prevention |
NIPS |
code |
9 |
| Learning Motion Patterns in Videos |
CVPR |
code |
9 |
| Scalable Log Determinants for Gaussian Process Kernel Learning |
NIPS |
code |
9 |
| A Hierarchical Approach for Generating Descriptive Image Paragraphs |
CVPR |
code |
9 |
| Deep Crisp Boundaries |
CVPR |
code |
9 |
| Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization |
NIPS |
code |
9 |
| Practical Data-Dependent Metric Compression with Provable Guarantees |
NIPS |
code |
9 |
| Do Deep Neural Networks Suffer from Crowding? |
NIPS |
code |
9 |
| A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting |
CVPR |
code |
9 |
| End-To-End Learning of Geometry and Context for Deep Stereo Regression |
ICCV |
code |
9 |
| From Bayesian Sparsity to Gated Recurrent Nets |
NIPS |
code |
8 |
| Regret Minimization in MDPs with Options without Prior Knowledge |
NIPS |
code |
8 |
| Following Gaze in Video |
ICCV |
code |
8 |
| Model-Powered Conditional Independence Test |
NIPS |
code |
8 |
| Cost efficient gradient boosting |
NIPS |
code |
8 |
| Reflectance Adaptive Filtering Improves Intrinsic Image Estimation |
CVPR |
code |
8 |
| DeepNav: Learning to Navigate Large Cities |
CVPR |
code |
8 |
| Look, Listen and Learn |
ICCV |
code |
8 |
| Attention-Aware Face Hallucination via Deep Reinforcement Learning |
CVPR |
code |
8 |
| Plan, Attend, Generate: Planning for Sequence-to-Sequence Models |
NIPS |
code |
8 |
| Introspective Neural Networks for Generative Modeling |
ICCV |
code |
8 |
| Affinity Clustering: Hierarchical Clustering at Scale |
NIPS |
code |
8 |
| Gaze Embeddings for Zero-Shot Image Classification |
CVPR |
code |
8 |
| Input Switched Affine Networks: An RNN Architecture Designed for Interpretability |
ICML |
code |
8 |
| Online multiclass boosting |
NIPS |
code |
8 |
| Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images |
ICCV |
code |
8 |
| SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition |
ICCV |
code |
7 |
| Learning Koopman Invariant Subspaces for Dynamic Mode Decomposition |
NIPS |
code |
7 |
| Unsupervised Monocular Depth Estimation With Left-Right Consistency |
CVPR |
code |
7 |
| Personalized Image Aesthetics |
ICCV |
code |
7 |
| Reasoning About Fine-Grained Attribute Phrases Using Reference Games |
ICCV |
code |
7 |
| Lost Relatives of the Gumbel Trick |
ICML |
code |
7 |
| Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction |
ICCV |
code |
7 |
| Centered Weight Normalization in Accelerating Training of Deep Neural Networks |
ICCV |
code |
6 |
| Scalable Planning with Tensorflow for Hybrid Nonlinear Domains |
NIPS |
code |
6 |
| Convex Global 3D Registration With Lagrangian Duality |
CVPR |
code |
6 |
| Building a Regular Decision Boundary With Deep Networks |
CVPR |
code |
6 |
| Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification |
CVPR |
code |
6 |
| Forecasting Human Dynamics From Static Images |
CVPR |
code |
6 |
| AOD-Net: All-In-One Dehazing Network |
ICCV |
code |
6 |
| K-Medoids For K-Means Seeding |
NIPS |
code |
6 |
| Diverse Image Annotation |
CVPR |
code |
6 |
| Practical Hash Functions for Similarity Estimation and Dimensionality Reduction |
NIPS |
code |
6 |
| Deep Adaptive Image Clustering |
ICCV |
code |
6 |
| Robust Adversarial Reinforcement Learning |
ICML |
code |
6 |
| Improving Training of Deep Neural Networks via Singular Value Bounding |
CVPR |
code |
6 |
| Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems |
NIPS |
code |
6 |
| Tensor Belief Propagation |
ICML |
code |
6 |
| Sparse convolutional coding for neuronal assembly detection |
NIPS |
code |
6 |
| Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks |
CVPR |
code |
6 |
| Bayesian inference on random simple graphs with power law degree distributions |
ICML |
code |
6 |
| Tensor Biclustering |
NIPS |
code |
6 |
| Riemannian approach to batch normalization |
NIPS |
code |
6 |
| Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings |
ICCV |
code |
6 |
| Rolling-Shutter-Aware Differential SfM and Image Rectification |
ICCV |
code |
5 |
| Active Decision Boundary Annotation With Deep Generative Models |
ICCV |
code |
5 |
| Object Co-Skeletonization With Co-Segmentation |
CVPR |
code |
5 |
| Discover and Learn New Objects From Documentaries |
CVPR |
code |
5 |
| Understanding Black-box Predictions via Influence Functions |
ICML |
code |
5 |
| Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach |
CVPR |
code |
5 |
| Decoupling "when to update" from "how to update" |
NIPS |
code |
5 |
| MarioQA: Answering Questions by Watching Gameplay Videos |
ICCV |
code |
5 |
| Differentially private Bayesian learning on distributed data |
NIPS |
code |
5 |
| Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization |
ICCV |
code |
5 |
| Question Asking as Program Generation |
NIPS |
code |
5 |
| Conic Scan-and-Cover algorithms for nonparametric topic modeling |
NIPS |
code |
5 |
| Lip Reading Sentences in the Wild |
CVPR |
code |
5 |
| ROAM: A Rich Object Appearance Model With Application to Rotoscoping |
CVPR |
code |
5 |
| NeuralFDR: Learning Discovery Thresholds from Hypothesis Features |
NIPS |
code |
5 |
| Viraliency: Pooling Local Virality |
CVPR |
code |
5 |
| Learning Algorithms for Active Learning |
ICML |
code |
5 |
| Point to Set Similarity Based Deep Feature Learning for Person Re-Identification |
CVPR |
code |
5 |
| Click Here: Human-Localized Keypoints as Guidance for Viewpoint Estimation |
ICCV |
code |
5 |
| The World of Fast Moving Objects |
CVPR |
code |
5 |
| Cross-Modality Binary Code Learning via Fusion Similarity Hashing |
CVPR |
code |
5 |
| Testing and Learning on Distributions with Symmetric Noise Invariance |
NIPS |
code |
5 |
| Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference |
NIPS |
code |
5 |
| Diving into the shallows: a computational perspective on large-scale shallow learning |
NIPS |
code |
5 |
| Rotation Equivariant Vector Field Networks |
ICCV |
code |
5 |
| Recursive Sampling for the Nyström Method |
NIPS |
code |
5 |
| Learning From Video and Text via Large-Scale Discriminative Clustering |
ICCV |
code |
5 |
| Global optimization of Lipschitz functions |
ICML |
code |
5 |
| Device Placement Optimization with Reinforcement Learning |
ICML |
code |
4 |
| Alternating Direction Graph Matching |
CVPR |
code |
4 |
| MEC: Memory-efficient Convolution for Deep Neural Network |
ICML |
code |
4 |
| Expert Gate: Lifelong Learning With a Network of Experts |
CVPR |
code |
4 |
| A Simple yet Effective Baseline for 3D Human Pose Estimation |
ICCV |
code |
4 |
| On Structured Prediction Theory with Calibrated Convex Surrogate Losses |
NIPS |
code |
4 |
| Sub-sampled Cubic Regularization for Non-convex Optimization |
ICML |
code |
4 |
| Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval |
CVPR |
code |
4 |
| Bottleneck Conditional Density Estimation |
ICML |
code |
4 |
| Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning |
ICCV |
code |
4 |
| Multi-way Interacting Regression via Factorization Machines |
NIPS |
code |
4 |
| Joint Discovery of Object States and Manipulation Actions |
ICCV |
code |
4 |
| Predicting Salient Face in Multiple-Face Videos |
CVPR |
code |
4 |
| From Red Wine to Red Tomato: Composition With Context |
CVPR |
code |
4 |
| Encoder Based Lifelong Learning |
ICCV |
code |
4 |
| Deep Recurrent Neural Network-Based Identification of Precursor microRNAs |
NIPS |
code |
4 |
| Guarantees for Greedy Maximization of Non-submodular Functions with Applications |
ICML |
code |
4 |
| Pose-Aware Person Recognition |
CVPR |
code |
4 |
| Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths |
CVPR |
code |
4 |
| Asynchronous Distributed Variational Gaussian Processes for Regression |
ICML |
code |
3 |
| Saliency Pattern Detection by Ranking Structured Trees |
ICCV |
code |
3 |
| Toward Goal-Driven Neural Network Models for the Rodent Whisker-Trigeminal System |
NIPS |
code |
3 |
| Learning Non-Maximum Suppression |
CVPR |
code |
3 |
| Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC |
ICML |
code |
3 |
| Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries |
CVPR |
code |
3 |
| AdaNet: Adaptive Structural Learning of Artificial Neural Networks |
ICML |
code |
3 |
| Large Margin Object Tracking With Circulant Feature Maps |
CVPR |
code |
3 |
| Compatible Reward Inverse Reinforcement Learning |
NIPS |
code |
3 |
| Adversarial Surrogate Losses for Ordinal Regression |
NIPS |
code |
3 |
| Non-monotone Continuous DR-submodular Maximization: Structure and Algorithms |
NIPS |
code |
3 |
| Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning |
NIPS |
code |
3 |
| A framework for Multi-A(rmed)/B(andit) Testing with Online FDR Control |
NIPS |
code |
3 |
| Counting Everyday Objects in Everyday Scenes |
CVPR |
code |
3 |
| Loss Max-Pooling for Semantic Image Segmentation |
CVPR |
code |
3 |
| Aesthetic Critiques Generation for Photos |
ICCV |
code |
3 |
| Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems |
NIPS |
code |
3 |
| Near-Optimal Edge Evaluation in Explicit Generalized Binomial Graphs |
NIPS |
code |
3 |