nlee-208 / paper_review

Paper Reviewed or Pre-read List

Paper Review/Read-List Compile

Project Ideas

Paper read-list

2020~2021 - prioritized!!

  1. ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
  2. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
  3. MLP-Mixer: An all-MLP Architecture for Vision
  4. Learning to Perturb Word Embeddings for Out-of-distribution QA
  5. Emerging Properties in Self-Supervised Vision Transformers (DINO)
  6. Pay Attention to MLPs
  7. An overview of mixing augmentation methods and augmentation strategies (Slow Read)

CV

Localization/Classification

  • OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
  • Deep Residual Learning for Image Recognition

Object Detection

1-Stage
  • You Only Look Once: Unified, Real-Time Object Detection
  • YOLO9000: Better, Faster, Stronger
  • YOLOv3: An Incremental Improvement
  • YOLOv4: Optimal Speed and Accuracy of Object Detection
  • SSD: Single Shot MultiBox Detector
  • Focal Loss for Dense Object Detection
2-Stage
  • Region-based Convolutional Networks for Accurate Object Detection and Segmentation
  • Fast R-CNN
  • Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
  • Mask R-CNN

Object Segmentation

  • U-Net: Convolutional Networks for Biomedical Image Segmentation

ML

  • CatBoost: Unbiased Boosting with Categorical Features
  • LightGBM: A Highly Efficient Gradient Boosting Decision Tree
  • XGBoost: A Scalable Tree Boosting System

NLP

  • Efficient Estimation of Word Representations in Vector Space
  • Distributed Representations of Words and Phrases and their Compositionality
  • Enriching Word Vectors with Subword Information
  • Bag of Tricks for Efficient Text Classification
  • Convolutional Neural Networks for Sentence Classification
  • Effective Approaches to Attention-based Neural Machine Translation
  • Attention Is All You Need
  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

NLP-Commonsense

  • CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge

  • WinoGrande: An Adversarial Winograd Schema Challenge at Scale

  • KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning

  • Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering

  • From Recognition to Cognition: Visual Commonsense Reasoning

  • UNITER: UNiversal Image-TExt Representation Learning

  • Connective Cognition Network for Directional Visual Commonsense Reasoning

  • VisualCOMET: Reasoning about the Dynamic Context of a Still Image

  • ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph

  • ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations

  • VL-BERT: Pre-training of Generic Visual-Linguistic Representations

  • Language Models as Knowledge Bases?

  • Commonsense Knowledge Base Completion with Structural and Semantic Context

  • ConceptNet 5.5: An Open Multilingual Graph of General Knowledge

  • ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning

  • COMET: Commonsense Transformers for Automatic Knowledge Graph Construction

Presented Papers

  • Bag of Tricks for Efficient Text Classification
  • Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering
  • From Recognition to Cognition: Visual Commonsense Reasoning
  • ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations

Paper Implementations

  • Word2Vec
  • Transformer

Additional Sources
