A curated list of awesome embedding models tutorials, projects and communities. Please feel free to pull requests to add links.
Table of Contents
Papers
Word Embeddings
Word2vec
- Efficient Estimation of Word Representations in Vector Space
- Distributed Representations of Words and Phrases and their Compositionality
- word2vec Parameter Learning Explained
- word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method
GloVe
- GloVe: Global Vectors for Word Representation
- Improving Word Representations via Global Context and Multiple Word Prototypes
FastText
Embedding Enhancement
- Retrofitting Word Vectors to Semantic Lexicons
- Better Word Representations with Recursive Neural Networks for Morphology
- Dependency-Based Word Embeddings
- Not All Neural Embeddings are Born Equal
- Two/Too Simple Adaptations of Word2Vec for Syntax Problems
Comparing count-based vs predict-based method
- Linguistic Regularities in Sparse and Explicit Word Representations
- Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors
- Improving Distributional Similarity with Lessons Learned from Word Embeddings
Evaluation Method
Phrase, Sentence and Document Embeddings
Sentence
Document
Sense Embeddings
- SENSEMBED: Learning Sense Embeddings for Word and Relational Similarity
- Multi-Prototype Vector-Space Models of Word Meaning
Neural Language Models
- Recurrent neural network based language model
- A Neural Probabilistic Language Model
- Linguistic Regularities in Continuous Space Word Representations
Researchers
Courses and Lectures
Datasets
Training
Evaluation
- SemEval-2012 Task 2
- WordSimilarity-353
- Stanford's Contextual Word Similarities (SCWS)
- Stanford Rare Word (RW) Similarity Dataset
Trained Word Vectors
- Huang et al. (2012)'s embeddings (HSMN+csmRNN)
- Collobert et al. (2011)'s embeddings (CW+csmRNN)
- word vectors trained by GloVe