federicoarenasl / Evaluating-w-Embeddings

In this paper we compare and evaluate two simple embedding models which can be constructed directly from a given co-occurrence matrix extracted from Twitter data; Positive Pointwise Mutual Information (PPMI), and Hellinger Principal Component Analysis (H-PCA). For each embedding model we consider three alternative metrics for word similarity: cosine, euclidean and manhattan distance.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

federicoarenasl/Evaluating-w-Embeddings Stargazers