galuhsahid / indonesian-word-embedding

An Indonesian word embedding demo

Home Page:http://indonesian-word-embedding.herokuapp.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Indonesian Word Embedding

A web application that demonstrates Indonesian word embedding, inspired by Word embedding demo.

See the demo: http://indonesian-word-embedding.herokuapp.com (might take a few seconds if the app is "asleep"--it needs to be booted and loading the vectors might take a few secs)

Right now it serves pre-trained word vectors from here. The word vectors are trained on the Word2vec model from Wikipedia texts using the CBOW training algorithm.

πŸ’‘ future improvements

  • Refactoring. This is my first time using Vue so I'm sure there are a lot of things to be refactored that I haven't known yet!
  • Serves multiple models. Not only Word2vec using CBOW but also Word2vec using skip-gram or even FastText.
  • Add more in-depth explanations, possibly for each section.
  • As more explanations are added, consider using Vuex.
  • Indonesian translation.

πŸ’­ contribute

If you're interested to work in improving the points above or if you have any idea, feel free to make a pull request! πŸ™

βš™ setup

# install front-end
cd frontend
npm install

# serve with hot reload at localhost:8080
npm run dev

# build for production/Flask with minification
npm run build

# install back-end
cd ../backend
virtualenv -p python3 venv
source venv/bin/activate
pip install -r requirements.txt
cd ..

# serve back-end at localhost:5000
FLASK_APP=run.py flask run

credits

About

An Indonesian word embedding demo

http://indonesian-word-embedding.herokuapp.com

License:MIT License


Languages

Language:JavaScript 51.4%Language:Vue 41.6%Language:Python 6.4%Language:HTML 0.6%