sor4chi / nlp-knock-100

NLP 100 Knocks

Setup

gunzip -c data/chapter3/jawiki-country.json.gz > data/chapter3/jawiki-country.json
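If `gunzip` is not available, the same decompression can be sketched with the Python standard library. The helper names `gunzip_to` and `read_json_lines` are hypothetical, and the reader assumes the extracted file holds one JSON object per line, which is not stated in this README.

```python
import gzip
import json
import shutil
from pathlib import Path


def gunzip_to(src, dst):
    """Decompress the gzip file src into dst, like `gunzip -c src > dst`."""
    dst = Path(dst)
    dst.parent.mkdir(parents=True, exist_ok=True)
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    return dst


def read_json_lines(path):
    """Yield one parsed object per non-empty line (assumes JSON Lines)."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            if line.strip():
                yield json.loads(line)


# Example call with the paths from the command above:
# gunzip_to("data/chapter3/jawiki-country.json.gz",
#           "data/chapter3/jawiki-country.json")
```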

Download the tar.gz archives for MeCab itself and the IPA dictionary from https://taku910.github.io/mecab/ and extract them:

tar xvfz mecab-x.x.tar.gz
cd mecab-x.x
./configure --with-charset=utf8
make
make check
sudo make install
cd ..
tar xvfz mecab-ipadic-x.x.tar.gz
cd mecab-ipadic-x.x
./configure --with-charset=utf8 --enable-utf8-only
make
sudo make install
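To check the installation from Python, one option is to call the `mecab` command and parse its output. This is a minimal sketch: it assumes `mecab` is on `PATH` and prints its default format with the IPA dictionary (surface form, a tab, comma-separated features, and an `EOS` line per sentence). The function names `parse_mecab_output` and `run_mecab` are hypothetical.

```python
import subprocess


def parse_mecab_output(output):
    """Parse MeCab's default output into (surface, features) pairs."""
    tokens = []
    for line in output.splitlines():
        # Skip blank lines and the end-of-sentence marker.
        if not line or line == "EOS":
            continue
        surface, _, feature = line.partition("\t")
        tokens.append((surface, feature.split(",")))
    return tokens


def run_mecab(sentence):
    """Run the `mecab` command on one sentence (requires MeCab on PATH)."""
    result = subprocess.run(
        ["mecab"],
        input=sentence + "\n",
        capture_output=True,
        encoding="utf-8",
        check=True,
    )
    return parse_mecab_output(result.stdout)


# Example call once MeCab and the dictionary are installed:
# for surface, features in run_mecab("すもももももももものうち"):
#     print(surface, features[0])
```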

Word2Vec

Download GoogleNews-vectors-negative300.bin.gz and extract it:

gunzip -c GoogleNews-vectors-negative300.bin.gz > data/chapter7/GoogleNews-vectors-negative300.bin
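A quick sanity check on the extracted file is to read its header. This sketch assumes the standard word2vec binary layout, where the first line is ASCII text holding the vocabulary size and the vector dimension. `read_word2vec_header` is a hypothetical helper, not part of this repository.

```python
def read_word2vec_header(path):
    """Return (vocab_size, vector_size) from a word2vec binary file header."""
    with open(path, "rb") as f:
        # The header is a short ASCII line; cap the read in case the file is corrupt.
        header = f.readline(64)
    vocab_size, vector_size = (int(x) for x in header.split())
    return vocab_size, vector_size


# Example call with the path from the command above:
# print(read_word2vec_header("data/chapter7/GoogleNews-vectors-negative300.bin"))
```

To load the full vectors, a library such as gensim could be used (for example `KeyedVectors.load_word2vec_format(path, binary=True)`). Whether this repository does so is an assumption.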
