gorogoroumaru / tkrzw-dict

Word dictionary based on DBM

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

================================================================
 Tkrzw-Dict: Word dictionary based on DBM
================================================================

Please read the following documents.

  COPYING           - license
  CONTRIBUTING.md   - how to contribute
  doc/index.xhtml   - main document

To use this package in general, the following software is necessary.

  Tkrzw:
    Install the source package.  See https://dbmx.net/tkrzw/ .

  regex:
    Install pip3 and run "pip3 install regex".

To run tokenize_text.py, the following software is necessary.

  NLTK:
    For Ubuntu, run "apt-get install python3-nltk".
    Then, kick the interactive installation for data packates.
      $ python3
      >>> import nltk
      >>> nltk.download('punkt')
      >>> nltk.download('wordnet')
      >>> nltk.download('averaged_perceptron_tagger')

  MeCab:
    For Ubuntu, run "apt-get install mecab mecab-ipadic-utf8 libmecab2-dev python3-mecab".

To build WordNet databases, WordNet 3.0 and Japanese WordNet 1.1 are necessary.

Thanks.

== END OF FILE ==

About

Word dictionary based on DBM

License:Apache License 2.0


Languages

Language:Python 96.7%Language:JavaScript 2.2%Language:CSS 0.8%Language:HTML 0.3%Language:Shell 0.0%