TaygaHoshi / japanese-i-plus-one-filter

Finds i+1 sentences for a specific word from Jisho.org.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Japanese i+1 Filter

Finds i+1 sentences from Jisho.org, all of which include a word. The term "i+1" is described as explained in this wikipedia article.

How does it work?

  1. User creates "words.txt" next to the .py file.
  2. User adds words to "words.txt", one word per line.
  3. Load "words.txt".
  4. Search sentences for each word in Jisho.org
  5. Divide sentences to words using ja_ginza and take important words (nouns, adjectives, verbs and so on).
  6. Compare words of each sentence to words in known words set.
  7. Count unknown words found in step 4.
  8. If there are exactly 1 unknown word in a sentence, it's an i+1 sentence.
  9. Save results in "results.txt".

Requirements

General Requirements

  • Made using Python 3.10.0
  • Access to Jisho.org

External Libraries

About

Finds i+1 sentences for a specific word from Jisho.org.

License:GNU General Public License v3.0


Languages

Language:Python 100.0%