ismla-japanese-helper / japanese-helper

Web application that analyzes Japanese text and displays pronunciation information, inflection details, English translations, and more. 🇯🇵

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Japanese Helper

Semester project for the course "Industrial Strength Multilingual Language Analysis" at Universität Tübingen, Winter Semester 17-18. This application analyzes Japanese text and displays information about the text that helps English-speaking learners of Japanese, including word segmentation, translation, pronunciation, inflection information etc.

You can import it as a Maven project in Eclipse/IntelliJ IDEA and launch it as a GWT application, after having installed the GWT plugin. Remember to check whether the resources folder is excluded from the Build Path. If so, include it before building and launching the project.

For detailed documentation (information about the project, its use cases, implementation, linguistic background, plus screenshots), please refer to the PDF report.

Tokenizing files

  • Add the (uncompressed) files to src/main/webapp/WEB-INF/tokenize. Make sure Eclipse notices the added files.
  • If you want the tokenized content to be separated by something other than a blank space, change the SEPARATOR constant in LookupServiceImpl.java.
  • Run the project in SuperDev mode, open the website, and press the Tokenize files button. Update logs about which file is currently being read are in the console. Once all files have been processed, there is a pop-up on the website.
  • The tokenized files are in the target/JapaneseHelper-1.0-SNAPSHOT directory. Note that this directory will be rebuilt and the files will be deleted whenever the project is run again!

Licensed content

The Wikimedia Foundation licenses its texts on Wikipedia and Wiktionary under a Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) license (Here is the full text of the license.). This applies to the files in our resource folders /src/main/webapp/WEB-INF/dictionary, /src/main/webapp/WEB-INF/difficulty-rating, and /src/main/webapp/WEB-INF/inflection-templates, which are based on Wikipedia/Wiktionary articles. These files, including any modifications we made, are also licensed by the same license.

About

Web application that analyzes Japanese text and displays pronunciation information, inflection details, English translations, and more. 🇯🇵


Languages

Language:Java 96.5%Language:HTML 2.9%Language:CSS 0.6%