There are 0 repository under sudachipy topic.
đź“– Program to parse Japanese sentences and give English definitions
This repository contains codes related to the experiments in "An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification" presented at https://www.anlp.jp/nlp2021/. Authors: Andre Rusli and Makoto Shishido (Tokyo Denki University).
Nihotip is a web app that lets users explore Japanese text through interactive tokenization and detailed insights. Built with React and Python, it offers a dynamic way to analyze words and symbols with tooltips for deeper understanding.
A tool to transform JmdictFurigana dictionaries for compatibility with SudachiPy. It restructures the data and converts readings to katakana for efficient furigana lookup during morpheme tokenization in SudachiPy.