So Miyagawa's repositories
somiyagawa
Config files for my GitHub profile.
corpus_raw_data
Raw data of the ORAEC corpus
honkoku-data
歴史資料の市民参加型翻刻プラットフォーム「みんなで翻刻」のテキストデータ置き場です。 / Transcription texts created on Minna de Honkoku (https://honkoku.org), a crowdsourced transcription platform for historical Japanese documents.
bert
TensorFlow code and pre-trained models for BERT
CaNDA
A Jekyll documentation theme with built-in search and playground
intro-stats
Code and materials for for an introduction to statistics class in Göttingen (2022)
JSICK
Repository for JSICK
RDM-osf.io
Facilitating Open Science
pdmocrdataset-part2
OCR処理プログラム研究開発事業において作成されたOCR学習用データセット
pdmocrdataset-part1
デジタル化資料OCRテキスト化事業において作成されたOCR学習用データセット
nuxt-mirador
This repository is designed to show integrating Mirador 3 with Nuxt.js.
gatsby-transformer-ceteicean
Transforms XML files for Custom Elements support via CETEIcean.
manifesto
IIIF Presentation API client and server utility library.
tools
Various utilities for processing the data.
headlessui
Completely unstyled, fully accessible UI components, designed to integrate beautifully with Tailwind CSS.
Canvas-Indexer
A flask web application that crawls Activity Streams for IIIF Canvases and offers a search API.
deplacy
CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis
m1n1
A bootloader and experimentation playground for Apple Silicon
lunr.js
A bit like Solr, but much smaller and not as bright
spaCy-SynCha
SynCha-CaboCha-MeCab wrapper for spaCy
SuPar-UniDic
Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models
adapter-transformers
Huggingface Transformers + Adapters = ❤️