Megagon Labs's repositories
jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
instruction_ja
Japanese instruction data (日本語指示データ)
llm-longeval
💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu, Iso et al; EACL 2024)
quasi_japanese_reviews
Quasi Japanese Reviews (擬似レビューデータ)
hotel_review_scud
宿泊施設口コミ解釈データ
magneton-examples
Example widgets created using the Magneton framework
000
Language:PythonBSD-3-Clause000
Language:PythonBSD-3-Clause000
Language:JavaScriptBSD-3-Clause000
scud2query
Scud2Query dataset