Language Media Processing Lab, Kyoto University (ku-nlp)

Organization data from GitHub: https://github.com/ku-nlp

We are working on making NLP better

Location: Kyoto, Japan

Home Page: https://nlp.ist.i.kyoto-u.ac.jp/EN/

GitHub: @ku-nlp

Repositories of the Language Media Processing Lab, Kyoto University

jumanpp

Juman++ (a Morphological Analyzer Toolkit)

Language: C++ | License: Apache-2.0 | Stargazers: 399 | Issues: 29 | Issues: 111

kwja

An integrated Japanese analyzer based on foundation models

Language: Python | License: MIT | Stargazers: 137 | Issues: 4 | Issues: 61

pyknp

A Python Module for JUMAN++/KNP

Language: Python | License: NOASSERTION | Stargazers: 91 | Issues: 10 | Issues: 32

KWDLC

Kyoto University Web Document Leads Corpus

KyotoCorpus

Kyoto University Text Corpus

rhoknp

Yet another Python binding for Juman++/KNP/KWJA

Language: Python | License: MIT | Stargazers: 34 | Issues: 4 | Issues: 53

knp

A Japanese Parser

Language: C | License: NOASSERTION | Stargazers: 32 | Issues: 6 | Issues: 7

AnnotatedFKCCorpus

Annotated Fuman Kaitori Center Corpus

Language: Python | Stargazers: 18 | Issues: 7 | Issues: 0

text-cleaning

A powerful text cleaner for Japanese web texts

Language: Python | License: MIT | Stargazers: 12 | Issues: 0 | Issues: 4

kyoto-reader

A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus

Language: Python | License: MIT | Stargazers: 10 | Issues: 5 | Issues: 2

KyotoCorpusAnnotationTool

An annotation tool for the Kyoto University Corpus

Language: JavaScript | Stargazers: 7 | Issues: 6 | Issues: 1

KUCI

Kyoto University Commonsense Inference dataset (KUCI)

License: CC-BY-SA-4.0 | Stargazers: 5 | Issues: 3 | Issues: 0

latent_language_of_multilingual_model

Partial code for the arXiv paper 'Beyond English-Centric LLMs: What Language Do Multilingual Language Models Think in?'

Language: Jupyter Notebook | Stargazers: 5 | Issues: 0 | Issues: 0

ProgLoRA

The official implementation of the paper "Progressive LoRA for Multimodal Continual Instruction Tuning" (ACL 2025 Findings)

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0

dockerfile-jumanpp-knp

Dockerfiles for Juman++, KNP, and KWJA

Language: Dockerfile | License: MIT | Stargazers: 3 | Issues: 4 | Issues: 1

speechBSD

An extension of the BSD corpus with audio and speaker attribute information

License: NOASSERTION | Stargazers: 3 | Issues: 4 | Issues: 0

RecomMind

Movie recommendation dialogue dataset with first- and second-person annotations of the seeker’s internal state at the entity level.

License: CC-BY-SA-4.0 | Stargazers: 1 | Issues: 3 | Issues: 0

sdg4idrr

Synthetic Data Generation for Implicit Discourse Relation Recognition (SDG4IDRR)

Language: Python | Stargazers: 1 | Issues: 3 | Issues: 0

Abstractive-Multi-Video-Captioning

The implementation of the paper "Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation."

Language: Python | Stargazers: 0 | Issues: 4 | Issues: 0

AbstrActs

Benchmark dataset for abstractive multi-video captioning.

Stargazers: 0 | Issues: 4 | Issues: 0

ARKitSceneRefer

ARKitSceneRefer: Text-based Localization of Small Objects in Diverse Real-World 3D Indoor Scenes (EMNLP 2023 Findings)

License: GPL-3.0 | Stargazers: 0 | Issues: 3 | Issues: 0

Evaluate-Alignment-HVSB

Source code of the paper: Do LLMs Align Human Values Regarding Social Biases? Judging and Explaining Social Biases with LLMs

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Stargazers: 0 | Issues: 0 | Issues: 0

Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0