dahyun-diane-0208's starred repositories
ko-sentence-transformers
한국어 사전학습 모델을 활용한 문장 임베딩
Graph2Topic
G2T: Topic Model based on PLMs and Community Detection
north_korean_embeddings
Word2Vec Word Vectors trained on a North Korean Corpus / 조선어 (북한어) 단어 임베딩
wordvectors
Pre-trained word vectors of 30+ languages
EMPOLITICON-NLP-and-ML-based-Approach-for-Context-Emotion-Classification-of-Political-Speeches-Code
Paper: https://ieeexplore.ieee.org/document/10141612 . Dataset: https://www.kaggle.com/datasets/efatazher/empoliticon-political-speeches-context-and-emotion
HumanRightsTracker
Tool for Tracking Human Rights Cases for Mexico and Latin America
project-dialogism-novel-corpus
The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.
korean-sarcasm
Construct text corpus data and corresponding model for automatic sarcasm detection on korean.
AwesomeKorean_Data
한국어 데이터 세트 링크
GeometryofCulture
Github site with code and data associated with the ASR paper on the Geometry of Culture