SeonjeongHwang's starred repositories
lm-vocab-trimmer
Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting irrelevant tokens from its vocabulary. This repository contains a python-library vocabtrimmer, that remove irrelevant tokens from a multilingual LM vocabulary for the target language.
WeeklyArxivTalk
[Zoom & Facebook Live] Weekly AI Arxiv 시즌2
pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
WikiHow-Dataset
A Large Scale Text Summarization Dataset
conversational-QG
[ACL 2019]: Interconnected Question Generation with Coreference Alignment and Conversation Flow Modeling
Question-Generation-Paper-List
A summary of must-read papers for Neural Question Generation (NQG)
GPT2_Summarization
Finetune GPT2 for text summarization
roberta-squad
roBERTa training for SQuAD
DistilKoBERT
Distillation of KoBERT from SKTBrain (Lightweight KoBERT)
AwesomeMRC
IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading Comprehension (AAAI 2021)