elwintay's starred repositories
LongDocSum
Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"
document2slides
This repository contains the code to reconstruct the training dataset from NLP/ML Papers in PDF format together with their corresponding slides.
temporal-graph-gen
Pre-trained models for our work on Temporal Graph Generation
TaBERT
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parsers original encoder to compute representations for utterances and table schemas (columns).
Giveme5W1H
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?