Franck Dernoncourt's repositories
pubmed-rct
PubMed 200k RCT dataset: a large dataset for sequential sentence classification.
awesome-text-summarization
A curated list of resources dedicated to text summarization
Stack-Exchange-Image-Dataset
Dump of all the images uploaded to Stack Exchange (i.stack.imgur.com/*)
vtuworkshop.github.io
website for Video Transcript Understanding workshop
ABSA-Datasets
Has a list of all the publicly available datasets for Aspect-based Sentiment Analysis along with the matching subtasks for each.
acl-anthology
Data and software for building the ACL Anthology.
ai-platform-samples
Official Repo for Google Cloud AI Platform
BehanceQA
A Dataset for Identifying Question-Answer Pairs in Video Transcripts
ChineseNLP
Datasets, SOTA results of every fields of Chinese NLP
Ego2Map-NaViT
Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)
EventExtractionPapers
A list of NLP resources focused on event extraction task
LAL-Parser
Neural Adobe-UCSD Parser, the current State of the Art in Constituency and Dependency Parsing.
long-summarization
Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"
MadDog
A Web-based System for Acronym Identification and Disambiguation
meta_cross_nlu_qa
Code for reproducing meta-learning for cross-lingual transfer learning in NLU and QA
muffin-aaai23.github.io
Website for Muffin Workshop at AAAI 2023
sent-fusion-transformers
Code for the EMNLP 2020 paper "Learning to Fuse Sentences with Transformers for Summarization"