mounicam / lexical_simplification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification

This repository contains the code and resources from the following paper

Repo Structure:

  1. word_complexity_lexicon: Lexicon with complexity scores for ~15000 most frequent words from Google Ngram Corpus. The scores are calculated by aggregating over human ratings. We release both the aggregated ratings and the individual ratings by each annotator.

  2. SimplePPDBpp: SimplePPDB++ resource consisting of around 14.1 million paraphrase rules along with their readability scores.

  3. neural_readability_ranker: Code for our neural readability ranker model.

Citation

Please cite if you use the above resources for your research

@InProceedings{EMNLP-2018-Maddela,
  author = 	"Maddela, Mounica and Xu, Wei",
  title = 	"A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification",
  booktitle = 	"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)",
  year = 	"2018",
}

About

License:MIT License


Languages

Language:Python 100.0%