Varun Gangal's starred repositories
data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
cohere-python
Python Library for Accessing the Cohere API
NLP4SocialGood_Papers
A reading list of up-to-date papers on NLP for Social Good.
Shakespearizing-Modern-English
Code for "Jhamtani H.*, Gangal V.*, Hovy E. and Nyberg E. Shakespearizing Modern Language Using Copy-Enriched Sequence to Sequence Models" Workshop on Stylistic Variation, EMNLP 2017
metaphor-in-context
Code for the paper "Neural Metaphor Detection in Context".
SimileGeneration-EMNLP2020
Code for SCOPE (Style transfer through COmmonsense PropErty) , a style transfer approach to convert literal sentences to similes
WriterForcing
This repository contains the code for our paper on WriterForcing published at the ACL workshop
VerbPhraseEllipsis
Hosting a cleaned annotated dataset by Leif Arda Nielsen, as described in his PhD thesis: A corpus-based study of Verb Phrase Ellipsis Identification and Resolution
EllipsisDetection
This is my Master's thesis: Automatic Ellipsis Detection - Machine Learning vs. Rule-Based Approach