GEM-benchmark / NL-Augmenter

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data augmentation methods and filters that require the entire dataset

markusbayer109 opened this issue · comments

Hello!

First of all, thanks for the effort to build such a collaborative framework!

At the moment, the augmentation methods and filters are only provided with a single example per call. Since there are many techniques that need the whole dataset with the class information (to be conditioned on the class, to interpolate instances, etc.), I wanted to ask if there are plans to add this to this framework?

Hi @markusbayer109 apologies for the late response! There aren't any immediate plans but it is definitely in the scope of the repository. It could probably be an extension of the dataset class which @Nickeilf created.