There are 0 repositories under the nlp-dataset topic.
Persian Swear Dataset: a dataset of inappropriate and offensive Persian words that you can use in production to filter unwanted content from text.
AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African Languages (https://afrisenti-semeval.github.io/)
A list of Romanian NLP Datasets
Dataset for web-scale information extraction.
Persian Slang Words (dataset)
Persian SMS dataset
Persian News Dataset
A novel Romanian-language dataset for offensive message detection, with comments from a local Romanian news website (Stiri de Cluj) manually annotated into five classes
RO-Offense: A Novel Romanian Dataset for Offensive Language in Online Comments