
iWAN Research Group's repositories

ArabicSurvey

A repository of survey and review papers in Arabic Natural Language Processing (ANLP).

81 10 1

Arabic-Topic-Modeling

BERT for Arabic Topic Modeling: An Experimental Study on BERTopic Technique

Language: Jupyter Notebook | 27 20

A-Monolingual-Arabic-Parallel-Corpus-

8 20

Saudi-Dialect-Irony-Dataset

The Saudi irony dataset was collected using the Twitter API and consists of 19,810 tweets, of which 8,089 are labeled as ironic.

License: CC0-1.0 | 7 10

The Arabic paraphrased parallel dataset is sourced from diverse origins and expanded through data augmentation. It is intended to support NLP work in education, search engines, content creation, social media, and domain-specific applications.

4 10

ArabicLLMs

This repository contains resources from the paper "A Survey of Large Language Models for Arabic Language and its Dialects".

4 10

Saudi-Bank-Sentiment-Dataset

This dataset contains customers’ sentiments on Twitter toward four Saudi banks. Of a total of 12k tweets, 8,669 are labeled "Negative", 2,143 are labeled "Positive", and 1,236 are labeled "Neutral".

License: GPL-3.0 | 4 10

Arabic-Humor

The Arabic humor dataset was collected using Twint and Sketch Engine and consists of 10k tweets.

3 10

ANLP_dataset

License: CC0-1.0 | 2 10

Arabic-Corpus-for-Error-Detection

2 10

Arabic-Patents

Language: Jupyter Notebook | License: CC0-1.0 | 2 10

ARC-WMI

Baseline results toward constructing the readability corpus ARC-WMI, a new Arabic collection of written medicine information annotated with readability levels.

License: NOASSERTION | 2 10