There are 1 repository under persian-dataset topic.
The all-in-one AI library for Persian, supporting a wide variety of tasks and modalities!
Persian/Farsi text to speech(TTS) training using coqui tts
A comprehensive dataset for determining gender based on Persian names, enriched with English representations.
Persian text-to-speech streamlit interface
Persian(Farsi) Wikipedia Dataset | دیتاست ویکی پدیا فارسی شامل تمامی مقالات فارسی تا تاریخ 12 مرداد 1399
An Image Dataset of Printed Farsi Text for OCR Research
Standardize your Persian text: Preprocessing, Embedding, and more!
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
Official github repository, Persis: A persian font recognition pipeline using convolutional neural networks.
ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It includes recordings from different speakers and is designed to be used for training and evaluating text-to-speech models.
A modification on the Sharif Emotional Speech Database
Dataset for teenagers' chat in Telegram groups (Persian)
NSURL 2019- Task 7, Named Entity Recognition in Persian
Loading pquad_public dataset from hugging face
Welcome to the Persian Last Names Dataset, a comprehensive collection of over 100,000 Persian surnames accompanied by their respective frequencies. This dataset is curated from a substantial real-world sample of more than 10 million records, ensuring reliable and representative data for various applications.
The projects from the NLP course which I took from senior.
A Farsi (Persian) Semantic Similarity Measurement Dataset (FarSSiM)
Iranian/Persian Datasets. دیتاستهای فارسی و ایرانی
Persian sms dataset
Use Siebert and BERTopic Model on Persian Dataset
Persian News Dataset
parallel annotated data for Relation Extraction and Paraphrase Identification in Persian
final project of information retrieval course - aut 1402-1403 fall