There are 2 repositories under farsi-datasets topic.
Persian/Farsi text to speech(TTS) training using coqui tts
CLIPfa: Connecting Farsi Text and Images
A comprehensive dataset for determining gender based on Persian names, enriched with English representations.
An Image Dataset of Printed Farsi Text for OCR Research
The first intelligent Persian reverse dictionary
Official github repository, Persis: A persian font recognition pipeline using convolutional neural networks.
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
Persian to Finglish dataset with all the sentences voice for TTS dataset used to train tacotron2
In this repository, the wavLM model is used for quality and poor quality data for speaker verification task, and the PyCM library is used for evaluation.
A Custom ANN Model for Classifying Farsi News Topics With User Interface.
Simple Script To Crawl Data From Persian News Agencies Including Fars, Mehr.
Welcome to the Persian Last Names Dataset, a comprehensive collection of over 100,000 Persian surnames accompanied by their respective frequencies. This dataset is curated from a substantial real-world sample of more than 10 million records, ensuring reliable and representative data for various applications.
This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.
This repository hosts BonyadAI, a Persian question answering AI Model. We developed an initial web crawler and scraper to gather the dataset. The second phase involved building a machine learning model based on word embeddings and NLP techniques. This AI model operates end-to-end, receiving user voice input and providing responses in Persian voice.
The Small Farsi-English Internet Dictionary is a bilingual dictionary that has been in development for many years as part of a language teaching programme at the Department of Oriental Studies at the University of Vienna.
The Small Farsi-English Internet Dictionary is a bilingual dictionary that has been in development for many years as part of a language teaching programme at the Department of Oriental Studies at the University of Vienna.