There are 2 repositories under farsi-datasets topic.
CLIPfa: Connecting Farsi Text and Images
Persian/Farsi text to speech(TTS) training using coqui tts
An Image Dataset of Printed Farsi Text for OCR Research
The first intelligent Persian reverse dictionary
Official github repository, Persis: A persian font recognition pipeline using convolutional neural networks.
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
Persian to Finglish dataset with all the sentences voice for TTS dataset used to train tacotron2
In this repository, the wavLM model is used for quality and poor quality data for speaker verification task, and the PyCM library is used for evaluation.
This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.
Simple Script To Crawl Data From Persian News Agencies Including Fars, Mehr.
This is a trained model to recognize Farsi digits.