Rumeysa Keskin's repositories
Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Speaker-Verification
Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, SpeakerNet, TitaNet-L).
Image-Caption-Generation
InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks
EuroSat-Satellite-CNN-and-ResNet
Classifying custom image datasets by creating Convolutional Neural Networks and Residual Networks from scratch with PyTorch
ASR-fine-tuning-for-low-resource-languages
Transfer learning for ASR with subword encoding CTC model (NVIDIA NeMo Citrinet) on low-resource languages
Speech-Datasets-for-ASR
Download speech datasets (English and non-English) for Automatic Speech Recognition
YOLO-Darknet-Video-and-Image-Detection-Traffic-Signs
YOLO Darknet: Traffic sign detection on image and video
Conda-Jupyter-Docker
Create conda environment and launch jupyter notebook in Anaconda docker container
Question-Answering-BERT
Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning
Archiconda3-for-ARM64-Jetson-TX1-TX2
Create light-weight conda environment for ARM64 devices
dtw-compare-audio-files
Compute the MFCCs and measure (dis)similarity between two audio files using DTW
Turkish-Text2Speech-Dataset
Download Turkish female and male speech datasets from Mozilla Common Voice for speech synthesis (TTS) systems
ASR-Quantization
Post-training quantization on Nvidia Nemo ASR model
Image-Captioning
Image captioning with a benchmark of CNN-based encoder and GRU-based inject-type (init-inject, pre-inject, par-inject) and merge decoder architectures
Image-Classification-InceptionV3
Transfer learning using Inception V3 for custom image classification dataset with TensorFlow and Keras
mms-turkish-tts
Turkish text to speech model that the part of Facebook's Massively Multilingual Speech
Custom-Object-Detection-PyTorch
Custom object detection on a video dataset using PyTorch Faster RCNN
NGC-docker
NVIDIA GPU Cloud setup and building NVIDIA Containers for Jetson and JetPack
jetson-voice
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT