Jaskaran Singh (rasenganai)

Company: Dubverse.ai

Twitter: @Jass_AI


Organizations
IOSD

Jaskaran Singh's repositories

audio_clip_processing_pipeline

Audio Clips Processing Pipeline

Language: Python | Stargazers: 0 | Issues: 0

ENG-HIN-Machine-Translation

Translating English sentences into Hindi using NLP and a seq2seq model.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

Opinion-Summarization

Research project on abstractive opinion summarization.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

License: MIT | Stargazers: 0 | Issues: 0

bark

🔊 Text-Prompted Generative Audio Model

License: MIT | Stargazers: 0 | Issues: 0

bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

License: Apache-2.0 | Stargazers: 0 | Issues: 0

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

License: MIT | Stargazers: 0 | Issues: 0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1 kHz mono/stereo audio.

License: MIT | Stargazers: 0 | Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License: NOASSERTION | Stargazers: 0 | Issues: 0

espeak-ng

eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

License: NOASSERTION | Stargazers: 0 | Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0

phonemizer

Simple text to phones converter for multiple languages

License: GPL-3.0 | Stargazers: 0 | Issues: 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0

Sentimental_Extraction

A BERT-based approach to the Kaggle tweet-sentiment-extraction challenge, implemented as a TensorFlow pipeline using the high-level Keras API. The solution achieved 70.5% accuracy with 5 folds.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

Symptom-Disease-Ordering

This Disease Predictor app helps users identify a disease in real time by answering a series of questions. The selected symptoms are then processed to estimate the likelihood of a few particular ailments. Built with Flask, NLP, and unsupervised clustering.

Language: HTML | Stargazers: 0 | Issues: 0

textlesslib

Library for Textless Spoken Language Processing

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

torchcrepeV2

My own version of CREPE, a SOTA pitch-tracking tool, in PyTorch.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

License: Apache-2.0 | Stargazers: 0 | Issues: 0

tts-scores

Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models

Stargazers: 0 | Issues: 0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

License: MIT | Stargazers: 0 | Issues: 0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0