RAGHUDATHESH G P's repositories
ADSP_Tutorials
Advanced Signal Processing Notebooks and Tutorials
PDV
MSIS PDV
Hindi-ASR-Challenge-iitm
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
Coursera
These are my learning exercices from Coursera
Speech_Feature_Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Speech_Signal_Processing_and_Classification
Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].
aws3transcribe
AWS Transcribe and S3 buckets management code. Feel free to contribute or fork.
audino
Open source audio annotation tool for humans™
Students-Performance-Analytics
Students Performance Evaluation using Feature Engineering, Feature Extraction, Manipulation of Data, Data Analysis, Data Visualization and at lat applying Classification Algorithms from Machine Learning to Separate Students with different grades
semetrics
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Mesh-Networking-based-Home-Automation
This Repo contains the code for all the board which I used to show how you can make home automation using Mesh Networking
ArduinoFreeRTOS
MSOISES
open-speech-corpora
A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
cisco-packet-tracer-MSOIS-2019
Workshop Material scenario files, document and PPT
ASR-System-for-Hindi-Language
The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project description is available at :- https://goo.gl/eQZkMP) : It containes the code for the following systems - 1) Monophone-HMM system built using HTK toolkit , 2)Monophone-HMM system built using Kaldi toolkit, 3)Triphone-HMM system built using Kaldi toolkit and 4)DNN-HMM system built using Kaldi toolkit
traditional-speech-enhancement
Spectral Subtraction, Wiener Filtering, MMSE
MSOIS-LBD-RPI-2019
Code for Workshop
awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Computer-Networking-A-Top-Down-Approach-NOTES
《计算机网络-自顶向下方法(原书第6版)》编程作业,Wireshark实验文档的翻译和解答。
numpy-100
100 numpy exercises (100% complete)
ASR_Audio_Data_Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Speech-enhancement
Deep neural network based speech enhancement toolkit
Data-Analysis
Data Science Using Python
Smart_Student_Attendance_System_using_Facial_Recognition_in_ThingsBoard_IoT_Platfrom
Smart Student Attendance System using Facial Recognition in ThingsBoard IoT Platform