RAGHUDATHESH G P (dathu)

dathu

Geek Repo

Company:MAHE

Location:Manipal, Karnataka, INDIA

Home Page:https://raghudathesh.weebly.com

Github PK Tool:Github PK Tool


Organizations
datheshraghu

RAGHUDATHESH G P's repositories

ADSP_Tutorials

Advanced Signal Processing Notebooks and Tutorials

Stargazers:0Issues:0Issues:0

PDV

MSIS PDV

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Hindi-ASR-Challenge-iitm

🎯 Speech Recognition Challenge by Speech Lab - IIT Madras

License:NOASSERTIONStargazers:0Issues:0Issues:0

Coursera

These are my learning exercices from Coursera

Stargazers:0Issues:0Issues:0

Speech_Feature_Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

Stargazers:0Issues:0Issues:0

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

License:MITStargazers:0Issues:0Issues:0

Speech_Signal_Processing_and_Classification

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].

License:MITStargazers:0Issues:0Issues:0

aws3transcribe

AWS Transcribe and S3 buckets management code. Feel free to contribute or fork.

License:MITStargazers:0Issues:0Issues:0

audino

Open source audio annotation tool for humans™

License:MITStargazers:0Issues:0Issues:0

Students-Performance-Analytics

Students Performance Evaluation using Feature Engineering, Feature Extraction, Manipulation of Data, Data Analysis, Data Visualization and at lat applying Classification Algorithms from Machine Learning to Separate Students with different grades

License:GPL-3.0Stargazers:0Issues:0Issues:0

semetrics

Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)

Stargazers:0Issues:0Issues:0

Mesh-Networking-based-Home-Automation

This Repo contains the code for all the board which I used to show how you can make home automation using Mesh Networking

Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

open-speech-corpora

A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License:MITStargazers:0Issues:0Issues:0

cisco-packet-tracer-MSOIS-2019

Workshop Material scenario files, document and PPT

Stargazers:1Issues:0Issues:0

ASR-System-for-Hindi-Language

The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project description is available at :- https://goo.gl/eQZkMP) : It containes the code for the following systems - 1) Monophone-HMM system built using HTK toolkit , 2)Monophone-HMM system built using Kaldi toolkit, 3)Triphone-HMM system built using Kaldi toolkit and 4)DNN-HMM system built using Kaldi toolkit

Stargazers:0Issues:0Issues:0

traditional-speech-enhancement

Spectral Subtraction, Wiener Filtering, MMSE

License:MITStargazers:0Issues:0Issues:0

MSOIS-LBD-RPI-2019

Code for Workshop

Language:PythonStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0
Language:ASPStargazers:0Issues:0Issues:0

awesome-kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Computer-Networking-A-Top-Down-Approach-NOTES

《计算机网络-自顶向下方法(原书第6版)》编程作业,Wireshark实验文档的翻译和解答。

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

numpy-100

100 numpy exercises (100% complete)

License:MITStargazers:0Issues:0Issues:0

ASR_Audio_Data_Links

A list of publically available audio data that anyone can download for ASR or other speech activities

License:Apache-2.0Stargazers:0Issues:0Issues:0

Speech-enhancement

Deep neural network based speech enhancement toolkit

License:GPL-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Data-Analysis

Data Science Using Python

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Smart_Student_Attendance_System_using_Facial_Recognition_in_ThingsBoard_IoT_Platfrom

Smart Student Attendance System using Facial Recognition in ThingsBoard IoT Platform

Language:PythonLicense:MITStargazers:0Issues:0Issues:0