swekshajha12 / SIH

Comparision of human voice samples

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Voice based login system

This project uses voice as a metric to authorise login based on a Gausian Mixture Model (GMM) trained on coefficients obtained from Mel Frequency Cepstral coefficients (MFCC)

Training

The voice sample is first cleaned to get rid of unnecessary noise. MFCC is calculated for each sample and the data is utilized by a GMM to cluster the voice samples based on the MFCC values.

Deployment

The project runs on Django. The web interface prompts for the user to speak. The recorded voice is then matched with the trained GMM model to find the best cluster it fits into. If the match is above a certain threshold value (say 90%) then the user is authorised.

Improvements

  1. Noise reduction must be taken care
  2. Distinction of an actual voice from a recording
  3. A better understanding of GMM and the clusters made ( visualization of the clusters)
  4. Enhanced training of the model (with more dataset)

About

Comparision of human voice samples


Languages

Language:HTML 42.3%Language:JavaScript 37.6%Language:Python 14.2%Language:CSS 5.9%