Ankush Malaker (AnkushMalaker)

AnkushMalaker

Geek Repo

Company:Voxela.ai

Location:Bengaluru, India

Twitter:@MalakerAnkush

Github PK Tool:Github PK Tool

Ankush Malaker's repositories

speech-emotion-recognition

Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.

pretrained-dcnn-attention-ser

Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"

aits

AI Text Search

Language:PythonStargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0

asr-webservice

ASR Webservice API

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Knowledgebase

Knowledgebase repository started in month of March 2021

Language:JavaScriptStargazers:0Issues:2Issues:0

core

:house_with_garden: Open source home automation that puts local control and privacy first.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

crispy

Crispy is a machine-learning algorithm to make video-games montages efficiently. It uses a neural network to detect highlights in the video-game frames

License:MITStargazers:0Issues:0Issues:0

easy-stt

Easy way to use one of transformer models to do inference locally. Can be done live through mic, or on local files. The first run needs to be online to download necessary models.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:2Issues:5

excalidraw-recognition

Virtual whiteboard for sketching hand-drawn like diagrams

Language:TypeScriptLicense:MITStargazers:0Issues:1Issues:0

homeassistant-satellite

Streaming audio satellite for Home Assistant

License:MITStargazers:0Issues:0Issues:0

icassp2021-mscnn-spu

Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)

Stargazers:0Issues:1Issues:0

laughr

Recurrent neural network audio manipulation tool to mute "laugh track" audio segments found commonly in sitcoms.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

LiveEd-Smart-Teachers-App

LiveEd is a smart application meant for virtual teachers allowing them to teach from anywhere in the world. It allows teachers to draw in the air as they would using a whiteboard and also import images into the screen to show them to the viewers.

Language:PythonStargazers:0Issues:1Issues:0

nanoGPT-agent

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:0Issues:0

openWakeWord

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pi-camera-stream-flask

[Docker] Create your own live camera stream using a Raspberry Pi 4

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

piper-recording-studio

Local voice recording for creating Piper datasets

License:MITStargazers:0Issues:0Issues:0

python-audio-interfaces

Easy audio interfaces in python

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pytorch-attention

Attention mechanisms implemented with basic math and pytorch to gain an understanding. This is kept intentionally feature-poor so as to not be confusing.

Language:PythonStargazers:0Issues:2Issues:0

RustProjects

Rust Projects I made while learning from "The Book"

Language:RustStargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0

translate-with-whisper-live

dibs on implementing a live stream version

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

whisper-obsidian-plugin-local

Speech-to-text in Obsidian using Local Whisper

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:0Issues:1Issues:0

wyoming-addons

Docker builds for Home Assistant add-ons using Wyoming protocol

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

wyoming-distil-whisper

Wyoming protocol server for distil-whisper speech to text system

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

yolo_v1_pytorch

PyTorch implementation of YOLO-v1 including training

Language:ShellLicense:MITStargazers:0Issues:1Issues:0