AnkushMalaker

Ankush Malaker's repositories

speech-emotion-recognition

Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa et al.

Language: PureBasic · 12 2 5

pretrained-dcnn-attention-ser

TensorFlow implementation of "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"

Language: Python · 10 2 1

Knowledgebase

Knowledgebase repository started in March 2021

Language: JavaScript · 0 2 0

core

Open source home automation that puts local control and privacy first.

Language: Python · Apache-2.0 · 0 0 0

crispy

Crispy is a machine-learning algorithm for making video game montages efficiently. It uses a neural network to detect highlights in video game frames.

Language: Python · MIT · 0 1 0

easy-stt

An easy way to run inference locally with a transformer model, either live through a microphone or on local files. The first run must be online to download the necessary models.

Language: Python · GPL-3.0 · 0 2 5

excalidraw-recognition

Virtual whiteboard for sketching hand-drawn-style diagrams

Language: TypeScript · MIT · 0 1 0

homeassistant-satellite

Streaming audio satellite for Home Assistant

Language: Python · MIT · 0 0 0

icassp2021-mscnn-spu

Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)

0 1 0

laughr

Recurrent neural network audio manipulation tool that mutes the "laugh track" audio segments commonly found in sitcoms.

Language: Jupyter Notebook · MIT · 0 1 0

LiveEd-Smart-Teachers-App

LiveEd is a smart application for virtual teachers that lets them teach from anywhere in the world. Teachers can draw in the air as they would on a whiteboard and import images onto the screen to show them to viewers.

Language: Python · 0 1 0

nanoGPT-agent

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python · MIT · 0 1 0

obsidian-aicommander-plugin-local

MIT · 0 0 0

openWakeWord

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Language: Jupyter Notebook · Apache-2.0 · 0 0 0

pi-camera-stream-flask

[Docker] Create your own live camera stream using a Raspberry Pi 4

Language: HTML · MIT · 0 0 0

piper-recording-studio

Local voice recording for creating Piper datasets

MIT · 0 0 0

python-audio-interfaces

Easy audio interfaces in Python

Language: Python · Apache-2.0 · 0 1 0

pytorch-attention

Attention mechanisms implemented with basic math and PyTorch to build understanding. This is kept intentionally feature-poor to avoid confusion.

Language: Python · 0 2 0

RustProjects

Rust Projects I made while learning from "The Book"

Language: Rust · 0 2 0

TC-ResNet-PyTorch

0 2 0

translate-with-whisper-live

dibs on implementing a live stream version

Language: Jupyter Notebook · 0 0 0

whisper-autotune

Language: Python · 0 1 0

whisper-obsidian-plugin-local

Speech-to-text in Obsidian using Local Whisper

Language: TypeScript · MIT · 0 0 0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python · BSD-4-Clause · 0 1 0

wyoming-addons

Docker builds for Home Assistant add-ons using Wyoming protocol

Language: Python · MIT · 0 0 0

wyoming-distil-whisper

Wyoming protocol server for distil-whisper speech to text system

Language: Python · MIT · 0 0 0

yolo_v1_pytorch

PyTorch implementation of YOLO-v1 including training

Language: Shell · MIT · 0 1 0

aits

AnkushMalaker

asr-webservice