t13m's repositories

kaldi-readers-for-tensorflow

readers that enable reading kaldi ark in tensorflow

athena

an open-source implementation of sequence-to-sequence based speech processing engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0
Language:TypeScriptStargazers:0Issues:1Issues:0

DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

DeepSpeech

A TensorFlow implementation of Baidu's DeepSpeech architecture

Language:C++License:MPL-2.0Stargazers:0Issues:2Issues:0

electron-better-ipc

Simplified IPC communication for Electron apps

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

GoBigger

OpenDILab Multi-Agent Environment

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kaldi

This is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:1Issues:0

LabSound

:microscope: :speaker: graph-based audio engine

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

LatticeCtc

A Tensorflow extension to calculate CTC loss against lattices instead of linear sequences.

Language:C++Stargazers:0Issues:2Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

leva

🌋 React-first components GUI

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

naudiodon

Node.js stream bindings for PortAudio

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

NeuFA

Neural network-based forced alignment with bidirectional attention mechanism

Language:PythonStargazers:0Issues:0Issues:0

node-audio

Graph-based audio api for Node.js based on LabSound and JUCE

Language:C++Stargazers:0Issues:0Issues:0

OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pydrobert-kaldi

SWIG bindings for Kaldi I/O, built with Conda

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Soundpipe

A lightweight music DSP library.

Language:CLicense:MITStargazers:0Issues:0Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

Text-to-sound-Synthesis

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

Language:PythonStargazers:0Issues:0Issues:0

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

Language:TeXLicense:CC0-1.0Stargazers:0Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech)

License:Apache-2.0Stargazers:0Issues:0Issues:0

warp-ctc

Fast parallel CTC.

Language:CudaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

waveform-playlist

Multitrack Web Audio editor and player with canvas waveform preview. Set cues, fades and shift multiple tracks in time. Record audio tracks or provide audio annotations. Export your mix to AudioBuffer or WAV! Project inspired by Audacity.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

wenet

Transformer based ASR Engine.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CLicense:MITStargazers:0Issues:0Issues:0