Tomiinek

Tomáš Nekvinda's repositories

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language:PythonMIT838 31 79

Star_Tracker

Arduino DIY telescope GoTo for arbitrary mounts.

Language:C++MIT60 120

MultiWOZ_Evaluation

Unified MultiWOZ evaluation scripts for the context-to-response task.

Language:PythonMIT59 4 6

Blizzard2013_Segmentation

Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.

Language:Shell44 2 3

Aargh

Language:PythonMIT12 20

WaveRNN

WaveRNN Vocoder + TTS

Language:PythonMIT11 10

Unity_Tower_Defence

Classical top-down tower defence made in Unity 3D game engine.

Language:C#MIT6 10

Face_Cleaner

Automated trimming and cleaning of 3D facial scans

Language:C++NOASSERTION4 10

Toom_Rendering_Engine

Partial remake of the original Doom 1. Written as an assignment during a programming course. Uses doom-like rendering.

Language:C++GPL-3.03 10

npfl114

Materials for the Deep Learning -- ÚFAL course NPFL114

Language:PythonNOASSERTION1 10

Pascal_Star_Fighter

A simple Star Fighter game which I created as a final assignment in the introductory course of programming during the first semester at the uni.

Language:PascalGPL-3.01 10

Sequicity_Knowledge_Base

Implementation of knowledge base for the sequicity model.

Language:PythonMIT1 10

tomiinek.github.io

Language:HTML1 20

UE4_Endless_Racer

Endless racer (runner) created using Blueprints Visual Scripting system of the Unreal Engine 4.

MIT1 20

WaveRNN-Pytorch

Fatcord's Alternative WaveRNN (Faster training)

Language:PythonMIT100

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.0000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT000

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language:CudaNOASSERTION000

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonApache-2.0000

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Language:PythonMIT000

Phaser3_Space_Shooter

A simple 2D shooter exploiting features of the Phaser 3 framework.

Language:TypeScriptMIT020

pyreaper

A python wrapper for REAPER

Language:CythonNOASSERTION000

REAPER

C-interface for REAPER (see cwrap/ for details)

Language:C++Apache-2.0000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonMIT000

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language:PythonMIT000

text

Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.

Language:C++MIT000

TorchPQ

Efficient implementations of Product Quantization and its variants using Pytorch and CUDA

Language:CudaMIT000

TSP_Kiwi

Language:C++020

tts-scores

Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models

Language:Python000

WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Language:PythonMIT000