Lorenz Diener (halcy)

halcy

Geek Repo

Company:Microsoft

Location:Germany

Home Page:http://halcy.de/

Twitter:@halcy

Github PK Tool:Github PK Tool


Organizations
cognitive-systems-lab
SVatG

Lorenz Diener's starred repositories

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Language:PythonLicense:NOASSERTIONStargazers:10777Issues:230Issues:89

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:8711Issues:67Issues:360

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3812Issues:76Issues:106

BitmapFonts

My collection of bitmap fonts pulled from various demoscene archives over the years

OpenSeeFace

Robust realtime face and facial landmark tracking on CPU with Unity integration

Language:PythonLicense:BSD-2-ClauseStargazers:1427Issues:23Issues:54

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1140Issues:27Issues:74

Mastodon.py

Python wrapper for the Mastodon ( https://github.com/mastodon/mastodon/ ) API.

Language:PythonLicense:MITStargazers:875Issues:29Issues:235

Lip2Wav

This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"

Language:PythonLicense:MITStargazers:694Issues:27Issues:39

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

Language:PythonLicense:MITStargazers:376Issues:9Issues:18

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonLicense:MITStargazers:372Issues:7Issues:22

torch-nansypp

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Language:PythonLicense:MITStargazers:141Issues:28Issues:4

AnimeFaceNotebooks

notebooks and some data for playing with animeface stylegan2 and deepdanbooru

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:118Issues:9Issues:3

silent_speech

Code for voicing silent speech from EMG. Official repository for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Improved Model for Voicing Silent Speech" at ACL 2021. Also includes code for converting silent speech to text.

Language:PythonLicense:MITStargazers:112Issues:7Issues:5

PLC-Challenge

This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.

Language:PythonLicense:MITStargazers:73Issues:8Issues:6

biosignalsnotebooks

biosignalsnotebooks project includes a set of Jupyter Notebooks explaining some processing tasks which have been specially designed for biosignalsplux and OpenSignals users. A Python package is also present, containing some functions to support biosignalsnotebooks notebooks or to be used independently.

Language:Jupyter NotebookLicense:MITStargazers:72Issues:5Issues:8

mdctGAN

Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"

Language:PythonLicense:NOASSERTIONStargazers:58Issues:3Issues:3

Krankenkassen-ohne-Homoeopathie

Hier stehen die Krankenkassen die keine Homöopathie anbieten

fediiverse

mastodon for the miiverse applet on your nintendo 3ds!!

Language:PythonLicense:GPL-3.0Stargazers:26Issues:2Issues:3

nordlicht19

3ds demo for nordlicht 2019

Language:CStargazers:25Issues:2Issues:0

RaccoonsAteMyTicket

Nintendo 3DS Demo released at Evoke 2023

Language:CStargazers:11Issues:2Issues:0

AnimeOrAI

Can you pick whether this image is from a real anime or entirely AI generated?

Language:PythonStargazers:10Issues:2Issues:0

closed-loop-seeg-speech-synthesis

Corresponding source code for the study "Real-time Synthesis of Imagined Speech Processes from Minimally Invasive Recordings of Neural Activity"

Language:PythonStargazers:9Issues:2Issues:0

xtouchmini

Utilities for interacting with the Behringer X-Touch Mini MIDI controller

Language:RustLicense:MITStargazers:5Issues:3Issues:1

rasterizer

A very very simple software rasterizer.

Language:CStargazers:4Issues:3Issues:0

mastodon

A GNU Social-compatible microblogging server

Language:RubyLicense:AGPL-3.0Stargazers:3Issues:3Issues:0

eeVR

Blender addon for rendering equirectangular and dome projection images using the eevee rendering engine.

Language:PythonLicense:GPL-3.0Stargazers:1Issues:1Issues:0

DiscordSweeper

Making Minesweeper Games

Language:JavaScriptLicense:MITStargazers:1Issues:1Issues:0

evol

evolution meme

Language:JavaScriptStargazers:1Issues:2Issues:0