Heinrich Dinkel's starred repositories
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
netboot.xyz
Your favorite operating systems in one place. A network-based bootable operating system installer based on iPXE.
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
DeepFilterNet
Noise supression using deep filtering
ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
stable-audio-tools
Generative models for conditional audio generation
BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
tiny-audio-diffusion
A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)
VocalForge
Your one-stop solution for voice dataset creation
DTTNet-Pytorch
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
hf_transformers_custom_model_ced
🤗 Transformers custom model for CED.