Heinrich Dinkel (RicherMans)

RicherMans

Geek Repo

Company:Xiaomi

Location:China, Beijing

Home Page:richermans.github.io

Github PK Tool:Github PK Tool

Heinrich Dinkel's starred repositories

mojo

The Mojo Programming Language

Language:MojoLicense:NOASSERTIONStargazers:21789Issues:262Issues:1784

inter

The Inter font family

Language:PythonLicense:OFL-1.1Stargazers:17105Issues:162Issues:550

sing-box

The universal proxy platform

Language:GoLicense:NOASSERTIONStargazers:14505Issues:118Issues:1418

candle

Minimalist ML framework for Rust

Language:RustLicense:Apache-2.0Stargazers:13974Issues:148Issues:568

monaspace

An innovative superfamily of fonts for code

Language:TypeScriptLicense:OFL-1.1Stargazers:12865Issues:48Issues:181

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10382Issues:138Issues:307

netboot.xyz

Your favorite operating systems in one place. A network-based bootable operating system installer based on iPXE.

Language:JinjaLicense:Apache-2.0Stargazers:8195Issues:109Issues:470

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6284Issues:61Issues:75

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonLicense:NOASSERTIONStargazers:2104Issues:44Issues:63

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2033Issues:30Issues:257

ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Language:PythonLicense:NOASSERTIONStargazers:1749Issues:32Issues:0

vivid

A themeable LS_COLORS generator with a rich filetype datebase

Language:RustLicense:Apache-2.0Stargazers:1613Issues:20Issues:64

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:1500Issues:42Issues:32

AudioSep

Official implementation of "Separate Anything You Describe"

Language:PythonLicense:MITStargazers:1452Issues:66Issues:21

fairseq2

FAIR Sequence Modeling Toolkit 2

Language:PythonLicense:MITStargazers:593Issues:16Issues:84

onnx2tf

Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.

Language:PythonLicense:MITStargazers:584Issues:9Issues:221

klassy

Klassy is a highly customizable binary Window Decoration, Application Style and Global Theme plugin for recent versions of the KDE Plasma desktop.

BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Language:PythonLicense:MITStargazers:294Issues:10Issues:25

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Language:PythonLicense:MITStargazers:292Issues:16Issues:42

SONAR

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Language:PythonLicense:NOASSERTIONStargazers:279Issues:14Issues:13

VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language:PythonLicense:MITStargazers:235Issues:10Issues:21

VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Language:Jupyter NotebookLicense:MITStargazers:197Issues:18Issues:22

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Language:PythonLicense:Apache-2.0Stargazers:146Issues:5Issues:4

tiny-audio-diffusion

A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)

Language:PythonLicense:MITStargazers:133Issues:6Issues:3

StoryTTS

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

Language:HTMLLicense:NOASSERTIONStargazers:126Issues:18Issues:1

VocalForge

Your one-stop solution for voice dataset creation

Language:PythonLicense:MITStargazers:101Issues:9Issues:12

DTTNet-Pytorch

An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation

Language:PythonLicense:Apache-2.0Stargazers:61Issues:4Issues:2
Language:C++License:NOASSERTIONStargazers:25Issues:5Issues:1

hf_transformers_custom_model_ced

🤗 Transformers custom model for CED.

Language:PythonLicense:Apache-2.0Stargazers:5Issues:2Issues:1
Language:PythonLicense:MITStargazers:3Issues:0Issues:0