yearnyeen ho's starred repositories

autocut

Cut videos with a text editor

Language: Python | License: Apache-2.0 | Stargazers: 6366 | Issues: 49 | Issues: 82

KnowledgeGraphData

Open-source download of the largest Chinese knowledge graph to date, at a scale of 140 million

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language: Python | License: Apache-2.0 | Stargazers: 4311 | Issues: 57 | Issues: 121

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: MIT | Stargazers: 3278 | Issues: 58 | Issues: 70

riffusion

Stable diffusion for real-time music generation

Language: Python | License: MIT | Stargazers: 3264 | Issues: 38 | Issues: 92

lovely-tensors

Tensors, ready for human consumption

Language: Jupyter Notebook | License: MIT | Stargazers: 1061 | Issues: 11 | Issues: 19

open-metric-learning

Metric learning and retrieval pipelines, models and zoo.

Language: Python | License: Apache-2.0 | Stargazers: 825 | Issues: 11 | Issues: 222

minDiffusion

Self-contained, minimalistic implementation of diffusion models with PyTorch.

musika

Fast Infinite Waveform Music Generation

Language: Python | License: MIT | Stargazers: 665 | Issues: 23 | Issues: 39

ptranking

Learning to Rank in PyTorch

Language: Python | License: MIT | Stargazers: 453 | Issues: 10 | Issues: 19

pop2piano

Official repository for the paper "Pop2Piano: Pop Audio-based Piano Cover Generation"

diffq

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

Language: Python | License: NOASSERTION | Stargazers: 230 | Issues: 11 | Issues: 8

PyTSMod

An open-source Python library for audio time-scale modification.

Language: Python | License: GPL-3.0 | Stargazers: 185 | Issues: 8 | Issues: 7
Language: Python | License: Apache-2.0 | Stargazers: 163 | Issues: 8 | Issues: 7

pyrubberband

Python wrapper for rubberband

Language: Python | License: ISC | Stargazers: 153 | Issues: 5 | Issues: 11

awesome-semantic-search

Semantic search with embeddings: index anything

language_modeling_via_stochastic_processes

Language modeling via stochastic processes. Oral @ ICLR 2022.

Contrastive_Search_Is_What_You_Need

[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation

Language: Python | Stargazers: 116 | Issues: 2 | Issues: 0

music-inpainting-ts

A collection of web interfaces for AI-assisted interactive music creation

Language: TypeScript | License: GPL-3.0 | Stargazers: 106 | Issues: 10 | Issues: 12

FxNorm-automix

FxNorm-Automix: implementation of automatic music mixing systems. We show how wet music data can be repurposed to train a fully automatic mixing system.

Language: Python | License: MIT | Stargazers: 78 | Issues: 5 | Issues: 1

ngram-language-model

Python implementation of an N-gram language model with Laplace smoothing and sentence generation.

Language: Python | Stargazers: 76 | Issues: 2 | Issues: 0

music-audio-representations

Results and Models for Learning Audio Representations of Music Content

Language: Python | License: GPL-3.0 | Stargazers: 76 | Issues: 8 | Issues: 3

vectory

Vectory provides a collection of tools to track and compare embedding versions.

Language: Python | License: MIT | Stargazers: 65 | Issues: 5 | Issues: 5

NU-Wave-pytorch

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

Language: Python | License: MIT | Stargazers: 38 | Issues: 3 | Issues: 1

sketching_piano_expression

An implementation of the accepted paper "Sketching the Expression: Flexible Rendering of Expressive Piano Performance with Self-Supervised Learning"

audio-language-embeddings

Audio-Language Embedding Extractor (PyTorch)

Language: Python | Stargazers: 14 | Issues: 0 | Issues: 0

unconditional-diff-STFT

Unconditional music synthesis using a diffusion model in the STFT domain

Language: Jupyter Notebook | License: MIT | Stargazers: 12 | Issues: 1 | Issues: 0