Dinghao Zhou's starred repositories

jaxloudnorm

Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

julius

Fast PyTorch based DSP for audio and 1D signals

Language:PythonLicense:MITStargazers:409Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:61Issues:0Issues:0

audiotools

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Language:PythonLicense:MITStargazers:191Issues:0Issues:0

DAC-JAX

A JAX Implementation of the Descript Audio Codec

Language:PythonLicense:MITStargazers:15Issues:0Issues:0

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language:Jupyter NotebookLicense:MITStargazers:593Issues:0Issues:0

multihost_dataloading

Experimenting with how best to do multi-host dataloading

Language:PythonStargazers:6Issues:0Issues:0

SOFA

SOFA: Singing-Oriented Forced Aligner

Language:PythonLicense:MITStargazers:78Issues:0Issues:0

tokenizers

Go bindings for HuggingFace Tokenizer

Language:GoLicense:MITStargazers:64Issues:0Issues:0

QQMusicSpider

基于Scrapy的QQ音乐爬虫(QQ Music Spider),爬取歌曲信息、歌词、精彩评论等,并且分享了QQ音乐中排名前6400名的内地和港台歌手的49万+的音乐语料

Language:PythonStargazers:295Issues:0Issues:0

legado

阅读APP书源

Language:HTMLStargazers:1914Issues:0Issues:0

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:845Issues:0Issues:0

SpeechGPT

SpeechGPT Series: Speech Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:978Issues:0Issues:0

Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Language:PythonLicense:Apache-2.0Stargazers:260Issues:0Issues:0

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License:MITStargazers:2129Issues:0Issues:0

ebook

电子书

Stargazers:272Issues:0Issues:0
Language:PythonStargazers:667Issues:0Issues:0

HA2G

[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"

Language:PythonLicense:GPL-3.0Stargazers:121Issues:0Issues:0

highway

Performance-portable, length-agnostic SIMD with runtime dispatch

Language:C++License:Apache-2.0Stargazers:3713Issues:0Issues:0

iqiyi-parser

解析下载爱奇艺、哔哩哔哩、腾讯视频

Language:PythonLicense:MITStargazers:938Issues:0Issues:0

TorchKMeans

A torch-based implementation of K-Means and K-Means++

Language:PythonStargazers:16Issues:0Issues:0

torch_kmeans

PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.

Language:PythonLicense:MITStargazers:43Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:22306Issues:0Issues:0

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2048Issues:0Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:21903Issues:0Issues:0

xcodec

X-Codec: Unified Audio Tokenizer for Audio Language Model

Stargazers:14Issues:0Issues:0

music-dl

Music Searcher and Downloader. - 音乐搜索下载器。

Language:PHPLicense:MITStargazers:616Issues:0Issues:0

UMOE-Scaling-Unified-Multimodal-LLMs

The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"

Language:PythonStargazers:699Issues:0Issues:0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonLicense:BSD-3-ClauseStargazers:2979Issues:0Issues:0

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

Stargazers:1869Issues:0Issues:0