pppku's repositories

leetcode-master

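LeetCode problem-solving guide: a recommended order for 200 classic problems, with 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, and more than 50 mind maps, so that learning algorithms is no longer bewildering! 🔥🔥 Take a look, and you'll wish you had found it sooner! 🚀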

Language: Shell | Stargazers: 0 | Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

License: MIT | Stargazers: 0 | Issues: 0

onnx-simplifier

Simplify your onnx model

License: Apache-2.0 | Stargazers: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License: MIT | Stargazers: 0 | Issues: 0

consistency_models

Official repo for consistency models.

License: MIT | Stargazers: 0 | Issues: 0

Muskits

An opensource music processing toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License: NOASSERTION | Stargazers: 0 | Issues: 0

lyra

A Very Low-Bitrate Codec for Speech Compression

License: Apache-2.0 | Stargazers: 0 | Issues: 0

FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Stargazers: 0 | Issues: 0

viewer

ML models and internal tensors 3D visualizer

Stargazers: 0 | Issues: 0

SoundStream

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Stargazers: 0 | Issues: 0

PaSST

Efficient Training of Audio Transformers with Patchout

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 1 | Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

SVS_system

A system works on singing voice synthesis

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

LeetCodeProject

Write down the process of completing leetcode's projects.

Stargazers: 1 | Issues: 0
Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

deit

Official DeiT repository

License: Apache-2.0 | Stargazers: 0 | Issues: 0

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Twins

Two simple and effective designs of vision transformer, which is on par with the Swin transformer

License: Apache-2.0 | Stargazers: 0 | Issues: 0

mtg-jamendo-dataset

Metadata, scripts and baselines for the MTG-Jamendo dataset

License: Apache-2.0 | Stargazers: 0 | Issues: 0

CS-Notes

:books: Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, system design

Stargazers: 0 | Issues: 0

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

License: MIT | Stargazers: 0 | Issues: 0