auzxb

auzxb

Geek Repo

Location:Shenzhen

Github PK Tool:Github PK Tool

auzxb's repositories

Stargazers:2Issues:0Issues:0

actionformer_release

Code release for ActionFormer (ECCV 2022)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AudioCaption

Audio captioning recipe

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

License:MITStargazers:0Issues:0Issues:0

BigVGAN

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

BigVGAN-1

Official PyTorch implementation of BigVGAN (ICLR 2023)

Stargazers:0Issues:0Issues:0

chatbot-list

行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍

Stargazers:0Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

License:CC0-1.0Stargazers:0Issues:0Issues:0

EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

EnCLAP

Official Implementation of EnCLAP

License:MITStargazers:0Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License:NOASSERTIONStargazers:0Issues:0Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

icassp2022-vocal-transcription

Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"

Stargazers:0Issues:0Issues:0

InternVideo

Video Foundation Models & Data for Multimodal Understanding

License:Apache-2.0Stargazers:0Issues:0Issues:0

lama-cleaner

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

License:Apache-2.0Stargazers:0Issues:0Issues:0

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

License:MITStargazers:0Issues:0Issues:0

MotionDiffuse

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

OPARL

This is the repository of the paper "Online Game Level Generation from Music" in CoG 2022

Stargazers:0Issues:0Issues:0

pop2piano

Official Repo of the paper "Pop2Piano : Pop Audio-based Piano Cover Generation"

Stargazers:0Issues:0Issues:0

SER-datasets

A collection of datasets for the purpose of emotion recognition/detection in speech.

License:MITStargazers:0Issues:0Issues:0
Language:CSSStargazers:0Issues:0Issues:0

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

License:MITStargazers:0Issues:0Issues:0

SoundStorm-pytorch-1

Google's SoundStorm: Efficient Parallel Audio Generation

License:MITStargazers:0Issues:0Issues:0

SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

License:MITStargazers:0Issues:0Issues:0

Text-to-sound-Synthesis

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

Language:PythonStargazers:0Issues:0Issues:0

video-bgm-generation

Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)

License:MITStargazers:0Issues:0Issues:0

wechat-chatgpt

Use ChatGPT On Wechat via wechaty

Stargazers:0Issues:0Issues:0