aaronchen's repositories

3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition, and speaker diarization.

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-chatgpt-dataset

Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!

License:GPL-3.0Stargazers:0Issues:0Issues:0

Awesome-Diffusion-Personalization

A collection of resources on personalization with diffusion models.

License:MITStargazers:0Issues:0Issues:0

backgroundremover

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

Stargazers:0Issues:0Issues:0

CMGAN

Conformer-based Metric GAN for speech enhancement

License:MITStargazers:0Issues:0Issues:0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

GenerativeDiffusionPrior

Generative Diffusion Prior for Unified Image Restoration and Enhancement (CVPR2023)

Stargazers:0Issues:0Issues:0

Hitomi-Downloader

:cake: Desktop utility to download images/videos/music/text from various websites, and more.

Stargazers:0Issues:0Issues:0

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ImageReward

ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

License:Apache-2.0Stargazers:0Issues:0Issues:0

Inter-SubNet

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

langchain-ChatGLM

langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答

Language:VueLicense:Apache-2.0Stargazers:0Issues:0Issues:0

loopy

A data framework for music information retrieval focusing on electronic music.

License:GPL-3.0Stargazers:0Issues:0Issues:0

Mug-Diffusion

High-quality and Controllable Charting AI for Rhythm Games, Modifed from Stable Diffusion

License:MITStargazers:0Issues:0Issues:0

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion

Language:PythonStargazers:0Issues:0Issues:0

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

License:MITStargazers:0Issues:0Issues:0

sinc

Official PyTorch implementation of the paper "SINC: Spatial Composition of 3D Human Motions for Simultaneous Action Generation"

Stargazers:0Issues:0Issues:0

so-vits-svc-1

SoftVC VITS Singing Voice Conversion

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

StableSR

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SVT_SpeechBrain

Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tango

Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

License:NOASSERTIONStargazers:0Issues:0Issues:0

tunesformer

TunesFormer: Forming Tunes with Control Codes

License:MITStargazers:0Issues:0Issues:0

vid2avatar

Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)

License:NOASSERTIONStargazers:0Issues:0Issues:0

video2midi

youtube synthesia video to midi

License:GPL-3.0Stargazers:0Issues:0Issues:0

Waveformer

An efficient architecture for real-time target sound extraction.

License:MITStargazers:0Issues:0Issues:0