aaronchen's repositories
ai-research
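[🔞🔞🔞 Contains images unsuitable for minors] AI explorations and summaries built around my strengths in programming, painting, and writing: StableDiffusion is a powerful image generation model that can generate new images by evolving an existing image. ChatGPT is a Transformer-based language generation model that can automatically generate a suitable article for a given topic. Github Copilot is an intelligent programming assistant that speeds up everyday programming work.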
ArtificialSongGenerator
The ArtificialSongGenerator automatically composes and compiles the Artificial Audio Multitrack dataset (AAM).
AudioLDM
Text-to-Audio Generation with Latent Diffusion Models
awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
awesome-chatgpt-prompts-zh
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you say.
Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
awesome-music
Awesome Music Projects
BigVGAN
Official implementation of BigVGAN in PyTorch
ChatGPT
Reverse engineered ChatGPT API
ControlLoRA
ControlLoRA: A Light Neural Network To Control Stable Diffusion Spatial Information
ControlNet
Let us control diffusion models
e4t-diffusion
Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
fish-diffusion
An easy to understand TTS / SVS / SVC framework
iamusica_training
PyTorch software to train and evaluate the ONSETS&VELOCITIES piano model, as presented in our paper: "Onsets and Velocities: Affordable Real-Time Piano Transcription Using Convolutional Neural Networks"
imogen
ultimate vocal harmonizer
INSTA-pytorch
INSTA - Instant Volumetric Head Avatars [CVPR2023]
InternVideo
InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)
ISC21-Descriptor-Track-1st
The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.
Mesh2HRTF
Open software for the numerical calculation of head-related transfer functions
MetaPortrait
[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
naturalspeech
A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)
open-musiclm
Implementation of MusicLM, a new text to music model published by Google, with a few modifications.
polymath
Convert any music library into a music production sample-library with ML
researchgpt
An open-source LLM based research assistant that allows you to have a conversation with a research paper
StyleHEAT
[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation
T2M-GPT
(CVPR 2023) PyTorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
Tune-A-Video
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
vall-e
Zero-Shot Text-To-Speech
VCSL
Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]