aaronchen's repositories

ai-research

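[🔞🔞🔞 Contains images unsuitable for minors] AI explorations and summaries built around my strengths in programming, painting, and writing: StableDiffusion is a powerful image generation model that can produce new images by evolving an existing one. ChatGPT is a Transformer-based language generation model that can automatically write a suitable article for a given topic. GitHub Copilot is an intelligent programming assistant that speeds up everyday coding.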

Stargazers: 0 · Issues: 0

ArtificialSongGenerator

The ArtificialSongGenerator automatically composes and compiles the Artificial Audio Multitrack (AAM) dataset.

Stargazers: 0 · Issues: 0

AudioLDM

Text-to-Audio Generation with Latent Diffusion Models

License: NOASSERTION · Stargazers: 0 · Issues: 0

awesome-chatgpt-prompts

A curated collection of prompts for using ChatGPT more effectively.

License: CC0-1.0 · Stargazers: 0 · Issues: 0

awesome-chatgpt-prompts-zh

A Chinese-language guide to prompting ChatGPT, with usage guides for a variety of scenarios. Learn how to get it to do what you ask.

License: MIT · Stargazers: 0 · Issues: 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 · Stargazers: 0 · Issues: 0

awesome-music

Awesome Music Projects

Stargazers: 0 · Issues: 0

BigVGAN

Official implementation of BigVGAN in PyTorch

Stargazers: 0 · Issues: 0

ChatGPT

Reverse engineered ChatGPT API

License: GPL-2.0 · Stargazers: 0 · Issues: 0

ControlLoRA

ControlLoRA: A Light Neural Network To Control Stable Diffusion Spatial Information

License: Apache-2.0 · Stargazers: 0 · Issues: 0

ControlNet

Let us control diffusion models

License: Apache-2.0 · Stargazers: 0 · Issues: 0

e4t-diffusion

Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models

License: MIT · Stargazers: 0 · Issues: 0
License: Apache-2.0 · Stargazers: 0 · Issues: 0

fish-diffusion

An easy-to-understand TTS / SVS / SVC framework

License: Apache-2.0 · Stargazers: 0 · Issues: 0

iamusica_training

PyTorch software to train and evaluate the ONSETS&VELOCITIES piano model, as presented in our paper: "Onsets and Velocities: Affordable Real-Time Piano Transcription Using Convolutional Neural Networks"

License: NOASSERTION · Stargazers: 0 · Issues: 0

imogen

ultimate vocal harmonizer

License: MIT · Stargazers: 0 · Issues: 0

INSTA-pytorch

INSTA - Instant Volumetric Head Avatars [CVPR2023]

License: NOASSERTION · Stargazers: 0 · Issues: 0

InternVideo

InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)

License: Apache-2.0 · Stargazers: 0 · Issues: 0

ISC21-Descriptor-Track-1st

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

License: MIT · Stargazers: 0 · Issues: 0

Mesh2HRTF

Open software for the numerical calculation of head-related transfer functions

License: EUPL-1.2 · Stargazers: 0 · Issues: 0

MetaPortrait

[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation

Stargazers: 0 · Issues: 0

naturalspeech

A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)

Stargazers: 0 · Issues: 0

open-musiclm

Implementation of MusicLM, a text-to-music model published by Google, with a few modifications.

License: MIT · Stargazers: 0 · Issues: 0

polymath

Convert any music library into a music production sample library with ML

License: MIT · Stargazers: 0 · Issues: 0

researchgpt

An open-source, LLM-based research assistant that lets you have a conversation with a research paper

License: MIT · Stargazers: 0 · Issues: 0

StyleHEAT

[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation

License: MIT · Stargazers: 0 · Issues: 0

T2M-GPT

(CVPR 2023) PyTorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”

License: Apache-2.0 · Stargazers: 0 · Issues: 0

Tune-A-Video

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Stargazers: 0 · Issues: 0

vall-e

Zero-Shot Text-To-Speech

License: Apache-2.0 · Stargazers: 0 · Issues: 0

VCSL

Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]

License: MIT · Stargazers: 0 · Issues: 0