Jia Guo (nttstar)

nttstar

Geek Repo

Location:China, Shanghai

Github PK Tool:Github PK Tool

Jia Guo's starred repositories

TTS

๐Ÿธ๐Ÿ’ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32353Issues:272Issues:1071

gradio

Build and share delightful machine learning apps, all in Python. ๐ŸŒŸ Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:31265Issues:167Issues:4547

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29992Issues:190Issues:990

discord.py

An API wrapper for Discord written in Python.

Language:PythonLicense:MITStargazers:14590Issues:262Issues:2917

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

pytorch-cnn-visualizations

Pytorch implementation of convolutional neural network visualization techniques

Language:PythonLicense:MITStargazers:7781Issues:114Issues:106

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookLicense:MITStargazers:7535Issues:92Issues:146

hosts

GitHubๆœ€ๆ–ฐhostsใ€‚่งฃๅ†ณGitHubๅ›พ็‰‡ๆ— ๆณ•ๆ˜พ็คบ๏ผŒๅŠ ้€ŸGitHub็ฝ‘้กตๆต่งˆใ€‚

Language:TypeScriptLicense:MITStargazers:4800Issues:77Issues:41

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4717Issues:60Issues:359

SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Language:PythonLicense:NOASSERTIONStargazers:4034Issues:70Issues:125

VAR

[GPT beats diffusion๐Ÿ”ฅ] [scaling laws in visual generation๐Ÿ“ˆ] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3893Issues:114Issues:73

T2I-Adapter

T2I-Adapter

Language:PythonLicense:Apache-2.0Stargazers:3335Issues:40Issues:107

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookLicense:MITStargazers:3184Issues:39Issues:107

latex_paper_writing_tips

Tips for Writing a Research Paper using LaTeX

Language:TeXStargazers:2337Issues:23Issues:0

infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

Language:C++License:Apache-2.0Stargazers:2185Issues:25Issues:307

envd

๐Ÿ•๏ธ Reproducible development environment

Language:GoLicense:Apache-2.0Stargazers:1934Issues:22Issues:529

OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.

Language:PythonLicense:AGPL-3.0Stargazers:1467Issues:23Issues:241

MagicClothing

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Language:PythonLicense:NOASSERTIONStargazers:1249Issues:41Issues:87

Awesome-Talking-Face

๐Ÿ“– A curated list of resources dedicated to talking face.

diffusiondb

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Language:PythonLicense:MITStargazers:1161Issues:18Issues:15

anycost-gan

[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing

Language:PythonLicense:MITStargazers:779Issues:23Issues:30

LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Language:PythonLicense:NOASSERTIONStargazers:579Issues:28Issues:23

face-makeup.PyTorch

Lip and hair color editor using face parsing maps.

Language:PythonLicense:MITStargazers:489Issues:10Issues:13

piecewise-rectified-flow

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:375Issues:17Issues:9

Visual_Speech_Recognition_for_Multiple_Languages

Visual Speech Recognition for Multiple Languages

Language:PythonLicense:NOASSERTIONStargazers:309Issues:12Issues:22

Style-Your-Hair

Official Pytorch implementation of "Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment (ECCV 2022)"

unicom

[ICLR 2023] Unicom: Universal and Compact Representation Learning for Image Retrieval

streamlit-oauth

Simple OAuth Component for Streamlit App

Language:PythonLicense:MITStargazers:120Issues:1Issues:15

attribute-control

Fine-Grained Subject-Specific Attribute Expression Control in T2I Models

Language:Jupyter NotebookLicense:MITStargazers:101Issues:6Issues:4

Face-Robustness-Benchmark

An adversarial robustness evaluation library on face recognition.

Language:PythonLicense:Apache-2.0Stargazers:100Issues:4Issues:10