ccyzf

ccyzf

Geek Repo

Github PK Tool:Github PK Tool

ccyzf's starred repositories

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8883Issues:0Issues:0

AD-NeRF

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Language:PythonLicense:MITStargazers:1009Issues:0Issues:0
Language:PythonLicense:MITStargazers:341Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:15244Issues:0Issues:0
Language:PythonLicense:MITStargazers:531Issues:0Issues:0
Language:PythonStargazers:155Issues:0Issues:0

IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Language:PythonLicense:Apache-2.0Stargazers:637Issues:0Issues:0

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language:PythonLicense:MITStargazers:915Issues:0Issues:0

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1304Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29697Issues:0Issues:0

TemporalKit

An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension

Language:PythonLicense:GPL-3.0Stargazers:1895Issues:0Issues:0

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers:7254Issues:0Issues:0

DFDNet

Blind Face Restoration via Deep Multi-scale Component Dictionaries (ECCV 2020)

Language:PythonStargazers:911Issues:0Issues:0

DeOldify

A Deep Learning based project for colorizing and restoring old images (and video!)

Language:PythonLicense:MITStargazers:17804Issues:0Issues:0

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language:PythonLicense:GPL-3.0Stargazers:4797Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Language:PythonLicense:MITStargazers:11107Issues:0Issues:0

DISTS

IQA: Deep Image Structure and Texture Similarity Metric

Language:PythonLicense:MITStargazers:363Issues:0Issues:0

HDR-VQM

HDR-VQM: An objective quality measure for high dynamic range video

Language:MATLABLicense:MITStargazers:5Issues:0Issues:0

gigagan-pytorch

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Language:PythonLicense:MITStargazers:1723Issues:0Issues:0

BasicVSR_PlusPlus

Official repository of "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment"

Language:PythonLicense:Apache-2.0Stargazers:574Issues:0Issues:0

BasicVSR_PlusPlus

Official repository of "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment"

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

VQFR

ECCV 2022, Oral, VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Language:PythonLicense:NOASSERTIONStargazers:314Issues:0Issues:0

VRT

VRT: A Video Restoration Transformer (official repository)

Language:PythonLicense:NOASSERTIONStargazers:1315Issues:0Issues:0
Language:PythonStargazers:5571Issues:0Issues:0

nex-code

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Language:PythonLicense:MITStargazers:591Issues:0Issues:0

NAFNet

The state-of-the-art image restoration model without nonlinear activation functions.

Language:PythonLicense:NOASSERTIONStargazers:2099Issues:0Issues:0

Final2x

2^x Image Super-Resolution

Language:TypeScriptLicense:BSD-3-ClauseStargazers:5420Issues:0Issues:0

RealBasicVSR

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Language:PythonLicense:Apache-2.0Stargazers:885Issues:0Issues:0

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2730Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11305Issues:0Issues:0