Naiyuan Liu's repositories
VGGFace2-HQ
A high-resolution face dataset for face editing
Ego4d_NLQ_2022_1st_Place_Solution
The 1st place solution for the 2022 Ego4D Natural Language Queries challenge.
NNNNAI.github.io
Naiyuan Liu's personal homepage
ZJU-Clock-In
Exploring how Zhejiang University's health check-in system works and strategies for countering it
ContrastiveSeg
Exploring Cross-Image Pixel Contrast for Semantic Segmentation
insightface
Face Analysis Project on MXNet and PyTorch
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more
ComfyUI
The most powerful and modular Stable Diffusion GUI, API, and backend with a graph/nodes interface.
Divide-and-Co-training
[Paper 2020] Towards Better Accuracy-efficiency Trade-offs: Divide and Co-training. Also an image classification toolbox that includes ResNet, Wide-ResNet, ResNeXt, ResNeSt, ResNeXSt, SENet, Shake-Shake, DenseNet, PyramidNet, and EfficientNet.
face-parsing.PyTorch
Using modified BiSeNet for face parsing in PyTorch
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
HeyGenClone
A simple and open-source analogue of the HeyGen system
mmdetection
OpenMMLab Detection Toolbox and Benchmark
PaddleVideo
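Built on a modular design, it provides a rich set of video algorithm implementations along with industrial-grade video algorithm optimizations and applications. These include action localization and recognition, behavior analysis, smart cover generation, video annotation, and video tagging for industries such as security, sports, internet, and media. It covers techniques such as action recognition and video classification, action localization, action detection, and multimodal text-video retrieval.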
so-vits-svc
SoftVC VITS Singing Voice Conversion
stable-diffusion-webui
Stable Diffusion web UI
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
video-long-term-feature-banks
Long-Term Feature Banks for Detailed Video Understanding
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech