Ahsen Khaliq's repositories
animegan2-pytorch
PyTorch implementation of AnimeGANv2
realworld-stylegan2-encoder
Various applications based on Stylegan2 Style mixing that can be inference on cpu.
steerable-nafx
Steerable discovery of neural audio effects
BlendGAN
Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
ConvNeXt
Code release for ConvNeXt model
Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
kapao
KAPAO is an efficient multi-person human pose estimation model that detects keypoints and poses as objects and fuses the detections to predict human poses.
kogpt
KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
lang-seg
Language-Driven Semantic Segmentation
mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
midi-ddsp
Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)
mt3
MT3: Multi-Task Multitrack Music Transcription
omnivore
Omnivore: A Single Model for Many Visual Modalities
omnizart
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.
PaddleSpeech
A Speech Toolkit based on PaddlePaddle.
pyxelate
Python class that generates pixel art from images
stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
SWAG
Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models".
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
UniFormer
[ICLR2022] official implementation of UniFormer
yolov3
YOLOv3 in PyTorch > ONNX > CoreML > TFLite