Seyyed Hossein Hasanpour's repositories
AudioStyleNet
This repository contains the code for my master thesis on Emotion-Aware Facial Animation
bearsnacks
:robot: Jupyter Notebooks on robotics, computer vision, python and more
clifs
Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine-learning model CLIP
CLIP
Contrastive Language-Image Pretraining
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
gnss-ins-sim
Open-source GNSS + inertial navigation, sensor fusion simulator. Motion trajectory generator, sensor models, and navigation
google-map-downloader
Small tools to download Google maps satellite image.
imaginAIry
AI imagined images. Pythonic generation of stable diffusion images.
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
invisible-watermark
python library for invisible image watermark (blind image watermark)
PCA-Knowledge-Distillation
PCA-based knowledge distillation towards lightweight and content-style balanced photorealistic style transfer models
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
ronin
RoNIN: Robust Neural Inertial Navigation in the Wild
rpg_trajectory_evaluation
Toolbox for quantitative trajectory evaluation of VO/VIO
S2ML-Generators
Multiple notebooks which allow the use of various machine learning methods to generate or modify multimedia content
slambook2
edition 2 of the slambook
stylegan3
Official PyTorch implementation of StyleGAN3
SuperGluePretrainedNetwork
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Thin-Plate-Spline-Motion-Model
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
tlgan
Time-Lapse Disentanglement With Conditional GANs [SIGGRAPH 2022]
VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.