Seyyed Hossein Hasanpour's repositories
SimpleNet_Pytorch
SimpleNetV1 Implementation in Pytorch
ros-noetic-PX4-easy-installer
This is an easy installer for installing ROS noetic, PX4 (mavros), gazebo,... on ubuntu 20.04
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
chat2api
chatgpt接口模拟API接口转换网关
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
ControlNet
Let us control diffusion models
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
DragGAN
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
DragGAN-1
Official Code for DragGAN (SIGGRAPH 2023)
faceswap
Deepfakes Software For All
flip
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
gtav-sourcecode-build-guide
GTA V Source Code Build Tutorial with an extra additions!
hub
Submission to https://pytorch.org/hub/
imaginAIry
AI imagined images. Pythonic generation of stable diffusion images.
InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
lama-cleaner
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
llama-chat
Chat with Meta's LLaMA models at home made easy
llama.cpp
Port of Facebook's LLaMA model in C/C++
Mangio-RVC-Fork
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other f0 methods, along with a hybrid f0 nanmedian method.
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
so-vits-svc-fork
so-vits-svc fork with REALTIME support (voice changer) and greatly improved interface.
spleeter
Deezer source separation library including pretrained models.
warp-yxip
warp endpoint scanner