Sumanth Reddy Kaliki's starred repositories
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
recommenders
Best Practices on Recommendation Systems
supervision
We write your reusable computer vision tools. 💜
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
techniques
Techniques for deep learning with satellite & aerial imagery
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
sd-webui-roop
roop extension for StableDiffusion web-ui
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
awesome-notebooks
A powerful data & AI notebook templates catalog: prompts, plugins, models, workflow automation, analytics, code snippets - following the IMO framework to be searchable and reusable in any context.
pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
Awesome-Edge-Detection-Papers
:books: A collection of edge/contour/boundary detection papers and toolbox.
sd-webui-deoldify
DeOldify for Stable Diffusion WebUI:This is an extension for StableDiffusion's AUTOMATIC1111 web-ui that allows colorize of old photos and old video. It is based on deoldify.
ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
SVCC23_FastSVC
Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation