Roger Condori's repositories
SoniTranslate
Synchronized Translation for Videos. Video dubbing
InsightSolver-Colab
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
SD_diffusers_interactive
A widgets-based interactive notebook for SD
ConversaDocs
Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (LLM) technology.
gpt_sovits_python
Python wrapper for fast inference with GPT-SoVITS
generative_agents_llama
Generative Agents: Interactive Simulacra of Human Behavior
Riffusion_audio_to_audio_style_transfer
Riffusion is a project that enables audio style transfer using pre-trained models. This repository contains the code and resources needed to perform audio style transfer and generate impressive results.
asdff
adetailer for Diffusers
awesome-spectral-indices
A ready-to-use curated list of Spectral Indices for Remote Sensing applications.
demo-container
MLOps packaging: build and push to GitHub Container Registry
Image-Captioning-Tool
Select elements within an image and generate captions for those elements
inpaint_anything_colab
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ModelAtlas
This repository serves as a collection of various models
openvoice_package
Instant voice cloning by MyShell.
piper-phonemize
C++ library for converting text to phonemes for Piper
portfolio1
Portfolio
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
text-generation-webui-colab
A colab gradio web UI for running Large Language Models
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)