Pol Vila's starred repositories
awesome-ai-music
A curated list of awesome AI tools for music composition, generation, enhancement, and more.
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
riffusion-manipulation
tools to manipulate audio with riffusion
ComfyUI-StableAudioSampler
The New Stable Diffusion Audio Sampler 1.0 In a ComfyUI Node. Make some beats!
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
metastable
🖼️ A project-based Stable Diffusion Web UI, for easier organization of generated images. Work in progress.
ToonCrafter
a research paper for generative cartoon interpolation
cog-consistent-character
Create images of a given character in different poses
EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
docker-tinyproxy
Docker container for Tinyproxy
Mario-Builder-64
Mario Builder 64 is a Super Mario 64 ROM hack that allows you to create custom levels in-game.
xtts-api-server
A simple FastAPI Server to run XTTSv2
parler-tts
Inference and training library for high-quality TTS models.
CrewAI-Visualizer
Interactive user interface for CrewAI package.
freegenius
FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.
curl-impersonate
curl-impersonate: A special build of curl that can impersonate Chrome & Firefox
ComfyUI-PixelArt-Detector
Generate, downscale, change palletes and restore pixel art images with SDXL.
FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use.
steam-audio
Steam Audio
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild