ArEnSc's starred repositories
facefusion
Industry-leading face manipulation platform
fuzzywuzzy
Fuzzy String Matching in Python
instaloader
Download pictures (or videos) along with their captions and other metadata from Instagram.
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
LLamaSharp
A C#/.NET library to run LLMs (🦙LLaMA/LLaVA) on your local device efficiently.
ChainForge
An open-source visual programming environment for battle-testing prompts to LLMs.
llm-reasoners
A library for advanced large language model reasoning
bevy-inspector-egui
Inspector plugin for the bevy game engine
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
aphrodite-engine
Large-scale LLM inference engine
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
python-osc
Open Sound Control server and client in pure python
finetuned-qlora-falcon7b-medical
Fine-tuning of the Falcon-7B LLM using QLoRA on a mental health conversational dataset
CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
localLLM_guidance
Local LLM ReAct Agent with Guidance
Multi-Model-RVC-Inference
RVC inference with support for multiple models and Hugging Face
QuIP-for-Llama
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models
LFAI-LLM
This GitHub repository hosts an LSTM-based, GPT-like embedding neural network. The network is designed to fuse diverse data modalities such as images, audio, sensor inputs, and text, with the stated goal of a holistic, human-like sentient AI system that can comprehend and respond across multiple data formats.