elias_t's starred repositories
stable-diffusion-webui
Stable Diffusion web UI
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
dream-textures
Stable Diffusion built-in to Blender
roop-unleashed
Evolved Fork of roop with Web Server and lots of additions
audio-webui
A webui for different audio related Neural Networks
Latios-Framework
A Unity DOTS framework for my personal projects
MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance
Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
StyleGANEX
[ICCV 2023] StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
DL-Art-School
TorToiSe fine-tuning with DLAS
tecoGAN_app
tecoGAN Windows application ( EXE )
ControllableTalkNet
A web app that lets you play around with TalkNet models
Richard-roop
for those who wants some speed
novel_view_synthesis_3d
Implementation of "Novel view synthesis with Diffusion models" by Google in JAX distributed