shimomura kei's repositories
AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
ComfyUI
The most powerful and modular Stable Diffusion GUI, API, and backend with a graph/nodes interface.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
doomemacs
An Emacs framework for the stubborn martian hacker
DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
eco-ci-energy-estimation
Eco CI Energy estimation for GitHub Actions Runner VMs
Fooocus
Focus on prompting and generating
gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
go
The Go programming language
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
grok-1
Grok open release
HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Low-Cost-Mocap
Low cost motion capture system for room scale tracking
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
PaddleNLP
Easy-to-use and powerful NLP and LLM library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including Text Classification, Neural Search, Question Answering, Information Extraction, Document Intelligence, Sentiment Analysis, etc.
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Scrapegraph-ai
Python scraper based on AI
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
stable-diffusion-webui
Stable Diffusion web UI
tenacity
Mirror of https://codeberg.org/tenacityteam/tenacity. Pull requests are IGNORED!
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
vampnet
music generation with masked transformers!
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection