SakuraEntropia's repositories
assimp
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.
audio-slicer
A simple GUI application that slices audio with silence detection
BEMsim3D
Implementation of a full-wave reference simulator for computing surface reflectance.
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
danbooru
A taggable image board written in Rails.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
examples
Deep Learning Examples
FastSAM
Fast Segment Anything
flash-attention
Fast and memory-efficient exact attention
FreeDrag
Official Implementation of FreeDrag
IJCAI2023-CoNR
IJCAI2023 - Collaborative Neural Rendering using Anime Character Sheets
infinigen
Infinite Photorealistic Worlds using Procedural Generation
LLFF
Code release for Local Light Field Fusion at SIGGRAPH 2019
LOMO
LOMO: LOw-Memory Optimization
Magic123
Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
NSF-BigVGAN
BigVGAN with Neural Source-Filter
OpenSTM
A Scanning Tunneling Microscope Project
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
PythonPark
Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。
qiskit-metal
Quantum Hardware Design. Open-source project for engineers and scientists to design superconducting quantum devices with ease.
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
stable-diffusion
A latent text-to-image diffusion model
T5-Textual-Inversion
Textual Inversion for DeepFloyd IF
threestudio
A unified framework for 3D content generation.
TigerBot
TigerBot: A multi-language multi-task LLM
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
X-SING
vi-singer & vits-svc & nsf-bigvgan