linnan wang's repositories
adetailer
Auto detecting, masking and inpainting with detection model.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
distill-sd
Segmind Distilled diffusion
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ebsynth
Fast Example-based Image Synthesis and Style Transfer
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
generative-models
Generative Models by Stability AI
geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
guidance
A guidance language for controlling large language models.
insightface
State-of-the-art 2D and 3D Face Analysis Project
JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
langflow
⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
neuralangelo
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
roop
one-click face swap
sam-hq
Segment Anything in High Quality
stable-diffusion
A latent text-to-image diffusion model
taichi
Productive & portable high-performance programming in Python.
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (ggml), Llama models.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Xwin-LM
Xwin-LM: a collection of LLM alignment technologies and models