ygfrancois

Guang YANG's starred repositories

llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo

Language:PythonNOASSERTION70800

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

162400

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

Apache-2.0316700

aesthetic-predictor-v2-5

SigLIP-based Aesthetic Score Predictor

Language:PythonAGPL-3.011400

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonMIT51500

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonApache-2.092600

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

MIT204400

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02128300

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.01072200

Fooocus

Focus on prompting and generating

Language:PythonGPL-3.03946300

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.04729400

automatic

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

Language:PythonAGPL-3.0541300

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonAGPL-3.013817400

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonMIT3808800

stable-diffusion-webui-wd14-tagger

Labeling extension for Automatic1111's Web UI

Language:Python130500

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.0263200

lac

百度NLP：分词，词性标注，命名实体识别，词重要性

Language:C++Apache-2.0382300

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python283800

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03279100

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT750500

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.0707800

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonMIT165400

generative-models

Generative Models by Stability AI

Language:PythonMIT2381800

LVM

Language:PythonApache-2.0172400

labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

Language:PythonMIT2236100