LiuZhuang (liuzhuang1024)

Company: TJNU

Location: TJNU

LiuZhuang's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 47874 | Issues: 521 | Issues: 187

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 10093 | Issues: 149 | Issues: 142

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0 | Stargazers: 8758 | Issues: 98 | Issues: 303

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 7927 | Issues: 78 | Issues: 27

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language: Python | License: GPL-3.0 | Stargazers: 7841 | Issues: 52 | Issues: 340
Language: Python | License: Apache-2.0 | Stargazers: 6814 | Issues: 66 | Issues: 61

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 3676 | Issues: 44 | Issues: 339

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language: Python | License: Apache-2.0 | Stargazers: 3388 | Issues: 29 | Issues: 78

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1103 | Issues: 19 | Issues: 30

minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Language: Python | License: Apache-2.0 | Stargazers: 1008 | Issues: 14 | Issues: 60

BrushNet

The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language: Python | License: NOASSERTION | Stargazers: 851 | Issues: 46 | Issues: 20

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language: Python | License: AGPL-3.0 | Stargazers: 766 | Issues: 31 | Issues: 54

SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Language: Python | License: MIT | Stargazers: 489 | Issues: 10 | Issues: 13

cord

CORD: A Consolidated Receipt Dataset for Post-OCR Parsing

bgpt

Beyond Language Models: Byte Models are Digital World Simulators

Language: Python | License: MIT | Stargazers: 257 | Issues: 6 | Issues: 1

MARCONet

Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]

Language: Python | License: NOASSERTION | Stargazers: 167 | Issues: 4 | Issues: 20

DocDiff

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Language: Python | License: MIT | Stargazers: 167 | Issues: 4 | Issues: 23

FogRemoval

[ACCV22] Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal, https://arxiv.org/abs/2210.03061

Emote-hack

Using ChatGPT (now Claude 3) to reverse-engineer code from the Emote white paper. (Abandoned)

NTIRE23-RTSR

CVPR NTIRE 2023 Challenge on Real-Time Super-Resolution

Language: JavaScript | License: Apache-2.0 | Stargazers: 104 | Issues: 8 | Issues: 6

ChartAst

ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.

Language: Python | License: NOASSERTION | Stargazers: 54 | Issues: 6 | Issues: 13

SRFormer-Text-Det

[AAAI'24] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

LAST

Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition

Language: Python | License: GPL-3.0 | Stargazers: 21 | Issues: 4 | Issues: 4

2024-TIP-CREAM

PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 0 | Issues: 0

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0

MyAutoBuildActions

A Python script to automatically create a clone of your React Native application and auto-replace based on given regexes. - A Python script to get a list of all open issues in a repository with specific labels, and fetch their corresponding bodies and comments in chronological order (oldest to newest).

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 0 | Issues: 0