Jie An (pkuanjie)

pkuanjie

Geek Repo

Company:University of Rochester

Location:Rochester, NY, US

Github PK Tool:Github PK Tool

Jie An's starred repositories

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23907Issues:191Issues:3759

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23216Issues:249Issues:277

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20206Issues:176Issues:353

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonLicense:MITStargazers:15988Issues:221Issues:156

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:11040Issues:96Issues:336

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLLicense:MITStargazers:10396Issues:268Issues:43

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonLicense:Apache-2.0Stargazers:4144Issues:49Issues:94

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonLicense:Apache-2.0Stargazers:3642Issues:35Issues:90

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonLicense:MITStargazers:3379Issues:57Issues:100

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2524Issues:46Issues:0

CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language:PythonLicense:Apache-2.0Stargazers:1628Issues:56Issues:63

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1560Issues:21Issues:85

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language:PythonLicense:AGPL-3.0Stargazers:1433Issues:38Issues:100

LAMA

LAnguage Model Analysis

Language:PythonLicense:NOASSERTIONStargazers:1323Issues:72Issues:48

video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Language:PythonLicense:MITStargazers:1185Issues:28Issues:34

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

LibFewShot

LibFewShot: A Comprehensive Library for Few-shot Learning. TPAMI 2023.

Language:PythonLicense:MITStargazers:867Issues:25Issues:73

MotionDiffuse

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Language:PythonLicense:NOASSERTIONStargazers:807Issues:29Issues:33

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonLicense:MITStargazers:693Issues:9Issues:58

xlnet-Pytorch

Simple XLNet implementation with Pytorch Wrapper

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:574Issues:15Issues:17
Language:PythonLicense:Apache-2.0Stargazers:535Issues:15Issues:16

remi

"Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions", ACM Multimedia 2020

Language:PythonLicense:GPL-3.0Stargazers:529Issues:14Issues:37

LVDM

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Language:PythonLicense:MITStargazers:422Issues:27Issues:21

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language:PythonLicense:GPL-3.0Stargazers:272Issues:8Issues:30

AlignProp

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

Language:PythonLicense:MITStargazers:203Issues:7Issues:13

ConsistI2V

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)

Language:PythonLicense:MITStargazers:173Issues:16Issues:18

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language:PythonLicense:MITStargazers:132Issues:7Issues:10