Wenyi Hong (wenyihong)

Company: Tsinghua University

Wenyi Hong's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 130177 | Issues: 1117 | Issues: 15385

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language: HTML | License: MIT | Stargazers: 10548 | Issues: 265 | Issues: 45

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 7652 | Issues: 98 | Issues: 198

CogVLM

A state-of-the-art-level open visual language model | multimodal pretrained model

Language: Python | License: Apache-2.0 | Stargazers: 5734 | Issues: 66 | Issues: 412

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | Stargazers: 5612 | Issues: 77 | Issues: 215

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 4052 | Issues: 40 | Issues: 349

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language: Python | License: NOASSERTION | Stargazers: 3939 | Issues: 65 | Issues: 71

CogVideo

Text-to-video generation. The repo for the ICLR 2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers".

Language: Python | License: Apache-2.0 | Stargazers: 3576 | Issues: 103 | Issues: 38

frame-interpolation

FILM: Frame Interpolation for Large Motion, in ECCV 2022.

Language: Python | License: Apache-2.0 | Stargazers: 2773 | Issues: 42 | Issues: 0

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

awesome-human-pose-estimation

A collection of awesome resources in Human Pose estimation.

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python | License: BSD-3-Clause | Stargazers: 2141 | Issues: 22 | Issues: 310

CogView

Text-to-image generation. The repo for the NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language: Python | License: Apache-2.0 | Stargazers: 1649 | Issues: 56 | Issues: 63

ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Language: Python | License: Apache-2.0 | Stargazers: 1054 | Issues: 14 | Issues: 80

Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022.

Language: Python | License: MIT | Stargazers: 1027 | Issues: 16 | Issues: 75

SwissArmyTransformer

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Language: Python | License: Apache-2.0 | Stargazers: 876 | Issues: 30 | Issues: 73

Language: Python | License: NOASSERTION | Stargazers: 714 | Issues: 8 | Issues: 64

cycle-diffusion

[ICCV 2023] A latent space for stochastic diffusion models

Language: Python | License: NOASSERTION | Stargazers: 543 | Issues: 14 | Issues: 31

RelayDiffusion

The official implementation of "Relay Diffusion: Unifying Diffusion Process Across Resolutions for Image Synthesis" [ICLR 2024 Spotlight]

Language: Python | License: Apache-2.0 | Stargazers: 250 | Issues: 11 | Issues: 9

ScreenAgent

ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)

Language: Python | License: NOASSERTION | Stargazers: 224 | Issues: 5 | Issues: 28

kinetics-datasets-downloader

Download DeepMind's Kinetics dataset.

Language: Jupyter Notebook | Stargazers: 19 | Issues: 3 | Issues: 0

Language: Python | Stargazers: 11 | Issues: 5 | Issues: 0

Language: Python | Stargazers: 3 | Issues: 2 | Issues: 0