Chen Wu (吴尘) (ChenWu98)

ChenWu98

Geek Repo

Company:Carnegie Mellon University

Location:Pittsburgh, PA

Home Page:https://chenwu.io/

Github PK Tool:Github PK Tool


Organizations
HKUNLP

Chen Wu (吴尘)'s repositories

cycle-diffusion

[ICCV 2023] A latent space for stochastic diffusion models

Language:PythonLicense:NOASSERTIONStargazers:524Issues:14Issues:31

unified-generative-zoo

[ICCV 2023] https://arxiv.org/abs/2210.05559

Language:PythonLicense:NOASSERTIONStargazers:118Issues:9Issues:1

generative-visual-prompt

[NeurIPS 2022] (Amortized) distributional control for pre-trained generative models

Language:PythonLicense:NOASSERTIONStargazers:114Issues:1Issues:0

Point-Then-Operate

Code for the ACL 2019 paper ``A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer``

Language:PythonLicense:Apache-2.0Stargazers:45Issues:3Issues:1

cliport-batchify

A batched version of CLIPort: What and Where Pathways for Robotic Manipulation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5Issues:0Issues:0

Coupled-VAE

Code for the ACL 2020 paper ``On the Encoder-Decoder Incompatibility in Variational Text Modeling and Beyond``

Language:PythonStargazers:5Issues:1Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0