ChenWu98

Chen Wu's repositories

[ICCV 2023] A latent space for stochastic diffusion models

Language: Python | License: NOASSERTION | 644 11 35

[ICCV 2023] https://arxiv.org/abs/2210.05559

Language: Python | License: NOASSERTION | 122 8 1

[NeurIPS 2022] (Amortized) distributional control for pre-trained generative models

Language: Python | License: NOASSERTION | 121 10

[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents

Language: Python | License: MIT | 108 3 1

[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Language: Python | 7200

Code for the ACL 2019 paper "A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer"

Language: Python | License: Apache-2.0 | 45 3 1

A batched version of CLIPort: What and Where Pathways for Robotic Manipulation

Language: Jupyter Notebook | License: Apache-2.0 | 500

Code for the ACL 2020 paper "On the Encoder-Decoder Incompatibility in Variational Text Modeling and Beyond"

Language: Python | 5 10

VisualWebArena is a benchmark for multimodal agents.

Language: HTML | License: MIT | 200

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python | License: MIT | 100

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | 000

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language: Python | License: NOASSERTION | 000

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | 000

Language: Python | 000

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Language: Python | License: MIT | 000

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language: Python | License: Apache-2.0 | 000