Shengqiong Wu (ChocoWu)

Company: National University of Singapore

Home Page: https://chocowu.github.io/

Shengqiong Wu's starred repositories

scene_graph_commonsense

This is the official implementation of the paper "Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge" in PyTorch.

Language: Jupyter Notebook | License: MIT | Stargazers: 15 | Issues: 0

Awesome-Scene-Graph-for-CrossModal-Learning

This is a repository for listing papers on scene graph generation and application.

Stargazers: 4 | Issues: 0

VSD

Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"

Language: Python | Stargazers: 24 | Issues: 0

SymbCoT

Codes and Data for ACL 2024 Paper "Faithful Logical Reasoning via Symbolic Chain-of-Thought".

Language: Python | License: MIT | Stargazers: 126 | Issues: 0

path2generalist.github.io

Path to Multimodal Generalist: Level, Benchmark and Model

Language: HTML | Stargazers: 2 | Issues: 0

EmpathyEar

Multimodal Empathetic Chatbot

Language: Python | Stargazers: 11 | Issues: 0

SEED-X

Multimodal Models in Real World

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 339 | Issues: 0

HiKER-SGG

Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation (CVPR 2024)

Language: Python | License: MIT | Stargazers: 42 | Issues: 0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 3086 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 133 | Issues: 0

torch_kmeans

PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.

Language: Python | License: MIT | Stargazers: 47 | Issues: 0

Vitron

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Language: Python | Stargazers: 258 | Issues: 0

NExT-Chat

The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Language: Python | License: Apache-2.0 | Stargazers: 184 | Issues: 0

DiaASQ

ACL 2023 (Findings): DiaASQ: A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis

Language: Python | License: MIT | Stargazers: 48 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 6127 | Issues: 0

NExT-QA

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Language: Python | License: MIT | Stargazers: 111 | Issues: 0

GLIGEN

Open-Set Grounded Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 1921 | Issues: 0

FactualSceneGraph

FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.

Language: Python | Stargazers: 90 | Issues: 0

Language: Python | Stargazers: 30 | Issues: 0

DisNER-PtrNet

Codes for the AAAI 2021 paper Rethinking Boundaries: End-To-End Recognition of Discontinuous Mentions with Pointer Networks.

License: GPL-3.0 | Stargazers: 1 | Issues: 0

THOR-ISA

Codes for ACL 2023 paper: Reasoning Implicit Sentiment with Chain-of-Thought Prompting

Stargazers: 1 | Issues: 0

StruMatchDL

Codes for ICML 2022 paper: Matching Structure for Dual Learning

License: Apache-2.0 | Stargazers: 1 | Issues: 0

Language: JavaScript | Stargazers: 1 | Issues: 0

DiaRE-D2G

Codes of the IJCAI 2022 Paper Global Inference with Explicit Syntactic and Discourse Structures for Dialogue-Level Relation Extraction

License: MIT | Stargazers: 1 | Issues: 0

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language: Python | License: MIT | Stargazers: 886 | Issues: 0

APTM

The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"

Language: Python | License: MIT | Stargazers: 125 | Issues: 0

NExT-GPT.github.io

NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: HTML | Stargazers: 16 | Issues: 0

PENET

[CVPR 2023] Official PyTorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"

Language: Jupyter Notebook | License: MIT | Stargazers: 42 | Issues: 0

MRE-ISE

Codes for ACL 2023 paper: Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.

Language: Python | Stargazers: 14 | Issues: 0