Weijian Xu's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language: HTML | License: CC0-1.0 | Stargazers: 107376 | Issues: 1392 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 45686 | Issues: 303 | Issues: 658

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript | License: Apache-2.0 | Stargazers: 17566 | Issues: 173 | Issues: 2116

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14248 | Issues: 116 | Issues: 375

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language: TypeScript | License: MIT | Stargazers: 11899 | Issues: 183 | Issues: 4000

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language: Python | License: Apache-2.0 | Stargazers: 2376 | Issues: 21 | Issues: 361

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1638 | Issues: 88 | Issues: 46

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language: Python | License: NOASSERTION | Stargazers: 1123 | Issues: 13 | Issues: 24

self-critical.pytorch

Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.

Language: Python | License: MIT | Stargazers: 989 | Issues: 20 | Issues: 279

refer

Referring Expression Datasets API

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 429 | Issues: 7 | Issues: 19

BARTScore

BARTScore: Evaluating Generated Text as Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 309 | Issues: 7 | Issues: 44

GRiT

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Language: Python | License: MIT | Stargazers: 286 | Issues: 2 | Issues: 18

Stable-Pix2Seq

A full-fledged version of Pix2Seq

Language: Python | License: Apache-2.0 | Stargazers: 235 | Issues: 7 | Issues: 19

infinibatch

Efficient, check-pointed data loading for deep learning with massive data sets.

Language: Python | License: MIT | Stargazers: 201 | Issues: 36 | Issues: 11

clipscore

CLIPScore EMNLP code

Language: Python | License: MIT | Stargazers: 176 | Issues: 2 | Issues: 14

coco-cn

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

Language: OpenEdge ABL | License: MIT | Stargazers: 174 | Issues: 5 | Issues: 34

diht

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Language: Python | License: NOASSERTION | Stargazers: 127 | Issues: 21 | Issues: 5

image-paragraph-captioning

[EMNLP 2018] Training for Diversity in Image Paragraph Captioning

FactualSceneGraph

FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.

Language: Python | License: MIT | Stargazers: 84 | Issues: 2 | Issues: 14

captionGAN

Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"

Language: Python | License: MIT | Stargazers: 65 | Issues: 5 | Issues: 5

video-paragraph

Code for the paper "Towards Diverse Paragraph Captioning for Untrimmed Videos" (CVPR 2021)

Language: Python | License: MIT | Stargazers: 64 | Issues: 4 | Issues: 15

Pretrained-Pix2Seq

Replication of Pix2Seq with Pretrained Model

batch-face

Batch Face Processing for Modern Research, including face detection, face alignment, face reconstruction, head pose estimation

Language: Python | License: MIT | Stargazers: 54 | Issues: 2 | Issues: 8

iglue

[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"

Language: Shell | License: MIT | Stargazers: 48 | Issues: 1 | Issues: 14

FuseCap

FuseCap: Large Language Model for Visual Data Fusion in Enriched Caption Generation

Language: Python | License: MIT | Stargazers: 45 | Issues: 7 | Issues: 8

clair

CLAIR: A (surprisingly) simple semantic text metric with large language models.

Language: Python | License: NOASSERTION | Stargazers: 12 | Issues: 3 | Issues: 3