xwjabc

followers

following

stars

https://weijianxu.com

Weijian Xu's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLCC0-1.0107376 13920

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.045686 303 658

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptApache-2.017566 173 2116

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.014248 116 375

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language:TypeScriptMIT11899 183 4000

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonApache-2.02376 21 361

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language:Jupyter NotebookNOASSERTION1638 88 46

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonNOASSERTION1123 13 24

self-critical.pytorch

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Language:PythonMIT989 20 279

refer

Referring Expression Datasets API

Language:Jupyter NotebookApache-2.0429 7 19

BARTScore

BARTScore: Evaluating Generated Text as Text Generation

Language:PythonApache-2.0309 7 44

GRiT

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Language:PythonMIT286 2 18

Stable-Pix2Seq

A full-fledged version of Pix2Seq

Language:PythonApache-2.0235 7 19

infinibatch

Efficient, check-pointed data loading for deep learning with massive data sets.

Language:PythonMIT201 36 11

clipscore

CLIPScore EMNLP code

Language:PythonMIT176 2 14

coco-cn

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

Language:OpenEdge ABLMIT174 5 34

diht

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Language:PythonNOASSERTION127 21 5

polygon-transformer

Language:PythonApache-2.0117 11 27

image-paragraph-captioning

[EMNLP 2018] Training for Diversity in Image Paragraph Captioning

Language:Python91 6 19

FactualSceneGraph

FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.

Language:Python90 9 6

mBLIP

Language:PythonMIT84 2 14

captionGAN

Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"

Language:PythonMIT65 5 5

video-paragraph

Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021

Language:PythonMIT64 4 15

Pretrained-Pix2Seq

Replication of Pix2Seq with Pretrained Model

Language:Python61 6 13

batch-face

Batch Face Processing for Modern Research, including face detection, face alignment, face reconstruction, head pose estimation

Language:PythonMIT54 2 8

Hallucination

Language:Python51 5 1

iglue

[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"

Language:ShellMIT48 1 14

FuseCap

FuseCap: Large Language Model for Visual Data Fusion in Enriched Caption Generation

Language:PythonMIT45 7 8

clair

CLAIR: A (surprisingly) simple semantic text metric with large language models.

Language:PythonNOASSERTION12 3 3

VTCM-based-image-paragraph-caption

Language:Python7 1 2