Beast code in Giters

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Language:PythonApache-2.0179 8 12

ParroT

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

Language:Python166 2 10

Visual-Instruction-Tuning

SVIT: Scaling up Visual Instruction Tuning

Language:PythonMIT159 5 15

TheoremQA

The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset

Language:PythonMIT153 5 2

MultiInstruct

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

Language:PythonApache-2.0130 7 4

VL-CheckList

Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.

Language:Python124 6 11

BYTESIZED32

Byte-sized text games for code generation tasks on virtual environments

Language:PythonApache-2.017 8 2

This project is designed to automate the process of downloading and processing large datasets from the web: specifically, it scrapes and downloads .snappy.parquet files, converts them to CSV, extracts URLs, downloads associated PDFs, performs OCR on the PDFs to extract text and bounding boxes, and finally organizes and archives the data.

Language:Python4 10

anas-awadalla

Anas Awadalla's starred repositories

Awesome-Multimodal-Large-Language-Models

whisperX

reactpy

Voyager

Bard-API

grobid

LISA

awesome-grounding

LLM-Training-Puzzles

visprog

SEED

LLM-groundedDiffusion

med-flamingo

Awesome-Multimodal-LLM

minimal-text-diffusion

pylatexenc

TexSoup

grobid_client_python

LRV-Instruction

MM-Vet

OBELICS

ParroT

composed_image_retrieval

Visual-Instruction-Tuning

TheoremQA

MultiInstruct

VL-CheckList

few-shot-clustering

BYTESIZED32

PDF-to-Image-Cluster