Anas Awadalla (anas-awadalla)

Location: Seattle, Washington

Home Page: https://anas-awadalla.streamlit.app

Twitter: @anas_awadalla

Anas Awadalla's starred repositories

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | License: BSD-2-Clause | Stargazers: 10611 | Issues: 127 | Issues: 669

reactpy

It's React, but in Python

Language: Python | License: MIT | Stargazers: 7805 | Issues: 59 | Issues: 366

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language: JavaScript | License: MIT | Stargazers: 5442 | Issues: 63 | Issues: 144

Bard-API

The unofficial Python package that returns the response of Google Bard through a cookie value.

Language: Python | License: MIT | Stargazers: 5344 | Issues: 47 | Issues: 204

grobid

Machine learning software for extracting information from scholarly documents

Language: Java | License: Apache-2.0 | Stargazers: 3375 | Issues: 96 | Issues: 857

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 1720 | Issues: 11 | Issues: 135

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

LLM-Training-Puzzles

What would you do with 1000 H100s...

Language: Jupyter Notebook | License: MIT | Stargazers: 810 | Issues: 11 | Issues: 3

visprog

Official code for VisProg (CVPR 2023 Best Paper!)

Language: Python | License: Apache-2.0 | Stargazers: 679 | Issues: 15 | Issues: 17

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language: Python | License: NOASSERTION | Stargazers: 553 | Issues: 15 | Issues: 49

LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

Awesome-Multimodal-LLM

Research Trends in LLM-guided Multimodal Learning.

minimal-text-diffusion

A minimal implementation of diffusion models for text generation

Language: Python | License: MIT | Stargazers: 295 | Issues: 8 | Issues: 19

pylatexenc

Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion

Language: Python | License: MIT | Stargazers: 292 | Issues: 6 | Issues: 82

TexSoup

Fault-tolerant Python 3 package for searching, navigating, and modifying LaTeX documents

Language: Python | License: BSD-2-Clause | Stargazers: 276 | Issues: 9 | Issues: 107

grobid_client_python

Python client for GROBID Web services

Language: Python | License: Apache-2.0 | Stargazers: 276 | Issues: 6 | Issues: 54

LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Language: Python | License: BSD-3-Clause | Stargazers: 242 | Issues: 11 | Issues: 22

MM-Vet

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Language: Python | License: Apache-2.0 | Stargazers: 240 | Issues: 2 | Issues: 7

OBELICS

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Language: Python | License: Apache-2.0 | Stargazers: 179 | Issues: 8 | Issues: 12

ParroT

The ParroT framework to enhance and regulate translation abilities during chat, based on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.

Visual-Instruction-Tuning

SVIT: Scaling up Visual Instruction Tuning

Language: Python | License: MIT | Stargazers: 159 | Issues: 5 | Issues: 15

TheoremQA

The dataset and code for the paper "TheoremQA: A Theorem-driven Question Answering dataset"

Language: Python | License: MIT | Stargazers: 153 | Issues: 5 | Issues: 2

MultiInstruct

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

Language: Python | License: Apache-2.0 | Stargazers: 130 | Issues: 7 | Issues: 4

VL-CheckList

Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.

BYTESIZED32

Byte-sized text games for code generation tasks on virtual environments

Language: Python | License: Apache-2.0 | Stargazers: 17 | Issues: 8 | Issues: 2

PDF-to-Image-Cluster

This project automates downloading and processing large datasets from the web. It scrapes and downloads .snappy.parquet files, converts them to CSV, extracts URLs, downloads the associated PDFs, runs OCR on the PDFs to extract text and bounding boxes, and finally organizes and archives the data.

Language: Python | Stargazers: 4 | Issues: 1 | Issues: 0