Anas Awadalla (anas-awadalla)

anas-awadalla

Geek Repo

Location:Seattle, Washington

Home Page:https://anas-awadalla.streamlit.app

Twitter:@anas_awadalla

Github PK Tool:Github PK Tool

Anas Awadalla's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:25357Issues:218Issues:4096

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:25035Issues:277Issues:77

mlx

MLX: An array framework for Apple silicon

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:15874Issues:68Issues:203

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14582Issues:114Issues:382

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:13083Issues:96Issues:357

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10522Issues:109Issues:19

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:9642Issues:78Issues:121

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Language:PythonLicense:Apache-2.0Stargazers:4741Issues:74Issues:147

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4373Issues:49Issues:286
Language:PythonLicense:Apache-2.0Stargazers:4016Issues:52Issues:114

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2217Issues:32Issues:7

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1881Issues:44Issues:107

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:1052Issues:42Issues:72

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Language:PythonLicense:MITStargazers:1016Issues:14Issues:7

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:927Issues:7Issues:9

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language:PythonLicense:NOASSERTIONStargazers:677Issues:18Issues:5

papermage

library supporting NLP and CV research on scientific papers

Language:PythonLicense:Apache-2.0Stargazers:659Issues:9Issues:33

ringattention

Transformers with Arbitrarily Large Context

Language:PythonLicense:Apache-2.0Stargazers:571Issues:5Issues:15

MS-AMP

Microsoft Automatic Mixed Precision Library

Language:PythonLicense:MITStargazers:501Issues:11Issues:63

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Language:PythonLicense:MITStargazers:443Issues:11Issues:14

annotated-mamba

Annotated version of the Mamba paper

Language:Jupyter NotebookLicense:MITStargazers:440Issues:22Issues:3

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language:PythonLicense:Apache-2.0Stargazers:312Issues:4Issues:28

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonLicense:Apache-2.0Stargazers:234Issues:11Issues:11

CapsFusion

[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale

llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning

Language:PythonLicense:MITStargazers:155Issues:21Issues:1

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language:PythonLicense:Apache-2.0Stargazers:140Issues:11Issues:3

touchdown

Cornell Touchdown natural language navigation and spatial reasoning dataset.

Language:PythonLicense:CC-BY-4.0Stargazers:92Issues:13Issues:3

triton-autodiff

Experiment of using Tangent to autodiff triton

Language:PythonLicense:MITStargazers:66Issues:6Issues:0