anas-awadalla

followers

following

stars

Seattle, Washington

https://anas-awadalla.streamlit.app

Anas Awadalla's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.025357 218 4096

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookNOASSERTION25035 277 77

mlx

MLX: An array framework for Apple silicon

Language:C++MIT16250 141 490

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonGPL-3.015874 68 203

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.014582 114 382

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonMIT13083 96 357

ml-engineering

Machine Learning Engineering Open Book

Language:PythonCC-BY-SA-4.010522 109 19

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonGPL-3.09642 78 121

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Language:PythonApache-2.04741 74 147

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04373 49 286

alphageometry

Language:PythonApache-2.04016 52 114

low_cost_robot

Language:PythonMIT2911 50 26

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookApache-2.02217 32 7

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01881 44 107

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonApache-2.01052 42 72

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Language:PythonMIT1016 14 7

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookApache-2.0927 7 9

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language:PythonNOASSERTION677 18 5

papermage

library supporting NLP and CV research on scientific papers

Language:PythonApache-2.0659 9 33

ringattention

Transformers with Arbitrarily Large Context

Language:PythonApache-2.0571 5 15

MS-AMP

Microsoft Automatic Mixed Precision Library

Language:PythonMIT501 11 63

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Language:PythonMIT443 11 14

annotated-mamba

Annotated version of the Mamba paper

Language:Jupyter NotebookMIT440 22 3

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language:PythonApache-2.0312 4 28

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonApache-2.0234 11 11

CapsFusion

[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale

Language:Python187 21 6

llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning

Language:PythonMIT155 21 1

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language:PythonApache-2.0140 11 3

touchdown

Cornell Touchdown natural language navigation and spatial reasoning dataset.

Language:PythonCC-BY-4.092 13 3

triton-autodiff

Experiment of using Tangent to autodiff triton

Language:PythonMIT66 60