Omar Moustafa's starred repositories

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ | License: Apache-2.0 | Stargazers: 185819 | Issues: 7596 | Issues: 39818

supervision

We write your reusable computer vision tools. 💜

Language: Python | License: MIT | Stargazers: 22825 | Issues: 155 | Issues: 416

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 18494 | Issues: 111 | Issues: 1223

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13625 | Issues: 115 | Issues: 1047

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9959 | Issues: 85 | Issues: 133

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 9169 | Issues: 136 | Issues: 445

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language: Python | License: GPL-3.0 | Stargazers: 8894 | Issues: 56 | Issues: 525

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | Stargazers: 8831 | Issues: 63 | Issues: 213

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 6127 | Issues: 50 | Issues: 1015

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language: Python | License: AGPL-3.0 | Stargazers: 5182 | Issues: 60 | Issues: 2014

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 3418 | Issues: 29 | Issues: 156

deepdoctection

A Repo For Document AI

Language: Python | License: Apache-2.0 | Stargazers: 2525 | Issues: 18 | Issues: 180

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language: Python | License: Apache-2.0 | Stargazers: 2497 | Issues: 55 | Issues: 740

tensorflow-onnx

Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2300 | Issues: 59 | Issues: 1041

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 2255 | Issues: 41 | Issues: 95

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Language: Python | License: MIT | Stargazers: 2224 | Issues: 39 | Issues: 141

DIS

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2222 | Issues: 91 | Issues: 124

Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle helps agents ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curation in a standardized general environment with minimal requirements.

Language: Python | License: MIT | Stargazers: 1755 | Issues: 23 | Issues: 31

AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1389 | Issues: 64 | Issues: 33

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language: Python | License: NOASSERTION | Stargazers: 632 | Issues: 25 | Issues: 34

unitable

UniTable: Towards a Unified Table Foundation Model

Language: Jupyter Notebook | License: MIT | Stargazers: 353 | Issues: 9 | Issues: 29

Hi-SAM

[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Language: Python | License: Apache-2.0 | Stargazers: 192 | Issues: 12 | Issues: 18

Dlib_Windows_Python3.x

Dlib compiled binary (.whl) for Python 3.7-3.12 and Windows x64

Language: Python | License: MIT | Stargazers: 128 | Issues: 2 | Issues: 15

geo-clip

This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"

Language: Python | License: MIT | Stargazers: 126 | Issues: 8 | Issues: 2

ultralyticsplus

Huggingface utilities for Ultralytics/YOLOv8

Language: Python | License: GPL-3.0 | Stargazers: 77 | Issues: 2 | Issues: 0

RADAM

We propose a new method named Random encoding of Aggregated Deep Activation Maps (RADAM) for feature extraction from pre-trained deep CNNs. The technique consists of encoding the output at different depths of the CNN using a Randomized Autoencoder, producing a single image descriptor.

Language: Python | License: MIT | Stargazers: 32 | Issues: 4 | Issues: 1

segformer-tf-transformers

This repository demonstrates how to use the TensorFlow-based SegFormer model in the 🤗 transformers package.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 30 | Issues: 4 | Issues: 4

EALPR

A New Benchmark Dataset for Egyptian License Plate Detection and Recognition

clustertabnet

Implementation of the table detection and table structure recognition deep learning model described in the paper "ClusterTabNet: Supervised clustering method for table detection and table structure recognition".

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7 | Issues: 6 | Issues: 3