Kirolos Ataallah (KerolosAtef)

KerolosAtef

Geek Repo

Company:KAUST University

Location:KSA

Github PK Tool:Github PK Tool

Kirolos Ataallah's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:135038Issues:1125Issues:16147

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:30256Issues:247Issues:5258

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25425Issues:218Issues:464

rembg

Rembg is a tool to remove images background

Language:PythonLicense:MITStargazers:16982Issues:149Issues:507

LWM

Large World Model -- Modeling Text and Video with Millions Context

Language:PythonLicense:Apache-2.0Stargazers:7150Issues:66Issues:71

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1214Issues:15Issues:122

color-thief-py

Grabs the dominant color or a representative color palette from an image. Uses Python and Pillow.

Language:PythonLicense:NOASSERTIONStargazers:1042Issues:17Issues:22

pytubefix

Python3 library for downloading YouTube Videos.

Language:PythonLicense:MITStargazers:700Issues:17Issues:185

PLLaVA

Official repository for the paper PLLaVA

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language:PythonLicense:BSD-3-ClauseStargazers:554Issues:12Issues:40

MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:525Issues:10Issues:80

ChatCaptioner

Official Repository of ChatCaptioner

Language:Jupyter NotebookLicense:MITStargazers:452Issues:4Issues:7

TubeDETR

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Language:PythonLicense:Apache-2.0Stargazers:171Issues:3Issues:22

MLVU

🔥🔥MLVU: Multi-task Long Video Understanding Benchmark

FunQA

FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, and beyond.

Language:PythonLicense:MITStargazers:96Issues:2Issues:8

MiniGPT-Med

Open-sourced code of miniGPT-Med

Language:PythonLicense:Apache-2.0Stargazers:82Issues:4Issues:5

LVBench

LVBench: An Extreme Long Video Understanding Benchmark

CGSTVG

[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding

Language:PythonLicense:BSD-3-ClauseStargazers:12Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:2Issues:0
Language:Jupyter NotebookStargazers:1Issues:2Issues:0