Fraser Greenlee (Fraser-Greenlee)

Fraser-Greenlee

Geek Repo

Company:Stealth

Location:Scotland, UK

Home Page:frasgreen.com

Twitter:@FraserGreenlee

Github PK Tool:Github PK Tool

Fraser Greenlee's starred repositories

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:42807Issues:438Issues:9258

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language:TypeScriptLicense:GPL-3.0Stargazers:31397Issues:296Issues:457

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25310Issues:221Issues:458

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18738Issues:117Issues:527

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Language:PythonLicense:NOASSERTIONStargazers:10776Issues:230Issues:89

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5891Issues:65Issues:421

memory_profiler

Monitor Memory usage of Python code

Language:PythonLicense:NOASSERTIONStargazers:4332Issues:81Issues:238

vimGPT

Browse the web with GPT-4V and Vimium

Language:PythonLicense:MITStargazers:2601Issues:28Issues:22

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookLicense:MITStargazers:2361Issues:23Issues:231

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1820Issues:16Issues:29

vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:1604Issues:20Issues:77

webllama

Llama-3 agents that can browse the web by following instructions and talking to you

Language:PythonLicense:MITStargazers:1320Issues:23Issues:9

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

SoM

Set-of-Mark Prompting for GPT-4V and LMMs

Language:PythonLicense:MITStargazers:1101Issues:21Issues:35

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonLicense:MITStargazers:895Issues:9Issues:17

groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Paella

Official Implementation of Paella https://arxiv.org/abs/2211.07292v2

Language:Jupyter NotebookLicense:MITStargazers:737Issues:16Issues:25
Language:PythonLicense:Apache-2.0Stargazers:668Issues:15Issues:59

message-book

make a book from imessages

BrowserGym

BrowserGym, a gym environment for web task automation in the Chromium browser.

Language:PythonLicense:NOASSERTIONStargazers:266Issues:9Issues:37

ChartVLM

Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Language:PythonLicense:CC-BY-4.0Stargazers:207Issues:13Issues:15

OBELICS

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Language:PythonLicense:Apache-2.0Stargazers:184Issues:7Issues:12

SeeClick

The model, data and code for the visual GUI Agent SeeClick

Language:HTMLLicense:Apache-2.0Stargazers:184Issues:1Issues:40

ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Language:PythonLicense:Apache-2.0Stargazers:162Issues:6Issues:11

LARC

Language-annotated Abstraction and Reasoning Corpus

Language:JavaScriptLicense:NOASSERTIONStargazers:76Issues:4Issues:1

guesswhat

GuessWhat?! Baselines

Language:PythonLicense:Apache-2.0Stargazers:72Issues:11Issues:25

agent_reasoning_benchmark

🔧 Compare how Agent systems perform on several benchmarks. 📊🚀

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:41Issues:2Issues:4