zhangisland

zhangisland

Geek Repo

Github PK Tool:Github PK Tool

zhangisland's starred repositories

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonLicense:MITStargazers:44308Issues:0Issues:0

awesome-ai-agents

A list of AI autonomous agents

License:NOASSERTIONStargazers:10721Issues:0Issues:0

RD-Agent

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.

Language:PythonLicense:MITStargazers:905Issues:0Issues:0

baml

BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground

Language:RustLicense:Apache-2.0Stargazers:1160Issues:0Issues:0

WindowsAgentArena

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Language:PythonLicense:MITStargazers:314Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:25365Issues:0Issues:0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:12262Issues:0Issues:0

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:6525Issues:0Issues:0

screen_annotation

The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and describe the UI elements present on the screen: their type, location, OCR text and a short description. It has been introduced in the paper `ScreenAI: A Vision-Language Model for UI and Infographics Understanding`.

Stargazers:46Issues:0Issues:0

UIED

An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]

Language:PythonLicense:Apache-2.0Stargazers:381Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:30221Issues:0Issues:0

intercode

[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

Language:PythonLicense:MITStargazers:189Issues:0Issues:0

SwissArmyTransformer

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Language:PythonLicense:Apache-2.0Stargazers:975Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26678Issues:0Issues:0

playwright

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

Language:TypeScriptLicense:Apache-2.0Stargazers:66288Issues:0Issues:0

UFO

A UI-Focused Agent for Windows OS Interaction.

Language:PythonLicense:MITStargazers:7726Issues:0Issues:0

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language:PythonLicense:MITStargazers:2823Issues:0Issues:0

IoA

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.

Language:PythonLicense:Apache-2.0Stargazers:569Issues:0Issues:0

langroid

Harness LLMs with Multi-Agent Programming

Language:PythonLicense:MITStargazers:2497Issues:0Issues:0

autogen

A programming framework for agentic AI 🤖

Language:C#License:CC-BY-4.0Stargazers:31986Issues:0Issues:0

MM-REACT

Official repo for MM-REACT

Language:PythonLicense:MITStargazers:930Issues:0Issues:0

netron

Visualizer for neural network, deep learning and machine learning models

Language:JavaScriptLicense:MITStargazers:27853Issues:0Issues:0

openai-python

The official Python library for the OpenAI API

Language:PythonLicense:Apache-2.0Stargazers:22471Issues:0Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:4405Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13718Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5947Issues:0Issues:0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License:Apache-2.0Stargazers:17033Issues:0Issues:0

AutoAct

[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning

Language:PythonLicense:Apache-2.0Stargazers:171Issues:0Issues:0

Data-Copilot

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Language:PythonLicense:MITStargazers:1385Issues:0Issues:0