gz475's repositories
AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Awesome-Prompting-on-Vision-Language-Model
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
CoDeBetHe.jl
A Julia package to performing efficient spectral community detection on static and dynamical graphs.
custom_data_gpt
A chatbot based on OpenAI, suitable for enterprise privatized data fine-tuning. It can answer various questions related to enterprise products raised by users.
Emu
Emu Series: Generative Multimodal Models from BAAI
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
gpt-researcher
GPT based autonomous agent that does online comprehensive research on any given topic
gpt4all
gpt4all: open-source LLM chatbots that you can run anywhere
gpt4free
The official gpt4free repository | various collection of powerful language models
langchain
⚡ Building applications with LLMs through composability ⚡
LayoutGPT
Official repo for LayoutGPT
lida
Automatic Generation of Visualizations and Infographics using Large Language Models
LL3DA
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
LLM-scientific-feedback
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
MotionGPT
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
pipeline
Pipeline is an open source API for building AI/ML workflows
poe-api-wrapper
👾 A Python API wrapper for Poe.com, using Httpx. With this, you will have free access to ChatGPT, Claude, Llama, Google-PaLM and more! 🚀
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
sinr
Spatial Implicit Neural Representations for Global-Scale Species Mapping - ICML 2023
ToolBench
An open platform for training, serving, and evaluating large language model for tool learning.
toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
UUKG
UUKG: Unified Urban Knowledge Graph Dataset for Knowledge-Enhanced Urban Spatiotemporal Prediction
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Zero-shot-RIS
[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"