shanyou92's starred repositories

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1443Issues:0Issues:0

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptLicense:MITStargazers:72656Issues:0Issues:0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:3924Issues:0Issues:0

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:595Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:81Issues:0Issues:0

GUICourse

GUICourse: From General Vision Langauge Models to Versatile GUI Agents

Language:PythonStargazers:36Issues:0Issues:0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonLicense:Apache-2.0Stargazers:3458Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:296Issues:0Issues:0

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonLicense:NOASSERTIONStargazers:469Issues:0Issues:0

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Stargazers:1065Issues:0Issues:0

Data-centric_multimodal_LLM

Survey on Data-centric Large Language Models

Stargazers:48Issues:0Issues:0

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Stargazers:297Issues:0Issues:0

Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

License:Apache-2.0Stargazers:166Issues:0Issues:0

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:6992Issues:0Issues:0

Awesome-LLM-for-RecSys

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

License:MITStargazers:828Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3076Issues:0Issues:0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1854Issues:0Issues:0
Language:PythonStargazers:334Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:15985Issues:0Issues:0

Image-Downloader

Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.

Language:PythonLicense:MITStargazers:2149Issues:0Issues:0

Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

License:MITStargazers:173Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:7857Issues:0Issues:0

ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Language:PythonLicense:Apache-2.0Stargazers:147Issues:0Issues:0

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:2145Issues:0Issues:0

Agent-FLAN

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

License:Apache-2.0Stargazers:294Issues:0Issues:0

Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Language:PythonLicense:Apache-2.0Stargazers:107Issues:0Issues:0

CoT-Reasoning-Survey

[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

License:MITStargazers:243Issues:0Issues:0

AlignBench

大模型多维度中文对齐评测基准 (ACL 2024)

Language:PythonStargazers:249Issues:0Issues:0

Awesome-LLM-Eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

License:MITStargazers:343Issues:0Issues:0

LaMP

Codes for papers on Large Language Models Personalization (LaMP)

Language:PythonStargazers:93Issues:0Issues:0