Beast code in Giters

YNCao's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25247 221 456

TelegramGroup

2024最新悄咪咪收集的10000+个Telegram群合集，附带全网最有趣最好用的机器人BOT🤖【tg百科】

13567 1890

rectg

我们从5000多个Telegram群组、频道和机器人中精心挑选了最优质的资源。本项目中的所有内容均来自互联网，仅用于学习和技术研究目的。

Language:PythonApache-2.05961 51 33

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookApache-2.04822 61 367

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonApache-2.02783 26 178

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause2664 32 154

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonApache-2.02383 20 363

hallelujahIM

hallelujahIM(哈利路亚英文输入法) is an intelligent English input method with auto-suggestions and spell check features.

Language:Objective-C++GPL-3.02145 32 152

VLM_survey

Collection of AWESOME vision-language models for vision tasks

2105 120 7

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonApache-2.01876 24 87

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonApache-2.01713 11 135

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

MIT1654 44 29

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonApache-2.01589 21 85

RSS-to-Telegram-Bot

A Telegram RSS bot that cares about your reading experience

Language:PythonAGPL-3.01435 5 103

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonCC-BY-4.01119 13 118