YNCao's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25247Issues:221Issues:456

TelegramGroup

2024最新悄咪咪收集的10000+个Telegram群合集,附带全网最有趣最好用的机器人BOT🤖【tg百科】

rectg

我们从5000多个Telegram群组、频道和机器人中精心挑选了最优质的资源。本项目中的所有内容均来自互联网,仅用于学习和技术研究目的。

Language:PythonLicense:Apache-2.0Stargazers:5961Issues:51Issues:33

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4822Issues:61Issues:367

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2783Issues:26Issues:178

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2664Issues:32Issues:154

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:2383Issues:20Issues:363

hallelujahIM

hallelujahIM(哈利路亚 英文输入法) is an intelligent English input method with auto-suggestions and spell check features.

Language:Objective-C++License:GPL-3.0Stargazers:2145Issues:32Issues:152

VLM_survey

Collection of AWESOME vision-language models for vision tasks

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1876Issues:24Issues:87

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1713Issues:11Issues:135

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1589Issues:21Issues:85

RSS-to-Telegram-Bot

A Telegram RSS bot that cares about your reading experience

Language:PythonLicense:AGPL-3.0Stargazers:1435Issues:5Issues:103

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1119Issues:13Issues:118

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:926Issues:71Issues:22

CLIP_benchmark

CLIP-like model evaluation

Language:Jupyter NotebookLicense:MITStargazers:560Issues:12Issues:64

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language:PythonLicense:NOASSERTIONStargazers:549Issues:14Issues:49

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language:PythonLicense:Apache-2.0Stargazers:541Issues:13Issues:20

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:514Issues:29Issues:35

TinyLLaVABench

A Framework of Small-scale Large Multimodal Models

Language:PythonLicense:Apache-2.0Stargazers:241Issues:6Issues:51

mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Language:PythonLicense:Apache-2.0Stargazers:213Issues:6Issues:23

VQASynth

Compose multimodal datasets 🎹

TLImporter

📲 Telegram Chat Importer: Import chats from WhatsApp or other services into Telegram

Language:PythonLicense:AGPL-3.0Stargazers:126Issues:10Issues:25
Language:PythonLicense:MITStargazers:40Issues:1Issues:2

TLMerger

🔀 Telegram Chat Merger: Merge chats in Telegram

Language:PythonLicense:AGPL-3.0Stargazers:34Issues:4Issues:8

UnionChannel_telegram

With this bot you can merge (combine) all your telegram channels into one news feed

Language:PythonLicense:GPL-3.0Stargazers:20Issues:5Issues:4

Levels_image_captioning_NICE

NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11Issues:2Issues:0

awesome-diffusion

A collection of papers and resources on diffusion models

Language:PythonStargazers:9Issues:3Issues:0