Beast code in Giters

yangmin09's starred repositories

awesome-chatgpt-prompts-zh

A Chinese-language guide to prompting ChatGPT. Usage guides for various scenarios. Learn how to make it do what you say.

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | 45103 299 650

TaskMatrix

Language: Python | License: NOASSERTION | 34456 309 348

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | 25096 219 449

paper-reading

Paragraph-by-paragraph close readings of classic and new deep learning papers

License: Apache-2.0 | 24774 7010

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | 18962 296 1316

CVPR2024-Papers-with-Code

A collection of CVPR 2024 papers and open-source projects

16807 281 202

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | 13953 114 369

OpenChatKit

Language: Python | License: Apache-2.0 | 9017 121 98

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | 4412 34 188

OpenGpt

Create your own ChatGPT App in seconds.

Language: TypeScript | License: GPL-3.0 | 3950 34 49

open_flamingo

An open-source framework for training large multimodal models.

Language: Python | License: MIT | 3527 47 170

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language: Python | License: MIT | 2049 31 150

visual-openllm

Something like visual-chatgpt; an open-source version of ERNIE Bot (文心一言)

Language: Python | 1211 25 43

flamingo-pytorch

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Language: Python | License: MIT | 1159 21 13

unit-minions

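"Boosting AI R&D Efficiency: Train Your Own LoRA", covering LoRA training for Llama (Alpaca LoRA) and ChatGLM (ChatGLM Tuning) models. Training tasks: user story generation, test code generation, code-assist generation, text-to-SQL, text-to-code...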

Language: Jupyter Notebook | 1003 20 12

VideoX

VideoX: a collection of video cross-modal models

Language: Python | License: NOASSERTION | 939 22 109

SAM-Adapter-PyTorch

Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts

Language: Python | License: MIT | 836 10 74

Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Language: Python | License: Apache-2.0 | 767 11 28

Deep-Metric-Learning-Baselines

PyTorch Implementation for Deep Metric Learning Pipelines

Language: Python | License: Apache-2.0 | 572 17 24

VLog

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.

Language: Python | License: MIT | 503 6 10

UniFormerV2

[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Language: Python | License: Apache-2.0 | 274 7 75

ovr-cnn

A new framework for open-vocabulary object detection, based on maskrcnn-benchmark

Language: Python | License: MIT | 214 5 28

Cap4Video

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Language: Python | License: MIT | 214 9 28

unicom

[ICLR 2023] Unicom: Universal and Compact Representation Learning for Image Retrieval

Language: Python | 204 8 21

Text4Vis

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Language: Python | License: MIT | 198 6 23

BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python | License: Apache-2.0 | 151 7 10

BIKE

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Language: Python | License: MIT | 150 12 20

TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

Language: Python | License: MIT | 76 10 13

DeepLogo2

A brand logo detection system by DETR

Language: Python | License: MIT | 50 3 8