coura (courao)

courao

User data from Github https://github.com/courao

Company:NJU

Location:Hangzhou

GitHub:@courao

coura's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:41033Issues:393Issues:1306

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:39097Issues:360Issues:1901

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:16911Issues:115Issues:405

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15752Issues:133Issues:621

FastSAM

Fast Segment Anything

Language:PythonLicense:AGPL-3.0Stargazers:8056Issues:54Issues:215

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:Jupyter NotebookLicense:MITStargazers:5508Issues:36Issues:348

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5380Issues:45Issues:131

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:4153Issues:32Issues:279

VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:4150Issues:42Issues:358

FlagAI

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Language:PythonLicense:Apache-2.0Stargazers:3864Issues:41Issues:211

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonLicense:MITStargazers:3227Issues:80Issues:164

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language:PythonLicense:MITStargazers:1928Issues:15Issues:83

Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

invisible-watermark

python library for invisible image watermark (blind image watermark)

Language:PythonLicense:MITStargazers:1776Issues:14Issues:33

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language:PythonLicense:NOASSERTIONStargazers:1408Issues:44Issues:275

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonLicense:Apache-2.0Stargazers:829Issues:14Issues:46

RegionCLIP

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Language:PythonLicense:Apache-2.0Stargazers:756Issues:9Issues:102

UniDetector

Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".

Language:PythonLicense:Apache-2.0Stargazers:564Issues:15Issues:35

Chinese-LLaVA

支持中英文双语视觉-文本对话的开源可商用多模态模型。

Language:PythonLicense:Apache-2.0Stargazers:370Issues:5Issues:9

mvits_for_class_agnostic_od

[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".

Language:PythonLicense:MITStargazers:308Issues:6Issues:32

object-centric-ovd

[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:290Issues:5Issues:23

SimCSE-Chinese-Pytorch

SimCSE在中文上的复现,有监督+无监督

Language:PythonLicense:MITStargazers:275Issues:1Issues:23

LLaVAR

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Language:PythonLicense:Apache-2.0Stargazers:269Issues:6Issues:21

good

[ICLR'23] GOOD: Exploring Geometric Cues for Detecting Objects in an Open World

Language:PythonLicense:MITStargazers:40Issues:6Issues:6

ONNX-ImageNet-1K-Object-Detector

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX. The repository combines a class agnostic object localizer to first detect the objects in the image, and next a ResNet50 model trained on ImageNet is used to label each box.

Language:PythonLicense:MITStargazers:34Issues:4Issues:5

BiomedCLIP-LoRA

Pytorch implementation of BiomedCLIP vision model with LoRA tuning

MoCo-v2-SupContrast

Supervised Contrastive Learning (SupContrast) based on MoCo-v2

Language:PythonLicense:NOASSERTIONStargazers:16Issues:1Issues:1