zqcrafts's starred repositories

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | 47552 308 668

CLIP

CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image.

Language: Jupyter Notebook | License: MIT | 25824 323 403

Qwen

The official repo of the Qwen (通义千问) chat and pretrained large language models proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | 13995 104 1052

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | 10272 78 487

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | 6715 42 303

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Language: Python | License: MIT | 5969 52 604

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language: Python | License: MIT | 4074 36 39

reproducible-image-denoising-state-of-the-art

Collection of popular and reproducible image denoising works.

2404 117 10

awesome-community-detection

A curated list of community detection research papers with implementations.

Language: Python | License: CC0-1.0 | 2331 110 8

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python | License: Apache-2.0 | 1602 33 21

SoM

Set-of-Mark Prompting for GPT-4V and LMMs

Language: Python | License: MIT | 1165 22 35

Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Language: Jupyter Notebook | License: Apache-2.0 | 1060 9 49

SAM-Adapter-PyTorch

Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts

Language: Python | License: MIT | 1043 8 95

Osprey

[CVPR 2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language: Python | License: Apache-2.0 | 770 14 44

graphrag-local-ollama

Local model support for Microsoft's graphrag using ollama (llama3, mistral, gemma2, phi3): LLM and embedding extraction

Language: Python | License: MIT | 742 9 40

all-seeing

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World

Language: Python | 457 23 22