Beast code in Giters

Roman Gudchenko's starred repositories

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT30737 426 4211

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.019684 382 27

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause10105 97 676

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.06223 68 432

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookApache-2.04918 42 125

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Language:Python2425 23 96

VMamba

VMamba: Visual State Space Models，code is based on mamba

Language:PythonMIT2309 17 342

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language:PythonNOASSERTION1649 25 90

CMC

[arXiv 2019] "Contrastive Multiview Coding", also contains implementations for MoCo and InstDis

Language:PythonBSD-2-Clause1311 28 70

MaskDINO

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Language:PythonApache-2.01238 35 113

SimMIM

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Language:PythonMIT936 23 42

PointTransformerV3

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Language:PythonMIT880 14 124

objaverse-xl

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Language:PythonApache-2.0815 10 54

OpenSeeD

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Language:PythonApache-2.0675 22 39

UniMatch

[CVPR 2023] Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation

Language:PythonMIT490 3 118

Awesome-CV-Foundational-Models

472 19 6

maxvit

[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

Language:Jupyter NotebookApache-2.0453 9 20

DSVT

[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"

Language:PythonApache-2.0392 8 78

oneformer3d

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Language:PythonNOASSERTION369 9 99

VOTR

Voxel Transformer for 3D object detection

Language:Python248 3 31

Swin3D

A shift-window based transformer for 3D sparse tasks

Language:CudaMIT223 9 30

Battle-of-the-Backbones

198 5 2

multi_token

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

Language:PythonApache-2.0175 3 25

jetson-intro-to-distillation

A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson

Language:PythonNOASSERTION160 4 2

CoolGraph

Make GNN easy to start with

Language:Jupyter NotebookMIT126 5 1

semivl

[ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Language:PythonApache-2.0117 5 11

M3I-Pretraining

[CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.

90 12 5

multi-view-AE

Multi-view-AE: An extensive collection of multi-modal autoencoders implemented in a modular, scikit-learn style framework.

Language:PythonMIT46 3 16

GPT4V-Medical-Report

Language:Python43 10

SurgicalGPT

Language:Python24 1 3