1170300714's starred repositories

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19494Issues:160Issues:1492

ARC-AGI

The Abstraction and Reasoning Corpus

Language:JavaScriptLicense:Apache-2.0Stargazers:3345Issues:96Issues:67

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1975Issues:6Issues:243

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonLicense:Apache-2.0Stargazers:1284Issues:34Issues:69

PETR

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Language:PythonLicense:NOASSERTIONStargazers:856Issues:14Issues:161

GLUE-baselines

[DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations

FedProx

Federated Optimization in Heterogeneous Networks (MLSys '20)

Language:PythonLicense:MITStargazers:636Issues:5Issues:29

DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Language:PythonLicense:NOASSERTIONStargazers:579Issues:9Issues:16

VisionLLaMA

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language:PythonLicense:MITStargazers:338Issues:9Issues:82

Spike-Driven-Transformer

Offical implementation of "Spike-driven Transformer" (NeurIPS2023)

Language:PythonLicense:Apache-2.0Stargazers:208Issues:3Issues:12

NExT-Chat

The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Language:PythonLicense:Apache-2.0Stargazers:205Issues:2Issues:21

FedNova

PyTorch implementation of FedNova (NeurIPS 2020), and a class of federated learning algorithms, including FedAvg, FedProx.

squad

Starter code for Stanford CS224n default final project on SQuAD 2.0

Language:PythonLicense:MITStargazers:184Issues:8Issues:3

Endo-FM

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

Language:PythonLicense:Apache-2.0Stargazers:154Issues:2Issues:25

commonsenseqa

Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"

TPT

Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))

Language:PythonLicense:MITStargazers:136Issues:3Issues:15

GenerateU

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Overcoming-Catastrophic-forgetting-in-Neural-Networks

Elastic weight consolidation technique for incremental learning.

Language:Jupyter NotebookStargazers:124Issues:3Issues:8

BlackVIP

Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"

talk2bev

Talk2BEV: Language-Enhanced Bird's Eye View Maps (Accepted to ICRA'24)

Language:PythonLicense:BSD-3-ClauseStargazers:93Issues:2Issues:9

POP3D

Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"

MineDreamer

This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control "

Language:PythonLicense:Apache-2.0Stargazers:66Issues:4Issues:2

nuscenes-download

script for downloading nuscenes

CoMFormer

Official implementation of "CoMFormer: Continual Learning in Semantic and Panoptic Segmentation"

Language:PythonLicense:NOASSERTIONStargazers:37Issues:3Issues:7

GMM

Generative Multi-modal Models are Good Class Incremental Learners, CVPR 2024 [PyTorch Code]

GS-LoRA

Continual Forgetting for Pre-trained Vision Models (CVPR 2024)

Language:PythonLicense:MITStargazers:32Issues:3Issues:7

DualCross

[IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception

Language:PythonLicense:MITStargazers:28Issues:2Issues:1

annotation2mask

Converting coco-like annotation json files to png masks

Language:PythonStargazers:24Issues:1Issues:0