Beast code in Giters

Minh Tran's starred repositories

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | 6902 54 1460

lovely-tensors

Tensors, ready for human consumption

Language: Jupyter Notebook | License: MIT | 1060 11 19

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language: Python | License: NOASSERTION | 816 40 42

transfiner

Mask Transfiner for High-Quality Instance Segmentation, CVPR 2022

Language: Python | License: Apache-2.0 | 522 11 55

CEDNet

CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction

License: MIT | 104 3 18

OpenFusion

[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

Language: Python | 78 6 4

AIOZ-GDANCE

AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)

Language: Python | License: NOASSERTION | 69 8 4

VLTinT

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Language: Jupyter Notebook | 64 4 14

FASeg

[CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".

Language: Python | License: NOASSERTION | 54 4 5

Qualia2.0

Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.

Language: Python | License: MIT | 48 30

torch-warmup-lr

Warmup learning rate wrapper for PyTorch Scheduler

Language: Python | License: MIT | 39 2 3

copy_paste_aug_detectron2

Copy-paste augmentation in detectron2 pipeline

Language: Jupyter Notebook | 33 2 3

SAM3D

[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation

Language: Python | License: MIT | 33 3 5

VLCAP

[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Language: Jupyter Notebook | 28 3 11

DirecFormer

[CVPR'22] DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

Language: Python | License: Apache-2.0 | 25 1 2

ECG_SSL_12Lead

[IEEE BHI 2022] Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning

Language: Python | 24 10

AOE-Net

[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

Language: Python | 19 2 9

AerialFormer

[preprint] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

Language: Python | 18 3 1

detectron2-xyz

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language: Python | License: Apache-2.0 | 16 10

TAPG-AgentEnvInteration

[BMVC 2021 Oral] AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposal Generation

Language: Python | 10 1 2

detectron2_ema

Language: Python | License: Apache-2.0 | 7 20

IAI

[WACV 2024] Decoding Radiologists’ Intense Focus for Accurate CXR Diagnoses: A Controllable & Interpretable AI System

Language: Python | 6 40

Video_Representation

[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding

5 10

stock-trend-predictions

Stock trend prediction based on the news headlines

Language: Python | License: Apache-2.0 | 3 20

gotrongluan.github.io

Language: HTML | 2 10

rebiber

A simple tool to update bib entries with their official information.

Language: Python | License: MIT | 200

Medical-in-CVPR

Language: Jupyter Notebook | 200

ivos-demo

Language: Python | 2 10

DINO-Libtorch-CPP

An example of the DINO detector using C++ and the Libtorch library

Language: C++ | 100

ZEETAD

[WACV2024] ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection

1 2 1