Minh Tran (trqminh)


Organizations
aioz-ai

Minh Tran's starred repositories

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 6902 | Issues: 54 | Issues: 1460

lovely-tensors

Tensors, ready for human consumption

Language: Jupyter Notebook | License: MIT | Stargazers: 1060 | Issues: 11 | Issues: 19

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language: Python | License: NOASSERTION | Stargazers: 816 | Issues: 40 | Issues: 42

transfiner

Mask Transfiner for High-Quality Instance Segmentation, CVPR 2022

Language: Python | License: Apache-2.0 | Stargazers: 522 | Issues: 11 | Issues: 55

CEDNet

CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction

OpenFusion

[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

AIOZ-GDANCE

AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)

Language: Python | License: NOASSERTION | Stargazers: 69 | Issues: 8 | Issues: 4

VLTinT

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Language: Jupyter Notebook | Stargazers: 64 | Issues: 4 | Issues: 14

FASeg

[CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".

Language: Python | License: NOASSERTION | Stargazers: 54 | Issues: 4 | Issues: 5

Qualia2.0

Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.

Language: Python | License: MIT | Stargazers: 48 | Issues: 3 | Issues: 0

torch-warmup-lr

Warmup learning rate wrapper for PyTorch Scheduler

Language: Python | License: MIT | Stargazers: 39 | Issues: 2 | Issues: 3

copy_paste_aug_detectron2

Copy-paste augmentation in detectron2 pipeline

Language: Jupyter Notebook | Stargazers: 33 | Issues: 2 | Issues: 3

SAM3D

[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation

Language: Python | License: MIT | Stargazers: 33 | Issues: 3 | Issues: 5

VLCAP

[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Language: Jupyter Notebook | Stargazers: 28 | Issues: 3 | Issues: 11

DirecFormer

[CVPR'22] DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 1 | Issues: 2

ECG_SSL_12Lead

[IEEE BHI 2022] Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning

Language: Python | Stargazers: 24 | Issues: 1 | Issues: 0

AOE-Net

[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

AerialFormer

[preprint] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

detectron2-xyz

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language: Python | License: Apache-2.0 | Stargazers: 16 | Issues: 1 | Issues: 0

TAPG-AgentEnvInteration

[BMVC 2021 Oral] AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposal Generation

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 2 | Issues: 0

IAI

[WACV 2024] Decoding Radiologists’ Intense Focus for Accurate CXR Diagnoses: A Controllable & Interpretable AI System

Language: Python | Stargazers: 6 | Issues: 4 | Issues: 0

Video_Representation

[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding

stock-trend-predictions

Stock trend prediction based on the news headlines

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 2 | Issues: 0

rebiber

A simple tool to update bib entries with their official information.

Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

DINO-Libtorch-CPP

An example of the DINO detector using C++ and the Libtorch library

Language: C++ | Stargazers: 1 | Issues: 0 | Issues: 0

ZEETAD

[WACV2024] ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection