lalith's repositories
Surgical_VQA
Surgical Visual Question Answering. A transformer-based surgical VQA model. Official implementation of "Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformers", MICCAI 2022.
Global-reasoned-multi-task-model
Globally reasoned multi-task model for surgical scene understanding. A multi-task model for segmentation and scene graph. Official implementation of "Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding", ICRA 2022 & RA-L.
Domain-adaptation-in-MTL
Domain adaptation in surgical scene understanding. An MTL model for scene graph and scene captioning. Official implementation of "Task-Aware Asynchronous MTL with Class Incremental Contrastive Learning for Surgical Scene Understanding", IJCARS 2022.
Domain-Generalization-for-Surgical-Scene-Graph
Domain Generalization in surgical scene graph.
GradCAMDownstreamTask
End-to-end surgical scene understanding models using gradient-based localization. Official implementation of "Rethinking Feature Extraction: Gradient-based Localized Feature Extraction for End-to-End Surgical Downstream Tasks", RA-L 2022.
Surgical_SceneGraph_Generation
MICCAI 2020
GloRe
Global Reasoning module for visual recognition
Surgical-VQLA
ICRA 2023
ILARS
Interactive Learning Assistant for Robotic Surgery
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.