lalith's starred repositories
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Includes training, evaluation, inference, and export scripts, plus pretrained weights, for ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more.
openai-python
The official Python library for the OpenAI API
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
autoresearcher
⚡ Automating scientific workflows with AI ⚡
sawIntuitiveResearchKit
cisst/SAW stack for the da Vinci Research Kit (dVRK)
surgical_robotics_challenge
Interactive Robot Assisted Suturing Simulation
Surgical_VQA
Surgical Visual Question Answering. A transformer-based surgical VQA model. Official implementation of "Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformers", MICCAI 2022.
Surgical-VQLA
Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery", ICRA 2023.
Domain-adaptation-in-MTL
Domain adaptation in surgical scene understanding. An MTL model for scene graph and scene captioning. Official implementation of "Task-Aware Asynchronous MTL with Class Incremental Contrastive Learning for Surgical Scene Understanding", IJCARS 2022.
GradCAMDownstreamTask
End-to-end surgical scene understanding models using gradient-based localization. Official implementation of "Rethinking Feature Extraction: Gradient-based Localized Feature Extraction for End-to-End Surgical Downstream Tasks", RA-L 2022.