Omkar Thawakar's repositories
composed-video-retrieval
Composed Video Retrieval
OWVISFormer
Open-World Video Instance Segmentation
OmkarThawakar.github.io
My Personal Website
Recurrent-Seqformer
Fast Video Instance Segmentation via Recurrent Encoder-based Transformers
anylabeling
Effortless AI-assisted data labeling with AI support from Segment Anything and YOLO!
arcgis-python-api
Documentation and samples for ArcGIS API for Python
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
CVRR-ES
Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
github-readme-stats
:zap: Dynamically generated stats for your github readmes
grok-1
Grok open release
grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
mlc
multi class classification
OmkarThawakar
git Readme
Samba
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
SeqFormer-1
SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation
Text-Generation-Using-GROQ
Text Generation Using GROQ
unetr_plus_plus
UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
ViTAE-Transformer-Remote-Sensing
The official repo for the paper "An Empirical Study of Remote Sensing Pretraining"
XrayGPT-1
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.