Tathagata Ghosh's starred repositories

LLMs-from-scratch

Implementing a ChatGPT-like LLM from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:14045Issues:184Issues:31
Language:PythonLicense:Apache-2.0Stargazers:9380Issues:97Issues:273

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:5654Issues:62Issues:54

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:2970Issues:49Issues:92

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

VLM_survey

Collection of AWESOME vision-language models for vision tasks

Multimodal-GPT

Multimodal-GPT

Language:PythonLicense:Apache-2.0Stargazers:1402Issues:12Issues:15

TinyGPT-V

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Language:PythonLicense:BSD-3-ClauseStargazers:1180Issues:18Issues:32

Efficient-LLMs-Survey

Efficient Large Language Models: A Survey

diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

pfgmpp

Code for ICML 2023 paper, "PFGM++: Unlocking the Potential of Physics-Inspired Generative Models"

Language:PythonLicense:NOASSERTIONStargazers:340Issues:10Issues:13

PromptIR

PromptIR: Prompting for All-in-One Blind Image Restoration [NeurIPS 2023]

Language:PythonLicense:NOASSERTIONStargazers:265Issues:5Issues:28

bigvsan

Pytorch implementation of BigVSAN

Language:PythonLicense:MITStargazers:179Issues:28Issues:3

DeepMIR

Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)

sketchy-vision

Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!

pytorch_HMM

HMMs in PyTorch

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:129Issues:4Issues:3

multi-domain-imbalance

[ECCV 2022] Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization, and Beyond

Language:PythonLicense:MITStargazers:117Issues:4Issues:6

dyffusion

[NeurIPS 2023] A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting

Language:PythonLicense:Apache-2.0Stargazers:105Issues:4Issues:2

ATS

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

Language:ShellLicense:Apache-2.0Stargazers:77Issues:3Issues:8

Beyond-INet

Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"

Language:PythonLicense:MITStargazers:76Issues:4Issues:0

triton-autodiff

Experiment of using Tangent to autodiff triton

Language:PythonLicense:MITStargazers:66Issues:4Issues:0

forgetting

Repository of code for the experiments for the ICLR submission "An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Networks"

PLIKS

PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation

Language:PythonStargazers:35Issues:0Issues:0

prometheus-vision

[ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on customized score rubric, Prometheus-Vision is a good alternative for human evaluation and GPT-4V evaluation.

Language:PythonLicense:Apache-2.0Stargazers:34Issues:0Issues:4

eP-ALM

[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.

Language:Jupyter NotebookLicense:MITStargazers:26Issues:4Issues:3
License:GPL-3.0Stargazers:18Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:8Issues:5Issues:0