tathaghosh

Tathagata Ghosh's starred repositories

LLMs-from-scratch

Implementing a ChatGPT-like LLM from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | 14045 184 31

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

8929 212 89

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | 5654 62 54

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

License: MIT | 4281 221 48

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language: Python | License: Apache-2.0 | 2970 49 92

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License: MIT | 1924 39 3

VLM_survey

Collection of AWESOME vision-language models for vision tasks

1739 108 6

TinyGPT-V

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Language: Python | License: BSD-3-Clause | 1180 18 32

Efficient-LLMs-Survey

Efficient Large Language Models: A Survey

License: Apache-2.0 | 694 16 9

Awesome-CV-Foundational-Models

410 20 6

diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Language: Python | 346 17 23

pfgmpp

Code for ICML 2023 paper, "PFGM++: Unlocking the Potential of Physics-Inspired Generative Models"

Language: Python | License: NOASSERTION | 340 10 13

PromptIR

PromptIR: Prompting for All-in-One Blind Image Restoration [NeurIPS 2023]

Language: Python | License: NOASSERTION | 265 5 28

bigvsan

PyTorch implementation of BigVSAN

Language: Python | License: MIT | 179 28 3

DeepMIR

Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)

License: NOASSERTION | 169 6 1

sketchy-vision

Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV, stick around!

146 160

pytorch_HMM

HMMs in PyTorch

Language: Jupyter Notebook | License: Apache-2.0 | 129 4 3

multi-domain-imbalance

[ECCV 2022] Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization, and Beyond

Language: Python | License: MIT | 117 4 6

dyffusion

[NeurIPS 2023] A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting

Language: Python | License: Apache-2.0 | 105 4 2

ATS

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

Language: Shell | License: Apache-2.0 | 77 3 8

Beyond-INet

Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"

Language: Python | License: MIT | 76 40

triton-autodiff

An experiment in using Tangent to autodiff Triton

Language: Python | License: MIT | 66 40

forgetting

Repository of code for the experiments for the ICLR submission "An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Networks"

Language: Python | 63 8 1

PLIKS

PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation

Language: Python | 3500

[ICLR 2024 WS] An open-source evaluator VLM that offers reproducible evaluation and is inexpensive to use. Specifically designed for fine-grained evaluation against a customized score rubric, Prometheus-Vision is a good alternative to human evaluation and GPT-4V evaluation.

Language: Python | License: Apache-2.0 | 3404