Kumara Kahatapitiya (kkahatapitiya)

kkahatapitiya

Geek Repo

Company:Stony Brook University

Location:NY

Home Page:www3.cs.stonybrook.edu/~kkahatapitiy/

Twitter:@kkahatapitiy

Github PK Tool:Github PK Tool

Kumara Kahatapitiya's starred repositories

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55581Issues:519Issues:959

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36519Issues:348Issues:1768

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:27234Issues:224Issues:4545

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14811Issues:114Issues:385

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10371Issues:104Issues:146

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookLicense:MITStargazers:6516Issues:61Issues:121

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonLicense:GPL-3.0Stargazers:5698Issues:78Issues:142

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4477Issues:71Issues:82

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2722Issues:32Issues:156

spconv

Spatial Sparse Convolution Library

Language:PythonLicense:Apache-2.0Stargazers:1842Issues:24Issues:690

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1336Issues:18Issues:63

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Language:PythonLicense:Apache-2.0Stargazers:1273Issues:50Issues:31

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1161Issues:14Issues:119

visual_anagrams

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Language:Jupyter NotebookLicense:MITStargazers:835Issues:9Issues:13

sige

[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:257Issues:5Issues:2

X3D-Multigrid

PyTorch implementation of X3D models with Multigrid training.

Language:PythonLicense:MITStargazers:92Issues:2Issues:12

LLoVi

Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"

Language:PythonLicense:MITStargazers:81Issues:6Issues:6

Coarse-Fine-Networks

Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"

Language:PythonLicense:MITStargazers:55Issues:2Issues:11

crossway_diffusion

The official code of our ICRA'24 paper Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning

Language:PythonLicense:MITStargazers:51Issues:2Issues:7

LangRepo

Language Repository for Long Video Understanding

Language:PythonLicense:MITStargazers:27Issues:2Issues:1

mvu

Multimodal Video Understanding Framework (MVU)

Language:PythonLicense:MITStargazers:22Issues:2Issues:0

lifelong-memory

Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

Language:PythonLicense:MITStargazers:13Issues:2Issues:1

LinearConv

Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"

Language:PythonLicense:MITStargazers:9Issues:2Issues:3

SSDet

Code for our AAAI 2023 paper "Weakly-guided Self-supervised Pretraining for Temporal Activity Detection"

Language:PythonLicense:MITStargazers:9Issues:1Issues:0

open_x_pytorch_dataloader

An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment

Language:PythonLicense:MITStargazers:7Issues:2Issues:0

SWAT

Code for our IJCAI 2023 paper "SWAT: Spatial Structure Within and Among Tokens"

Language:PythonLicense:MITStargazers:3Issues:1Issues:0