Akash Kumar (AKASH2907)

AKASH2907

Geek Repo

Company:University of Central Florida

Location:Orlando, FL

Home Page:https://akash2907.github.io/

Github PK Tool:Github PK Tool

Akash Kumar's starred repositories

awesome-video-self-supervised-learning

A curated list of awesome self-supervised learning methods in videos

Stargazers:62Issues:0Issues:0
Language:PythonStargazers:32Issues:0Issues:0

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonLicense:BSD-3-ClauseStargazers:159Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B

Language:Jupyter NotebookLicense:MITStargazers:691Issues:0Issues:0

CM-Erase-REG

Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"

Language:PythonStargazers:34Issues:0Issues:0

visil

Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]

Language:PythonLicense:Apache-2.0Stargazers:198Issues:0Issues:0

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Language:RLicense:MITStargazers:5670Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:13699Issues:0Issues:0
Language:PythonStargazers:37Issues:0Issues:0

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

Language:Jupyter NotebookLicense:MITStargazers:14190Issues:0Issues:0

STAN

Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"

Language:PythonLicense:Apache-2.0Stargazers:81Issues:0Issues:0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonStargazers:21198Issues:0Issues:0

S3D_HowTo100M

S3D Text-Video model trained on HowTo100M using MIL-NCE

Language:PythonLicense:Apache-2.0Stargazers:184Issues:0Issues:0

tabilize

Simple code for generating a color-coded latex table from raw data

Language:Jupyter NotebookStargazers:146Issues:0Issues:0

acgcn

Code for the paper "Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection"

Language:PythonStargazers:12Issues:0Issues:0

EMA-VFI

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio

Language:PythonLicense:Apache-2.0Stargazers:308Issues:0Issues:0

hiera

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Language:PythonLicense:Apache-2.0Stargazers:691Issues:0Issues:0

Awesome-Referring-Image-Segmentation

:books: A collection of papers about Referring Image Segmentation.

Stargazers:528Issues:0Issues:0

EVAD

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Language:PythonLicense:NOASSERTIONStargazers:19Issues:0Issues:0

MI-AOD

Code for Multiple Instance Active Learning for Object Detection, CVPR 2021

Language:PythonLicense:Apache-2.0Stargazers:323Issues:0Issues:0
Language:PythonStargazers:15Issues:0Issues:0

NExT-QA

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Language:PythonLicense:MITStargazers:97Issues:0Issues:0

CLIP-Help-SimCLR

Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning

Language:PythonStargazers:19Issues:0Issues:0

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:927Issues:0Issues:0

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:PythonLicense:MITStargazers:330Issues:0Issues:0

singularity

[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"

Language:PythonLicense:MITStargazers:124Issues:0Issues:0

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Language:PythonLicense:Apache-2.0Stargazers:6360Issues:0Issues:0

mil_pytorch

Multiple instance learning model implemented in pytorch

Language:PythonStargazers:29Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7522Issues:0Issues:0

PySceneDetect

:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

Language:PythonLicense:NOASSERTIONStargazers:2771Issues:0Issues:0