Frost (frostinassiky)

Company: ZJU -> KAUST -> Meta

Location: London

Home Page: xumengmeng.com

Frost's starred repositories

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 55816 | Issues: 521 | Issues: 962

fiftyone

Refine high-quality datasets and visual AI models

Language: Python | License: Apache-2.0 | Stargazers: 8704 | Issues: 55 | Issues: 1513

CogVideo

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 7839 | Issues: 118 | Issues: 292

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language: Python | License: NOASSERTION | Stargazers: 2070 | Issues: 17 | Issues: 113

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language: Python | License: MIT | Stargazers: 699 | Issues: 9 | Issues: 58

TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Language: Python | License: MIT | Stargazers: 689 | Issues: 17 | Issues: 18

ResizeRight

The correct way to resize images or tensors, for NumPy or PyTorch (differentiable).

Language: Python | License: MIT | Stargazers: 547 | Issues: 12 | Issues: 13

gtad

The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection

Language: Python | License: Apache-2.0 | Stargazers: 217 | Issues: 7 | Issues: 54

ActionDetection-AFSD

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

Language: Python | License: NOASSERTION | Stargazers: 171 | Issues: 4 | Issues: 60

OpenTAD

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 156 | Issues: 3 | Issues: 29

LDMVFI

[AAAI'2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull

Language: Python | License: MIT | Stargazers: 125 | Issues: 6 | Issues: 25

NExT-QA

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Language: Python | License: MIT | Stargazers: 123 | Issues: 2 | Issues: 27

just-ask

[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 117 | Issues: 5 | Issues: 12

TSP

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)

Language: Python | License: MIT | Stargazers: 107 | Issues: 2 | Issues: 24

CCL

PyTorch implementation of the paper [CVPR 2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning

Language: Python | License: Apache-2.0 | Stargazers: 86 | Issues: 5 | Issues: 8

image-to-recipe-transformers

Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning

Language: Python | License: Apache-2.0 | Stargazers: 81 | Issues: 7 | Issues: 3

Temporal_Query_Networks

The implementation of the CVPR 2021 paper "Temporal Query Networks for Fine-grained Video Understanding"

CodeForInterview

Basic knowledge points compiled for interviews; start reading from summary.md

Language: HTML | Stargazers: 57 | Issues: 0 | Issues: 0

vq2d_cvpr

This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.

Language: Python | License: MIT | Stargazers: 39 | Issues: 5 | Issues: 1

Ego4d_NLQ_2022_1st_Place_Solution

The 1st place solution of 2022 Ego4d Natural Language Queries.

Language: Python | License: MIT | Stargazers: 32 | Issues: 2 | Issues: 2

DiffusionTAD

[ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"

RaNet

Source code of our RaNet (EMNLP 2021)

VLG-Net

VLG-Net: Video-Language Graph Matching Networks for Video Grounding

Language: Python | License: MIT | Stargazers: 30 | Issues: 3 | Issues: 10
Language: HTML | Stargazers: 25 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 24 | Issues: 3 | Issues: 2

bsp

Placeholder for code of BSP.

denoiseloc

The official implementation of DenoiseLoc: Boundary Denoising for Video Activity Localization, ICLR 2024

Language: Python | License: MIT | Stargazers: 5 | Issues: 1 | Issues: 0