danieljf24

danieljf24

Geek Repo

Company:Zhejiang University

Location:China

Home Page:http://danieljf24.github.io/

Github PK Tool:Github PK Tool

danieljf24's starred repositories

RaTSG

This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"

Language:PythonStargazers:2Issues:0Issues:0

DL-DKD

Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval

Language:PythonStargazers:13Issues:0Issues:0

UmURL

This is a repository contains the implementation of our ACM MM 2023 paper Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding.

Language:PythonLicense:Apache-2.0Stargazers:10Issues:0Issues:0

MLCMR

ChinaMM2023 Best Student Paper Award-多语言文本-视频跨模态检索的新基线模型 | 《计算机学报》收录-面向多语言-视觉公共空间学习的多语言文本-视频 跨模态检索模型

Language:RoffLicense:Apache-2.0Stargazers:5Issues:0Issues:0

RPF

This is a repository contains the implementation of our SIGIR'23 full paper From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion Retrieval.

Language:PythonStargazers:6Issues:0Issues:0

HiCo

This is a repository contains the implementation of our AAAI'23 oral paper Hierarchical Contrast for Unsupervised Skeleton-based Action Representation Learning.

Language:PythonLicense:Apache-2.0Stargazers:33Issues:0Issues:0

awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

Stargazers:588Issues:0Issues:0

nrccr

Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0

ms-sl

Source code of our MM'22 paper Partially Relevant Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:51Issues:0Issues:0

rivrl

Source code of our TCSVT'22 paper Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:19Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0
Language:PythonStargazers:33Issues:0Issues:0

CBLN

Code for CVPR 2021 paper: Context-aware Biaffine Localizing Network for Temporal Sentence Grounding

License:MITStargazers:20Issues:0Issues:0

hybrid_space

Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

Language:PythonLicense:Apache-2.0Stargazers:87Issues:0Issues:0

awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

Stargazers:1138Issues:0Issues:0
Language:PythonStargazers:31Issues:0Issues:0

zju-icicles

浙江大学课程攻略共享计划

Language:HTMLStargazers:37154Issues:0Issues:0

virtex

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Language:PythonLicense:MITStargazers:557Issues:0Issues:0

PyRetri

Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥

Language:PythonLicense:Apache-2.0Stargazers:1165Issues:0Issues:0

ClassyVision

An end-to-end PyTorch framework for image and video classification

Language:PythonLicense:MITStargazers:1590Issues:0Issues:0

pytorch-template

PyTorch deep learning projects made easy.

Language:PythonLicense:MITStargazers:4728Issues:0Issues:0

awesome-fashion-ai

A repository to curate and summarise research papers related to fashion and e-commerce

Stargazers:1167Issues:0Issues:0

pytext

A natural language modeling framework based on PyTorch

Language:PythonLicense:NOASSERTIONStargazers:6338Issues:0Issues:0

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonLicense:MITStargazers:8802Issues:0Issues:0

Computer-Vision-Leaderboard

Comparison of famous convolutional neural network models

License:MITStargazers:321Issues:0Issues:0

gcn_metric_learning

Metric Learning with Graph Convolutional Neural Networks

Language:PythonLicense:MITStargazers:202Issues:0Issues:0

twostreamfusion

Code release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.

Language:CudaLicense:NOASSERTIONStargazers:714Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

tirg

deep learning, image retrieval, vision and language

Language:PythonLicense:Apache-2.0Stargazers:296Issues:0Issues:0

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:5489Issues:0Issues:0