Keji (hekj)

hekj

Geek Repo

Company:Institute of Automation, Chinese Academy of Sciences

Location:BEIJING, CHINA

Github PK Tool:Github PK Tool

Keji's repositories

FDA

Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)

Landmark-RxR

A human-annotated, fine-grained dataset for Vision-and-Language Navigation

awesome-embodied-vision

Reading list for research topics in embodied vision

License:MITStargazers:1Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:0Issues:0Issues:0

Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cvpr-latex-template

Extended LaTeX template for CVPR/ICCV papers

Language:TeXStargazers:0Issues:0Issues:0

Recurrent-VLN-BERT

Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation

Language:PythonStargazers:0Issues:0Issues:0

RxR

Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi and Telugu, and 126k navigation following demonstrations. Both annotation types include dense spatiotemporal alignments between the text and the visual perceptions of the annotators

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

VLN-BEVBert

[ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"

Language:PythonStargazers:0Issues:0Issues:0