hekj

Institute of Automation, Chinese Academy of Sciences

BEIJING, CHINA

Keji's repositories

FDA

Official implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS 2023)

Language: Python · 11 · 3 · 2

Landmark-RxR

A human-annotated, fine-grained dataset for Vision-and-Language Navigation

awesome-embodied-vision

Reading list for research topics in embodied vision

License: MIT · 1 · 0 · 0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License: MIT · 0 · 0 · 0

Awesome-Multimodal-Research

A curated list of multimodal-related research.

Language: Python · License: MIT · 0 · 0 · 0

cvpr-latex-template

Extended LaTeX template for CVPR/ICCV papers

Language: TeX · 0 · 0 · 0

Recurrent-VLN-BERT

Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation

Language: Python · 0 · 0 · 0

RxR

Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi and Telugu, and 126k navigation-following demonstrations. Both annotation types include dense spatiotemporal alignments between the text and the visual perceptions of the annotators.

Language: Python · License: CC-BY-4.0 · 0 · 0 · 0

VLN-BEVBert

[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"

Language: Python · 0 · 0 · 0