heng-hw's repositories

SpaCap3D

[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)

V2A-Mapper

[AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models

License:NOASSERTIONStargazers:8Issues:1Issues:1