There are 0 repositories under the multimodal-action-recognition topic.
Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.
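
As a rough illustration of what selecting an action from voice and text inputs could look like, here is a minimal late-fusion sketch. It is not taken from the repository: the function name `select_action`, the `voice_weight` parameter, and the action labels are all hypothetical, and it assumes each modality has already been turned into per-action confidence scores by some upstream model.

```python
from typing import Dict


def select_action(
    voice_scores: Dict[str, float],
    text_scores: Dict[str, float],
    voice_weight: float = 0.5,
) -> str:
    """Return the action with the highest weighted score across both modalities.

    Hypothetical helper: the scores are assumed to come from separate
    voice and text models that are outside the scope of this sketch.
    """
    if not 0.0 <= voice_weight <= 1.0:
        raise ValueError("voice_weight must be between 0 and 1")

    actions = set(voice_scores) | set(text_scores)
    if not actions:
        raise ValueError("at least one modality must provide scores")

    # Late fusion: weighted average of the per-modality scores.
    # An action missing from one modality contributes 0.0 for that modality.
    fused = {
        action: voice_weight * voice_scores.get(action, 0.0)
        + (1.0 - voice_weight) * text_scores.get(action, 0.0)
        for action in actions
    }

    # Iterate in sorted order so ties are broken alphabetically.
    return max(sorted(fused), key=fused.get)


# Made-up scores for illustration only.
voice = {"play_music": 0.6, "stop": 0.3, "set_timer": 0.1}
text = {"play_music": 0.2, "stop": 0.7, "set_timer": 0.1}

equal_weight_choice = select_action(voice, text)
voice_heavy_choice = select_action(voice, text, voice_weight=0.8)
```

With equal weights the text evidence for `stop` outweighs the voice evidence for `play_music`; raising `voice_weight` shifts the decision toward the voice model. Other fusion strategies (for example, feeding both inputs to a single joint model) are equally plausible readings of the description.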