leduckhai / wav2graph

Information Retrieval from Audio via Knowledge Graph

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

Please press ⭐ button and/or cite papers if you feel helpful.

wav2graph_pipeline

This repository contains the codebase for the wav2graph paper:

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

https://www.arxiv.org/abs/2408.04174

Project Overview

In the wav2graph paper, we introduce the first framework for supervised learning knowledge graph from speech data. This repository provides the necessary scripts, configurations, and setup instructions to reproduce the experiments discussed in the paper.

Setup

To set up the environment and run the experiments, follow the steps below:

1. Create a Virtual Environment

Before you start, create a Python virtual environment and install the required dependencies.

pip install -r requirements.txt

2. Configure Hugging Face Token

You will need a Hugging Face API token to access certain resources used in this project. Insert your Hugging Face token into the relevant YAML configuration files.

3. Run the Experiments

Once the environment is set up and the configurations are complete, you can run the experiments using the provided script.

sh run.sh

Cite our work

@misc{leduc2024wav2graphframeworksupervisedlearning,
      title={wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech}, 
      author={Khai Le-Duc and Quy-Anh Dang and Tan-Hanh Pham and Truong-Son Hy},
      year={2024},
      eprint={2408.04174},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2408.04174}, 
}

Contact

Core developers:

Khai Le-Duc

University of Toronto, Canada
Email: duckhai.le@mail.utoronto.ca
GitHub: https://github.com/leduckhai

Quy-Anh Dang

VNU University of Science, Vietnam
GitHub: https://github.com/QuyAnh2005
Facebook: https://www.facebook.com/anh.q.dang.5

About

Information Retrieval from Audio via Knowledge Graph

License:MIT License


Languages

Language:Python 90.4%Language:Shell 9.6%