marxav / gipfa

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GIPFA

This repository is the official implementation of GIPFA: Generating IPA Pronunciation from Audio.

Requirements

  • Require Ubuntu 18.04, Python 3.6+.

  • Download files on your machine

  • Go to the gipfa main directory

    • cd gipfa
  • Create a virtual environment

    • python3 -m venv gipfa
  • Activate virtual environment

    • source gipfa/bin/activate
  • Load the python librairies needed for GIPFA (e.g. numpy, pandas, torch...) from the requirements file

    • python -m pip install --upgrade pip
    • python -m pip install --upgrade setuptools
    • python -m pip install -r requirements.txt
  • Download the dataset (size=1.2Go)

    • python3 download_dataset.py

Evaluation and Training

In order to run this code, you need to:

  • Run the gipfa.ipynb in order to create the ANN model, train it, test it and display the results of the paper.

Results

GIPFA achieves the following performance:

Tested samples Mean Accuracy
1000 75.0

The detailed performance is availaible in this CSV file (with '\t' as a separator)

License

GIPFA is released under the CC BY-SA 4.0.

Citation

The pipeline is described in the following paper:

@misc{marjou2020gipfa,
    title={GIPFA: Generating IPA Pronunciation from Audio},
    author={Xavier Marjou},
    year={2020},
    eprint={2006.07573},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

About


Languages

Language:Jupyter Notebook 99.6%Language:Python 0.4%