onkyo14taro / investigation-phase

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

An Investigation of the Effectiveness of Phase for Audio Classification

About

This repository contains the implementation of "An Investigation of the Effectiveness of Phase for Audio Classification". In this paper, we proposed LEAF-extended, a learnable front-end that can compute the phase and its derivatives. Our code is written by using PyTorch. The original LEAF implementation, which calculates only the amplitude, is available at the following link: https://github.com/google-research/leaf-audio (TensorFlow implementation).

Contents

  • notebooks/: demonstrations, building datasets
    • complexnn.ipynb: the complex batch normalization
    • data.ipynb: a dataset class
    • format_datasets.ipynb: building datasets
    • leaf.ipynb: LEAF-extended
    • phase.ipynb: some phase functions
    • specaug.ipynb: SpecAugment
  • scripts/: Python scripts
  • shells/: example shell scripts

Dependencies

See pyproject.toml.

python = "^3.8"
einops = "^0.3.0"
librosa = "^0.8.1"
matplotlib = "^3.4.2"
numpy = "^1.20.3"
pandas = "^1.2.4"
pyarrow = "^4.0.1"
pytorch-lightning = "^1.3.4"
seaborn = "^0.11.1"
SoundFile = "^0.10.3"
PyYAML = "^5.4.1"
torch = "^1.8.1"

Reference

Link: An Investigation of the Effectiveness of Phase for Audio Classification

@INPROCEEDINGS{9746037,
  author={Hidaka, Shunsuke and Wakamiya, Kohei and Kaburagi, Tokihiko},
  booktitle={ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, 
  title={An Investigation of the Effectiveness of Phase for Audio Classification}, 
  year={2022},
  volume={},
  number={},
  pages={3708-3712},
  doi={10.1109/ICASSP43922.2022.9746037}}

About

License:Apache License 2.0


Languages

Language:Jupyter Notebook 83.8%Language:Python 15.4%Language:Shell 0.8%