AudioNTT2020Feature

Question

AudioNTT2020Feature

Myblackcat0216 opened this issue 9 months ago · comments

Hello, what is the AudioNTT2020Feature from byol_a.models import AudioNTT2020Feature in byola_extract_lavdf.py
Traceback (most recent call last):
File "/home/UMMAFormer-main/bylo-a/byola_extract_lavdf.py", line 3, in
from byol_a.models import AudioNTT2020Feature
ImportError: cannot import name 'AudioNTT2020Feature' from 'byol_a.models' (/home/UMMAFormer-main/bylo-a/byol_a/models.py)
Another question is what parameters need to be changed when extracting audio features using byol-a？
Thank you for your excellent work and look forward to your reply

Rui Zhang · Answer 1 · Tue Nov 21 2023 17:22:18 GMT+0800 (China Standard Time)

Thank you for your attention. If you want to use BYOL-A to extract features, you need to use the code available at https://github.com/nttcslab/byol-a. What I provided is just a basic example of calling the relevant code.

Daisuke Niizumi · Answer 2 · Wed Nov 22 2023 07:36:56 GMT+0800 (China Standard Time)

@ymhzyj Dear author, thank you for using BYOL-A and excuse me for cutting in.
Is AudioNTT2020Feature the same as AudioNTT2020Task6? That could be the answer to the question.
https://github.com/nttcslab/byol-a/blob/master/byol_a/models.py#L48

@Myblackcat0216 I think you can use AudioNTT2020Task6 if you want to extract audio features for each time frame.
If you want a single feature vector for an audio clip, please try AudioNTT2020 instead.

Myblackcat0216 · Answer 3 · Wed Nov 22 2023 14:52:05 GMT+0800 (China Standard Time)

Thank you very much for your help. It solved my problem