YuanGongND / ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Eval_metrics

joemzhao opened this issue · comments

Hi Yuan, thanks a lot for the nice work.

It seems the evaluation scripts for computing cider/spice scores are missing. For example:
"""
from eval_metrics import evaluate_metrics
"""
(https://github.com/YuanGongND/ltu/blob/main/src/ltu/eval/eval_audiocaps.py#L14)

could you share about this? Thank you very much

hi there,

Thanks for pointing this out.

The script depends on this package: https://github.com/audio-captioning/caption-evaluation-tools (not our work, but metrics script used by DCASE).

We share our script just to show how we postprocess the prediction. The evaluation part is standard.

-Yuan

Understood -- thanks for the help!

Let me know if there's any problem running the script. I think you should be able to run it if you put our script in the same dir of that package.

-Yuan

Hi Yuan -- thanks for the kind help! Yes, I can run the scripts to obtain the scores properly.