Inference index info in indentification from trained model

Question

Inference index info in indentification from trained model

Tortoise17 opened this issue 2 months ago · comments

I have tried

python speakerlab/bin/infer_sv.py --model_id $model_id --wavs input.wav

This exports the numpy array file. How can I get the inference info from trained model that this object is corresponding to which speaker number or which speaker in terms of identification?

If you could guide me.

Chen Yafeng · Answer 1 · Fri Jun 21 2024 16:05:07 GMT+0800 (China Standard Time)

https://github.com/modelscope/3D-Speaker/blob/main/speakerlab/bin/infer_sv.py#L290-L291
You can extract the corresponding embeddings according to the order of enroll and test sets in your wav_path.

Tortoise17 · Answer 2 · Fri Jun 21 2024 16:10:09 GMT+0800 (China Standard Time)

@yfchenlucky Great, thank you. If I have to match one wav with files form folder, which has multiple. Could this be possible? to make closest top1-2 matches.?

Tortoise17 · Answer 3 · Fri Jun 21 2024 16:18:37 GMT+0800 (China Standard Time)

I guess little change in pipeline will match the possible closest index. Thank you again. If there is still problem, I will request for help.

Chen Yafeng · Answer 4 · Fri Jun 21 2024 16:19:46 GMT+0800 (China Standard Time)

There are many ways. If you don't want to change the code, you can construct wav1, wav2\n wav1 wav3\n and so on, then test pairs and calculate the scores. Or you can simply modify infer_sv.py by fixing the enroll wav and constantly changing the test wav.

Tortoise17 · Answer 5 · Fri Jun 21 2024 16:23:15 GMT+0800 (China Standard Time)

@yfchenlucky Great help. Really excellent work. Super top architecture models.