pirxus / personalVAD

An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Issues with ntss and tss confusion

HolgerBovbjerg opened this issue · comments

Hi,

I am currently trying to replicate the results from the Personal VAD paper, and I am having some issues with the model not properly distinguishing between target speaker and non-target speaker speech, with a heavy bias towards target speaker speech.
Have you had any such issues?

Best regards,
Holger Severin Bovbjerg