can we synthesis speaker-A's tone with speaker-B's prosody?

Question

can we synthesis speaker-A's tone with speaker-B's prosody?

niu0717 opened this issue 5 years ago · comments

when i read gst paper, i found it contains not only the token but also the tone of the speaker. In other word, can we separate prosody from the ref audio as much as possible?