ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"


Unused character embeddings?

g-milis opened this issue · comments

I noticed that since each utterance is converted to a phoneme sequence, the character embeddings are never used. A quick visualization with 2D PCA shows that the embeddings corresponding to A-Z and a-z seem random, while the phoneme embeddings have a meaningful structure. Is that intentional?

[Figure: PCA of TTS character embeddings]
[Figure: PCA of TTS phoneme embeddings]
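The observation above can be illustrated with a minimal, hypothetical sketch: the toy lexicon and symbol table below are invented for illustration (the real repo builds its symbol set from a full CMUdict-style lexicon), but they show why the letter entries in a combined symbol table are never indexed once every word is mapped to phonemes before the embedding lookup.

```python
# Hypothetical toy lexicon: each word is converted to phonemes BEFORE the
# embedding lookup, so the input IDs only ever cover the phoneme portion
# of the symbol table -- the character rows of the embedding stay untouched.
lexicon = {"hello": ["HH", "AH0", "L", "OW1"], "world": ["W", "ER1", "L", "D"]}

# Invented symbol table: padding, then phonemes, then characters.
phonemes = sorted({p for phones in lexicon.values() for p in phones})
symbols = ["PAD"] + phonemes + list("abcdefghijklmnopqrstuvwxyz")
symbol_to_id = {s: i for i, s in enumerate(symbols)}

def text_to_ids(text):
    """Map an utterance to symbol IDs via the phoneme lexicon."""
    ids = []
    for word in text.lower().split():
        ids.extend(symbol_to_id[p] for p in lexicon[word])
    return ids

ids = text_to_ids("hello world")
print(ids)  # → [4, 1, 5, 6, 7, 3, 5, 2] -- all phoneme IDs, no letter IDs
```

Since every ID falls in the phoneme range, the rows of the embedding matrix corresponding to a–z receive no gradient and stay at their random initialization, which matches the PCA plots.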

Hi, sorry to bother you. I'm a student majoring in Linguistics and NLP. I've recently been facing a challenge extracting feature vectors for phonemes and characters, and I would like to know whether there is a pre-trained model that yields a feature vector for each phoneme and character. The "PCA of TTS Phonemes/Characters" plots you provided seem close to what I need!
Do you have any suggestions?

@wabmhnsbn the visualizations above are 2D projections (via PCA) of the trainable 256-dimensional embeddings defined in the model's encoder. I just accessed them with `model.encoder.src_word_emb.weight`. Note that the model has to be trained; otherwise the embeddings will be random.
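A minimal sketch of that extraction-plus-projection step. The attribute path `model.encoder.src_word_emb.weight` is the one quoted above; the vocabulary size of 361 is a hypothetical stand-in, and a random matrix replaces the trained checkpoint so the sketch is self-contained (PCA is done here via SVD on the centered matrix rather than scikit-learn).

```python
import numpy as np

# Stand-in for the trained embedding table. In the real repo you would load
# a FastSpeech2 checkpoint and do:
#   embeddings = model.encoder.src_word_emb.weight.detach().cpu().numpy()
# Here: 361 symbols (hypothetical count) x 256 dimensions, randomly filled.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(361, 256))

# PCA by hand: center the rows, take the top-2 right singular vectors.
centered = embeddings - embeddings.mean(axis=0)
_, _, Vt = np.linalg.svd(centered, full_matrices=False)
coords_2d = centered @ Vt[:2].T  # one 2D point per symbol

print(coords_2d.shape)  # → (361, 2)
```

Each row of `coords_2d` can then be scatter-plotted and labeled with its symbol; with a trained checkpoint the phoneme points cluster while the unused character points scatter randomly, as in the figures above.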


Okay, I'll give it a try. Thank you!

Great question! I have the same one.