Add self-attention encoder
Adamits opened this issue
With the decoupling of encoders and decoders, we have added a Linear encoder, which seems to just embed the inputs and pass them along. We should also add a SelfAttention encoder, which encodes the embeddings with a self-attention layer (and no positional encoding). This contextualizes the embeddings by representing each one as a linear combination of all the embeddings, weighted by its attention to each of them.
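A minimal sketch of what this could look like in PyTorch (the class name `SelfAttentionEncoder` and its constructor arguments are illustrative assumptions, not the library's actual API):

```python
import torch
from torch import nn


class SelfAttentionEncoder(nn.Module):
    """Embeds inputs and contextualizes them with one self-attention layer.

    No positional encoding is applied, so each output position is a
    position-agnostic linear combination of the value projections of
    all input positions, weighted by the attention scores.
    """

    def __init__(
        self,
        vocab_size: int,
        embedding_size: int,
        num_heads: int = 4,
        pad_idx: int = 0,
    ):
        super().__init__()
        self.embedding = nn.Embedding(
            vocab_size, embedding_size, padding_idx=pad_idx
        )
        self.attention = nn.MultiheadAttention(
            embedding_size, num_heads, batch_first=True
        )

    def forward(
        self, symbols: torch.Tensor, pad_mask: torch.Tensor
    ) -> torch.Tensor:
        # symbols: (batch, seq_len) int tensor of symbol indices.
        # pad_mask: (batch, seq_len) bool tensor, True at padding positions.
        embedded = self.embedding(symbols)
        # Self-attention: queries, keys, and values are all the embeddings.
        contextualized, _ = self.attention(
            embedded, embedded, embedded, key_padding_mask=pad_mask
        )
        return contextualized
```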
+1. Makes sense.