deep-learning attention-mechanism transformer d2l-attention-mechanisms attention d2l tensorflow

TensorFlow 2 code for the D2L book's Attention Mechanisms chapter

This code has been merged into the D2L book. See PRs 1756 and 1768.

This repository contains TensorFlow 2 code for the Attention Mechanisms chapter of the Dive into Deep Learning (D2L) book. The chapter has 7 sections, and the code for each section can be found at the following links.

10.1. Attention Cues

10.2. Attention Pooling: Nadaraya-Watson Kernel Regression

10.3. Attention Scoring Functions

10.4. Bahdanau Attention

10.5. Multi-Head Attention

10.6. Self-Attention and Positional Encoding

10.7. Transformer

Additional sections:

9.7. Sequence to Sequence Learning

9.6. Encoder Decoder Architecture

Additional chapters:

Chapter 17: Generative Adversarial Networks

How to run the code:

The best way (in my opinion) is to clone the repo (or download and extract the zipped repo) and then run each notebook from the cloned (or extracted) folder. All the notebooks should run as-is.
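Before opening the notebooks, it can help to confirm that the environment has TensorFlow 2 and that the notebooks are where you expect them. The following is a minimal sketch, not part of the repository: the helper names `tensorflow_version` and `list_notebooks` are hypothetical, and it assumes TensorFlow was installed under the package name `tensorflow` (variants published under other package names would not be detected).

```python
from importlib import metadata
from pathlib import Path


def tensorflow_version():
    """Return the installed TensorFlow version string, or None if absent.

    Assumption: TensorFlow is installed under the package name "tensorflow".
    """
    try:
        return metadata.version("tensorflow")
    except metadata.PackageNotFoundError:
        return None


def list_notebooks(folder="."):
    """Return the sorted .ipynb files under `folder`, skipping checkpoints."""
    return sorted(
        p for p in Path(folder).rglob("*.ipynb")
        if ".ipynb_checkpoints" not in p.parts
    )


if __name__ == "__main__":
    version = tensorflow_version()
    if version is None:
        print("TensorFlow is not installed in this environment.")
    elif not version.startswith("2."):
        print(f"Found TensorFlow {version}; the notebooks target TensorFlow 2.")
    else:
        print(f"Found TensorFlow {version}.")

    # Run from the cloned (or extracted) folder to see the available notebooks.
    for notebook in list_notebooks("."):
        print(notebook)
```

Run the script from the cloned (or extracted) folder; the printed paths are the notebooks you can then open in Jupyter.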

Note: I claim no originality for the code. Credit goes to the authors of this excellent book. However, all errors and omissions are my own, and readers are encouraged to bring them to my notice. Finally, to the best of my knowledge, no TensorFlow code was available for the Attention Mechanisms chapter when this repo was first made public.
