TensorFlow code and pre-trained models for "A Dynamic Word Representation Model Based on Deep Context". The model combines ideas from BERT with ELMo's deep contextualized word representations.
https://yuanxiaosc.github.io/2018/11/27/Bidirectional_Encoder_Representations_Transformers/
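As a rough illustration of the ELMo side of this combination: ELMo builds each word's representation as a softmax-weighted sum of the outputs of all encoder layers, scaled by a scalar. The sketch below shows only that layer-mixing step in plain Python. The names `scalar_mix`, `layer_outputs`, `layer_logits`, and `gamma` are hypothetical and are not taken from this repository, and applying the mix to BERT-style Transformer layers is an assumption about how the two ideas might be combined, not a description of this code base.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def scalar_mix(layer_outputs, layer_logits, gamma=1.0):
    """ELMo-style mix of per-layer token vectors (hypothetical helper).

    layer_outputs: nested list shaped [num_layers][seq_len][hidden]
    layer_logits:  list of num_layers unnormalized layer weights
    gamma:         scalar applied to the mixed representation
    Returns a nested list shaped [seq_len][hidden].
    """
    weights = softmax(layer_logits)
    seq_len = len(layer_outputs[0])
    hidden = len(layer_outputs[0][0])
    mixed = [[0.0] * hidden for _ in range(seq_len)]
    for weight, layer in zip(weights, layer_outputs):
        for t in range(seq_len):
            for h in range(hidden):
                mixed[t][h] += gamma * weight * layer[t][h]
    return mixed


# Toy input: two layers, one token, hidden size 2.
toy_layers = [[[0.0, 2.0]], [[2.0, 4.0]]]
# Equal logits give each layer the same weight, so the mix is a plain average.
toy_mixed = scalar_mix(toy_layers, [0.0, 0.0])
```

In a trained model the layer logits and `gamma` would be learned parameters, so each downstream task can weight shallow and deep layers differently.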