Processing GISAID SARS-CoV-2 emerging variants data and generating a language model using Transformers. The model semantically captures masked glycoprotein amino acid mutations in emerging variants of the SARS-CoV2-virus and gives appropriate scores for each potential aa change.