Lukecn1 / TSAuBERT

Text Similarity Assessment using BERT (TSAuBERT)

TSAuBERT uses the context-dependent embeddings obtained from BERT to assess how similar a candidate text is to a reference text, scoring the pair with cosine similarity. Its main application is assessing the similarity between a candidate summary and a reference summary.

Algorithm

The scoring algorithm consists of two central steps:

1) Encoding

Obtain embedding vectors for the candidate and reference texts from a pretrained BERT-based model.

2) Scoring

Calculate the score as the cosine similarity between the candidate and reference embeddings.
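
The two steps above can be sketched as follows. This is a minimal illustration, not the repository's actual code: the names `cosine_similarity`, `score`, and `toy_encode` are hypothetical, and `toy_encode` is a letter-count stand-in used only so the example runs without downloading a model. In practice the `encode` argument would be a function that returns a fixed-size vector derived from a pretrained BERT-based model; how token-level embeddings are combined into that vector is not specified here and is left to the encoder.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def cosine_similarity(u: Vector, v: Vector) -> float:
    """Cosine of the angle between u and v; returns 0.0 if either is all zeros."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def score(candidate: str, reference: str,
          encode: Callable[[str], Vector]) -> float:
    """Encode both texts, then compare the two vectors."""
    # Step 1, encoding: `encode` is assumed to map a text to one vector.
    cand_vec = encode(candidate)
    ref_vec = encode(reference)
    # Step 2, scoring: cosine similarity between the two vectors.
    return cosine_similarity(cand_vec, ref_vec)


def toy_encode(text: str) -> List[float]:
    """Stand-in encoder (NOT BERT): counts of the letters a-z in the text."""
    counts = [0.0] * 26
    for ch in text.lower():
        if "a" <= ch <= "z":
            counts[ord(ch) - ord("a")] += 1.0
    return counts


if __name__ == "__main__":
    candidate = "the cat sat on the mat"
    reference = "a cat was sitting on the mat"
    print(score(candidate, reference, toy_encode))
```

Passing the encoder as an argument keeps the scoring step independent of the model, so a BERT-based encoder can replace the stand-in without changing `score`.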

Installation requirements
