Textual Similarity on MD&A disclosure

Code and data for "Textual Similarity on MD&A disclosure"

Prerequisites

This code is written in python. To use it you will need:

We provide all the similarity scores for the different methods described in the paper along with the data statiscs.

To create the doc2vec model and then use it to find the similar document vectors: run python3 compute_doc2vec_sim.py

The data file can be found here: train_docs_sec7.txt

Code and data: "Temporal Change Analysis from Text for Regression: Application to Financial Disclosure Documents"

Apache License 2.0

Language:Python 100.0%