vr25 / temp_change_text_reg

Code and data: "Temporal Change Analysis from Text for Regression: Application to Financial Disclosure Documents"

Textual Similarity on MD&A disclosure

Code and data for "Textual Similarity on MD&A disclosure"

Prerequisites

This code is written in Python. To use it you will need:

Getting started

We provide the similarity scores for each of the methods described in the paper, along with the data statistics.

To create the doc2vec model and then use it to find similar document vectors, run: python3 compute_doc2vec_sim.py
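The README does not show the internals of compute_doc2vec_sim.py, so the following is only a minimal sketch of the similarity step, assuming documents have already been turned into fixed-length vectors (for example by a doc2vec model). The function names, document IDs, and the tiny 3-dimensional vectors are hypothetical placeholders, and cosine similarity is assumed as the scoring measure rather than confirmed from the script.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: an all-zero vector matches nothing.
        return 0.0
    return dot / (norm_u * norm_v)


def rank_by_similarity(query_id, vectors):
    """Return (doc_id, score) pairs for every other document, most similar first."""
    query = vectors[query_id]
    scores = [
        (doc_id, cosine_similarity(query, vec))
        for doc_id, vec in vectors.items()
        if doc_id != query_id
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical vectors standing in for doc2vec output; real vectors would
# come from the trained model and have many more dimensions.
doc_vectors = {
    "firm_a_2014": [0.9, 0.1, 0.3],
    "firm_a_2015": [0.8, 0.2, 0.3],
    "firm_b_2015": [-0.2, 0.9, 0.1],
}

ranking = rank_by_similarity("firm_a_2014", doc_vectors)
for doc_id, score in ranking:
    print(f"{doc_id}\t{score:.3f}")
```

In this toy data the two filings from the same hypothetical firm rank closest to each other, which is the kind of year-over-year comparison a textual-similarity score is meant to capture.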

The data file can be found here: train_docs_sec7.txt

About

License:Apache License 2.0
