DISCOSUMO's repositories
featureextraction
Feature extraction scripts for the DISCOSUMO project, to be used for extractive summarization of discussion threads.
evaluation
Evaluation and agreement scripts for the DISCOSUMO project. Each evaluation script takes both manual annotations and automatic summarization output as input. The formatting of these files is highly project-specific. However, the evaluation functions for precision, recall, ROUGE, Jaccard, Cohen's kappa and Fleiss' kappa may be applicable to other domains too.
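To illustrate the kind of metrics named above, here is a minimal sketch of precision, recall, Jaccard and Cohen's kappa in plain Python. It is not taken from the evaluation repository: the function names, and the assumption that a summary is represented as a collection of selected post IDs and that annotations are per-post labels, are hypothetical and chosen only for illustration.

```python
def precision_recall(selected, reference):
    """Precision and recall of selected post IDs against a reference selection.

    Hypothetical helper: assumes summaries are collections of post IDs.
    """
    selected, reference = set(selected), set(reference)
    overlap = len(selected & reference)
    precision = overlap / len(selected) if selected else 0.0
    recall = overlap / len(reference) if reference else 0.0
    return precision, recall


def jaccard(a, b):
    """Jaccard similarity of two selections; two empty selections count as identical."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items in the same order."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty item list")
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(x == y for x, y in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each annotator's label proportions, summed over labels.
    categories = set(labels_a) | set(labels_b)
    expected = sum(
        (labels_a.count(c) / n) * (labels_b.count(c) / n) for c in categories
    )
    if expected == 1.0:
        # Both annotators used a single identical label; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1 - expected)
```

For example, with an automatic selection [1, 2, 3] and a reference selection [2, 3, 4, 5], two of the three selected posts are correct and two of the four reference posts are found. ROUGE and Fleiss' kappa are left out of this sketch.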
query-based_summarization
This repository contains a module for query-focused summarization of discussion threads in the DISCOSUMO project.
annotation
Tool to convert a thread in DISCOSUMO XML format to HTML for the purpose of creating reference summaries.
dataconversion
This repository contains the scripts for converting forum data used in the DISCOSUMO project to the unified XML format (defined in forumthread.dtd).