Aditi1692 / TextSummarization

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Text Summarization

  • Aditi Jain
  • Nipun Bayas

Purpose

The aim of this project is to summarize the text present in an HTML page using different algorithms, like the Latent Dirichlet Allocation (LDA) algorithm.

In order to run the summarizer program, use the following command:

python summarizer.py <metric>

where: metric can be N, L, S or A(all)

Classification

We have used the Stanford Dependency Parser to create dependency trees for each opinion. Based on these trees, we aim to extract summaries, based on the words and their part of speech in the paragraph.

The code for classification can be found under classification/parsing_opinions.py

References

About


Languages

Language:HTML 64.8%Language:Python 35.2%