VipulMascarenhas / DocSummarizer

Node.js Document Summary Statistics Application

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DocSummarizer

Node.js Document Summary Statistics Application

Task:

Utilize Node.js to create a simple command line application that provides summary statistics for a given dataset. This will not require using Node.js as a web server, but rather will use it as a command line application only. The application will take the relative path to a text file (.txt) as a command line argument. The application then processes the text file to generate the following descriptive statistics about the document:

1.	The number of words in the document
2.	The number of lines
3.	A list of all the words with the count of their occurrence, in descending order
4.	A list of the ten most common trigrams
5.	A listing of the edit distances between the most common trigram and the next ten most common trigrams, in order of distance value. This includes the trigrams being compared and the edit distance value.

The output of is a properly formatted HTML file that can be viewed in any modern browser. The name of the HTML report includes the name of the data file provided as input.

About

Node.js Document Summary Statistics Application


Languages

Language:HTML 94.2%Language:JavaScript 5.8%