quimcastella / wordstorms

Create multiples of word clouds to compare documents visually

Home Page:http://wordstorms.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Word Storm

The objective of this project is to generate word storms, multiples of word clouds to afford visual comparison of groups of documents. Just as a storm is a group of clouds, a word storm is a group of word clouds. Each cloud in the storm represents a subset of the corpus. For example, a storm might contain one cloud per document, or alternatively one cloud to represent all the documents written in each year, or one cloud to represent each track of an academic conference, etc.

Word Storm

In order to make the clouds easy to compare, words that appear in different clouds are placed in similar locations and more informative words are emphasized. In this way, similar documents are be represented by visually similar clouds.

Create a Word Storm

This project allows you to create word storms from your texts, giving you options to personalize their appearance, so you can create beautiful figures to analyze your documents.

The algorithms to generate the storm are build on top of WordCram (web, gitHub), a flexible word cloud generator that creates beautiful clouds using the popular Wordle's layout.

Usage

  1. Set the path to the text files using 'io.DataPath.java'. The project takes the data from 'datapath/input/folder' and returns the storm in 'datapath/output/folder/parameters'
  2. Decide the cloud's size in 'AppletConf'.
  3. Decide the storm configuration using 'StormConf' (number of files, number of words, colors, angles...)
  4. Create a Storm!

Examples

See examples of usage in 'src/wslauncher':

  1. 'StormLauncher.java' to create a coordinated word storm.
  2. 'IndepLauncher.java' to create independent clouds.

About

Create multiples of word clouds to compare documents visually

http://wordstorms.org

License:Apache License 2.0


Languages

Language:Java 99.9%Language:Shell 0.1%