scotthaleen / concordance-clj

Clojure application that prints concordance of english text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Concordance

Creates an alphabetic list of all word occurrences labeled with the frequency and sentences the word occurred.

Uses instaparse to describe tokenization in EBNF

Build Status

Installation

Download from the latest release

Usage

$ java -jar concordance-*-standalone.jar [TEXT_FILE]

Build

To build an executable jar, run:

$ lein uberjar

This will produce an executable jar in the target directory.
Example: target/concordance-0.1.0-d0fa75f-standalone.jar

Test

$ lein test

Examples

$ cat examples/helloworld.txt
Hello World! The world is round.

$ java -jar concordance-*-standalone.jar examples/helloworld.txt
a.	hello	{1:1}
b.	is	{1:2}
c.	round	{1:2}
d.	the	{1:2}
e.	world	{2:1,2}


$ java -jar concordance-*-standalone.jar examples/alice-in-wonderland.txt

...
www.    accept  {1:1656}
xxx.    acceptance      {1:263}
yyy.    accepted        {2:1737,1740}
zzz.    accepting       {1:1736}
aaaa.   access  {10:1658,1662,1667,1673,1677,1680,1682,1685,1685,1690}
bbbb.   accessed        {1:1673}
cccc.   accessible      {1:1730}
dddd.   accident        {2:1529,1533}
eeee.   accidentally    {1:1527}
ffff.   accordance      {2:1691,1709}
gggg.   account {1:1041}
...

License

Copyright © 2017 ☕

Distributed under the Eclipse Public License either version 1.0 or (at your option) any later version.

About

Clojure application that prints concordance of english text

License:Eclipse Public License 1.0


Languages

Language:Clojure 100.0%