
🗼 A CLI term frequency analyzer. Counts the number of occurrences of each word in an input and creates formatted output or a histogram.

freq

A commandline tool that counts the number of word occurrences in an input.

This is just a placeholder repository for now. Please create issues for feature requests and collaboration ideas.

Usage

Commandline

echo "b a n a n a" | freq

0.16666667 - 1 - b
0.33333334 - 2 - n
0.5 - 3 - a
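The output above is relative frequency, count, and word, sorted by ascending count. The counting logic can be sketched in Rust roughly like this; `relative_frequencies` is a hypothetical helper for illustration, not part of the actual crate:

```rust
use std::collections::HashMap;

// Hypothetical sketch (not the real freq source): count whitespace-separated
// words and return (word, count, count / total), sorted by ascending count,
// ties broken alphabetically.
fn relative_frequencies(input: &str) -> Vec<(String, usize, f64)> {
    let mut counts: HashMap<&str, usize> = HashMap::new();
    let mut total = 0usize;
    for word in input.split_whitespace() {
        *counts.entry(word).or_insert(0) += 1;
        total += 1;
    }
    let mut entries: Vec<(String, usize, f64)> = counts
        .into_iter()
        .map(|(w, n)| (w.to_string(), n, n as f64 / total as f64))
        .collect();
    entries.sort_by(|a, b| a.1.cmp(&b.1).then(a.0.cmp(&b.0)));
    entries
}

fn main() {
    // "b a n a n a" has 6 words: b once, n twice, a three times.
    for (word, n, rel) in relative_frequencies("b a n a n a") {
        println!("{} - {} - {}", rel, n, word);
    }
}
```

(The CLI example prints the ratio with single-precision rounding; this sketch uses f64, so the printed digits differ slightly.)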

Library

use std::error::Error;

fn main() -> Result<(), Box<dyn Error>> {
    let frequencies = freq::count("fixtures/sample.txt")?;
    println!("{:?}", frequencies);
    Ok(())
}
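Since the crate is still a placeholder, a minimal stand-in for `freq::count`, under the assumption that it reads a file and returns per-word counts, might look like this (the real API may well differ):

```rust
use std::collections::HashMap;
use std::error::Error;
use std::fs;

// Hypothetical stand-in for freq::count: read a file and return a map
// from each word to its number of occurrences.
fn count(path: &str) -> Result<HashMap<String, usize>, Box<dyn Error>> {
    let text = fs::read_to_string(path)?;
    let mut counts: HashMap<String, usize> = HashMap::new();
    for word in text.split_whitespace() {
        *counts.entry(word.to_string()).or_insert(0) += 1;
    }
    Ok(counts)
}

fn main() -> Result<(), Box<dyn Error>> {
    fs::write("sample.txt", "b a n a n a")?; // demo input file
    let frequencies = count("sample.txt")?;
    println!("{:?}", frequencies);
    Ok(())
}
```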

Features

  • Ignore words (regex pattern) [issue 5]
  • Different output formats (plaintext, JSON)
  • freq.toml configuration file
  • Filter stopwords (similar to NLTK's stopwords)
  • Performance (SIMD support, async execution)
  • Recursion support
  • Allow skipping files
  • Allow specifying ignored words in a separate file
  • Generate "heat bars" for words like shell-hist does
  • Split report by file/folder (sort of like sloc does for code)
  • Choose language for stopwords (--lang fr)
  • Format output (e.g. justify counts a la uniq -c)
  • Interactive mode (shows stats while running) (--interactive)
  • Calculate TF-IDF score in a multi-file scenario
  • Limit the output to the top N words (e.g. --top 3)
  • Ignore hidden files (begins with .)
  • Minimize number of allocations
  • No-std support?
  • Ignore "words" only consisting of special characters, e.g. ///
  • Multiple files as inputs
  • Glob input patterns
  • If directory is given, walk contents of folder recursively (walker)
  • Verbose output (show currently analyzed file etc)
  • Library usage
  • Concurrent counting (e.g. with https://github.com/jonhoo/evmap)
  • Automated abstract generation with Luhn's algorithm (Issue #1)
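Two of the listed features, stopword filtering and a top-N limit, together with skipping "words" that consist only of special characters, could be sketched as follows; `top_words` and its exact behavior are assumptions for illustration, not the planned interface:

```rust
use std::collections::{HashMap, HashSet};

// Sketch of stopword filtering plus a --top N limit (hypothetical helper):
// trim non-alphanumeric characters, drop stopwords and empty leftovers
// (e.g. "///"), then return the N most frequent words.
fn top_words<'a>(
    text: &'a str,
    stopwords: &HashSet<&str>,
    top: usize,
) -> Vec<(&'a str, usize)> {
    let mut counts: HashMap<&str, usize> = HashMap::new();
    for word in text.split_whitespace() {
        let w = word.trim_matches(|c: char| !c.is_alphanumeric());
        if w.is_empty() || stopwords.contains(w) {
            continue; // skips stopwords and special-character-only "words"
        }
        *counts.entry(w).or_insert(0) += 1;
    }
    let mut entries: Vec<_> = counts.into_iter().collect();
    // Sort by descending count, ties broken alphabetically, then cut to N.
    entries.sort_by(|a, b| b.1.cmp(&a.1).then(a.0.cmp(b.0)));
    entries.truncate(top);
    entries
}
```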

Idea contributors: James Munns

Similar tools

tot-up

Similar tool written in Rust with nice graphical output https://github.com/payload/tot-up

uniq

A basic version would be:

curl -L 'https://github.com/mre/freq/raw/main/README.md' | tr -cs '[:alnum:]' "\n" | grep -vEx 'and|or|for|a|of|to|an|in' | sort | uniq -c | sort

This works, but it's not very extensible by normal users. It would also lack most of the features listed above.

Lucene

Has all the bells and whistles, but there is no official CLI interface, and it requires a full Java installation.

wordcount

word <tab> freq

Nice and simple. Doesn't exclude stopwords and has no regex support, though. https://github.com/juditacs/wordcount

word-frequency

Haskell-based approach: includes features like a minimum word length or a minimum number of occurrences in a text. https://github.com/cbzehner/word-frequency

What else?

There must be more tools out there. Can you help me find them?

About

License: Apache License 2.0


Languages

Rust 100.0%