sts10 / tidy

Combine and clean word lists

Home Page:https://sts10.github.io/2021/12/09/tidy-0-2-0.html

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Add boolean attributes for presence of digits, symbols, or non-ASCII characters

sts10 opened this issue · comments

Might be useful when auditing word lists.

commented

Maybe this should be another issue, but wondering if this could be extended to support alphabets of different languages? Greek letters may be treated as symbols in languages that use a Latin alphabet, but in Greek you definitely want to use those characters.

There's so many things to check!

I'm trying to think bigger here: If we keep adding attributes, I think it might be best to separate these functions into two different tools. I'm thinking Tidy can be for editing lists, while a new tool can be strictly for printing information about a list (auditing).

Spent this afternoon trying to pull out the list information code from Tidy and put it in a new tool I'm calling WLA for now.

commented

I wonder if it would make sense to have a library and different cli tools around it. For context: I imagined this not just as a feature to display information but as a filter