Simple command-line scripts for manipulating text files.
Using:
git clone https://github.com/gordonbrander/txttools
cd txttools
pip3 install -e .
- txt_clean_whitespace: clean up junk whitespace.
- txt_combine: combine contents of files with a separator.
- txt_ext: assign a new file extension to a batch of files.
- txt_textiness: filter chunks of text based on "textiness". Meant for cleaning out junk text after scraping.
- txt_unwrap: unwrap text in a file.
- txt_sep: add a separator between chunks of stdin.
Run <cmd> --help for usage info on each command.
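To give a rough idea of what a "textiness" filter can look like, here is a minimal Python sketch. It is not the actual txt_textiness implementation: the scoring rule (share of letters and whitespace in a chunk), the blank-line chunking, the 0.8 threshold, and the function names are all assumptions made for this example.

```python
import re
from typing import List


def textiness(chunk: str) -> float:
    """Hypothetical score: fraction of characters that are letters or whitespace."""
    if not chunk:
        return 0.0
    texty = sum(1 for ch in chunk if ch.isalpha() or ch.isspace())
    return texty / len(chunk)


def filter_chunks(text: str, threshold: float = 0.8) -> List[str]:
    """Split text on blank lines and keep chunks scoring at or above threshold."""
    chunks = [c.strip() for c in re.split(r"\n\s*\n", text) if c.strip()]
    return [c for c in chunks if textiness(c) >= threshold]


if __name__ == "__main__":
    scraped = "Hello world, this is prose.\n\n{{}}<<>>||1234\n\nAnother paragraph here."
    for chunk in filter_chunks(scraped):
        print(chunk)
```

A character-ratio score like this is crude but cheap. It tends to drop chunks dominated by markup or numbers while keeping ordinary prose.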
TODO
- txt_filter_files: filter file names based on the contents of each file. Useful for running reports, combining files, etc.
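
One possible shape for txt_filter_files, sketched in Python. Nothing here exists in the repo yet: the filter_files name, the regex-based matching, and the choice to skip unreadable files are assumptions for illustration only.

```python
import re
from pathlib import Path
from typing import Iterable, Iterator


def filter_files(paths: Iterable[str], pattern: str) -> Iterator[str]:
    """Yield each path whose file contents match the regex pattern (hypothetical helper)."""
    regex = re.compile(pattern)
    for path in paths:
        try:
            # Replace undecodable bytes so one bad file does not stop the run.
            text = Path(path).read_text(encoding="utf-8", errors="replace")
        except OSError:
            # Skip files that are missing or unreadable.
            continue
        if regex.search(text):
            yield path


if __name__ == "__main__":
    import sys

    # Usage (assumed): file names on stdin, pattern as the first argument.
    names = (line.strip() for line in sys.stdin if line.strip())
    for name in filter_files(names, sys.argv[1]):
        print(name)
```

Reading file names from stdin and printing the matches would let the output feed straight into the other tools, such as txt_combine.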