gordonbrander / txttools

Python command-line tools for munging text files

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

txttools

Simple command-line scripts for manipulating text files.

Using:

git clone https://github.com/gordonbrander/txttools
cd txttools
pip3 install -e .

Commands

  • txt_clean_whitespace: clean up junk whitespace.
  • txt_combine: combine contents of files with a separator.
  • txt_ext: assign a new file extension to a batch of files.
  • txt_textiness: filter chunks of text based on "textiness". Meant for cleaning out junk text after scraping.
  • txt_unwrap: unwrap text in a file.
  • txt_sep: add separators between each chunk of stdin

Do <cmd> --help for usage info on each.

TODO

  • txt_filter_files: some way of filtering file names based on contents inside. Use for running reports, combining, etc.

About

Python command-line tools for munging text files

License:MIT License


Languages

Language:Python 100.0%