This is yet another Python library to parse the "My Clippings.txt" files generated by the Amazon Kindle.
The name "gannet" is inspired by the Monty Python "Bookshop" sketch.
To keep the parser flexible and easy to understand, gannet uses the Pyparsing library to parse the individual parts of a "My Clippings.txt" file.
$ pip install -r requirements.txt
import codecs
import sys
from gannet import parse_my_clippings
clippings = []
# Open the clippings file named on the command line as UTF-8 and parse it.
with codecs.open(sys.argv[1], 'r', 'utf-8') as f:
    clippings = parse_my_clippings(f)
$ python clippings-to-json.py "My Clippings.txt"
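The clippings-to-json.py script itself is not reproduced here. As a rough idea of what the JSON step could look like, here is a minimal sketch that uses only the standard library. It assumes each parsed clipping is a mapping (or can be passed to dict()); the objects actually returned by parse_my_clippings may look different, and the helper name clippings_to_json and the "title" and "text" fields are hypothetical.

```python
import json


def clippings_to_json(clippings):
    """Serialize an iterable of clippings to a JSON string.

    Assumption: every clipping is a mapping or can be converted with dict().
    This helper is hypothetical and not part of gannet.
    """
    # ensure_ascii=False keeps non-ASCII characters readable in the output.
    return json.dumps([dict(c) for c in clippings], indent=2, ensure_ascii=False)


# Made-up sample data standing in for the parser's output.
sample = [
    {
        "title": "Ethel the Aardvark Goes Quantity Surveying",
        "text": "A made-up highlight used only for illustration.",
    }
]

print(clippings_to_json(sample))
```

If the parser returns Pyparsing result objects rather than plain mappings, they would probably need to be converted to dictionaries first.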
$ nosetests