pilate / pimht

mhtml parser

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pimht

Python mhtml parser

Installation

$ pip install pimht

Example

import pimht

mhtml = pimht.from_filename("test.mhtml")
for part in mhtml:
    print(part)

Performance

The chardet module, used by default, is slow. Performance can be improved by also installing cchardet with:

$ pip install pimht[speedups]

This is aimed specifically at parsing Google Chrome generated snapshots as fast as possible, but feel free to report issues with MHTML files from other sources.

About

mhtml parser

License:MIT License


Languages

Language:Python 97.7%Language:Dockerfile 2.3%