elektito / pyxmlperftest

Benchmarking Python XML parsing libraries.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This program benchmarks a few XML parsing libraries available to Python programmers. In particular, it compares lxml with some of the modules in the xml package in Python standard library.

You need to pass at least one XML file as test input:

$ ./pyxmlperftest.py 1.xml 2.xml 3.xml

Sample final results:

Results:
   xml.dom.minidom: 7.49 MBps
   lxml.etree: 89.63 MBps
   xml.etree.ElementTree.iterparse: 31.77 MBps
   xml.etree.ElementTree: 58.43 MBps
   xml.sax: 25.68 MBps

From what I've seen so far, lxml is superior to all other options by a wide margin, will minidom is the slowest option by far.

This is not a very scientific benchmark, so take the results with a grain of salt.

About

Benchmarking Python XML parsing libraries.


Languages

Language:Python 100.0%