gsf / ptree

minimal PairTree implementation

Home Page:https://confluence.ucop.edu/display/Curation/PairTree

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The ptree module has two functions to help you work with PairTree identifiers 
and file paths: id2ptree and ptree2id. 

PairTree [1] is an technique from the digital preservation community for 
safely mapping identifiers to file paths, and back again. It can be helpful 
when writing resources to disk so that they can be identified later on by 
merely looking at the filesystem layout.

    >>> import ptree
    >>> ptree.id2ptree("info:lccn/12345678")
    '/in/fo/+l/cc/n=/12/34/56/78/'
    >>> ptree.ptree2id('/in/fo/+l/cc/n=/12/34/56/78/')
    u'info:lccn/12345678'

ptree draws from Ben O'Steen's PairTree Python module [2], which provides a 
lot more functionality for storing bitstreams on disk. ptree intentionally  
focuses soley on the identifier/filepath mapping, and leaves IO operations up 
to you. The unit tests were shamlessly stolen from John Kunze's File::PairTree 
CPAN module [3]. 

Author: Ed Summers <ehs@pobox.com>
License: Public Domain

[1] https://confluence.ucop.edu/display/Curation/PairTree
[2] http://pypi.python.org/pypi/Pairtree
[3] http://search.cpan.org/dist/Pairtree/

About

minimal PairTree implementation

https://confluence.ucop.edu/display/Curation/PairTree


Languages

Language:Python 100.0%