measiala / PUMS_Data_Dictionary

Python project to process a weakly structured docx file and produce a structured, 508 compliant docx file, a txt version, and a machine readable version.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PUMS_Data_Dictionary

This is an update to the previous project PUMS_Data_Dictionary. It greatly improves both the object-oriented approach and couples it with more robust testing using py.test. This will be the finished version to hand off to others.

To-Do List

[x] Complete the handling of record types and the machine readable file.

[x] Complete final write out of files including testing.

[x] Obtain final layout of input file to include variable type

[x] Add tailored log output

[ ] Obtain final layout of machine readable file (improved)

[x] Neaten up test cases

[ ] Remove unused code

Uses the following modules:

argparse os re logging textwrap docx* py.test*

*Not installed by default

About

Python project to process a weakly structured docx file and produce a structured, 508 compliant docx file, a txt version, and a machine readable version.

License:MIT License


Languages

Language:Python 100.0%