gmx / transcript-xml

Myanmar Parliamentary Transcript XML Repository


Myanmar Parliamentary Transcripts Data Repository by The Ananda

This is the open data repository for the digitized parliamentary transcripts from

File Names

File names contain six digits, for example 02-16-01.xml. The first two digits stand for the Term, the middle two for the Session, and the last two for the Sitting Day. So 02-16-01.xml means Second Term, 16th Session, First Sitting Day.
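The naming scheme above can be decoded mechanically. The following is a minimal Python sketch, not part of the repository itself; the names `SittingId` and `parse_transcript_filename` are hypothetical and chosen only for illustration.

```python
import re
from typing import NamedTuple


class SittingId(NamedTuple):
    """Term, Session and Sitting Day decoded from a file name (hypothetical helper type)."""
    term: int
    session: int
    sitting_day: int


# Two digits, dash, two digits, dash, two digits, then ".xml", e.g. "02-16-01.xml".
_FILENAME_RE = re.compile(r"(\d{2})-(\d{2})-(\d{2})\.xml")


def parse_transcript_filename(name: str) -> SittingId:
    """Split a name like '02-16-01.xml' into its three numeric parts."""
    match = _FILENAME_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"not a TT-SS-DD.xml file name: {name!r}")
    term, session, sitting_day = (int(group) for group in match.groups())
    return SittingId(term, session, sitting_day)


print(parse_transcript_filename("02-16-01.xml"))
# SittingId(term=2, session=16, sitting_day=1)
```

Names that do not follow the pattern raise `ValueError`, which makes stray files easy to spot when scanning a directory.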

Directory Structure

Root directories are divided by term; within each term, files are further divided by House and file type. The directory structure is as follows.

  • First Term
    • lower
      • xml (First Term Lower House)
    • upper
      • xml (First Term Upper House)
  • Second Term
    • lower
      • xml (Second Term Lower House)
    • upper
      • xml (Second Term Upper House)
    • union
      • xml (Second Term Union House)

Digitization Process

A draft paper on the digitization process is available here

Contact Us:

Terms of Use:

  1. This data repository is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) by The Ānanda.

  2. Attribute the data as "Parliamentary Transcripts By The Aananda" and include the URL: https://github.com/theananda/transcript-xml/

  3. There is no warranty, and we do not guarantee that the dataset is 100% accurate. Here is the list of available files in this repository. Please always cross-check with the original PDF files and always cite the respective Hluttaw webpages.
