justicehub-in / justice-hub-docs

Documentation platform for the Justice Hub

Home Page: https://docs.justicehub.in/

Data report | Contract Enforcement Litigation Data from District Courts | NIPFP

apoorv74 opened this issue

Data Accessibility Report

Links
Sample Dataset
Data Documentation
Data Dictionary

Files available

Files Status
Data dictionary (Human readable dictionary of data contents)
Data License (How to use and share the data)
Raw Dataset (The original/first data provided)
Processed Dataset (Final data used in analysis)
Dataset README (A Human readable description of the data)
Citation (How you want your data to be cited)

Data Cleaning & Standardisation Report

Issue Status
Data does not contain any PII (Personally Identifiable Information)
Data to be uploaded is in a machine-readable format (CSV, JSON)

Other details

  • Data maintainer details

Comments/Next Steps:

  • Can the columns of type HTML string be stored as separate tables?
  • In the data dictionary, mark each column as either taken directly from the source (raw/original) or derived/user-generated. E.g. the columns court_code, complexcode, day_pending, etc. can be marked as derived.
  • Only 76 of the 86 columns are present in the data dictionary.
  • How does the dataset deal with empty values? Is the handling different for each column? This information can also be included in the data dictionary for each column.
  • Variables with personally identifiable information (PII). As per our data sharing policy, we do not upload any datasets with sensitive information about either communities (CII) or individuals:
File              Variable
sample_dataframe  petNameAdd
sample_dataframe  pet_adv
sample_dataframe  pet_name
sample_dataframe  petnameadArr
sample_dataframe  petparty_name
sample_dataframe  resNameAdd
sample_dataframe  res_adv
sample_dataframe  res_name
sample_dataframe  resparty_name
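
The dictionary-coverage and empty-value questions raised above can be checked mechanically. The following is a minimal sketch, assuming the dataset is available as a CSV and the data dictionary can be reduced to a list of column names. The sample rows, the `dictionary_columns` list, and the set of markers treated as empty are made up for illustration; only the column names come from this report.

```python
import csv
import io


def undocumented_columns(dataset_columns, dictionary_columns):
    """Return dataset columns with no data dictionary entry, in dataset order."""
    documented = set(dictionary_columns)
    return [c for c in dataset_columns if c not in documented]


def empty_value_counts(rows, columns, empty_markers=("", "NA", "null")):
    """Count, per column, how many rows hold a value treated as empty.

    The markers are an assumption; the real dataset may encode
    missing values differently for each column.
    """
    counts = {c: 0 for c in columns}
    for row in rows:
        for c in columns:
            value = row.get(c)
            if value is None or value.strip() in empty_markers:
                counts[c] += 1
    return counts


# Tiny made-up sample standing in for the real dataset.
sample_csv = io.StringIO(
    "court_code,complexcode,day_pending,pet_adv\n"
    "1,101,45,\n"
    "2,,NA,some text\n"
)
reader = csv.DictReader(sample_csv)
rows = list(reader)
columns = reader.fieldnames

# Hypothetical list of columns documented in the data dictionary.
dictionary_columns = ["court_code", "day_pending"]

missing = undocumented_columns(columns, dictionary_columns)
empties = empty_value_counts(rows, columns)
print("Columns missing from the dictionary:", missing)
print("Empty values per column:", empties)
```

Running the same two checks against the full dataset would list the columns that are still undocumented and show which columns need an explicit note on how empty values are encoded.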

❗ Important:

  • Anonymise sensitive information. One way to do this is to remove the PII columns listed above from the original dataset.
  • Mention the license under which this dataset is to be released on the JusticeHub. Please refer to this link to learn more about open data licenses.
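
Removing the PII columns can be sketched as follows, assuming the dataset is a CSV file. The column names are the ones flagged in this report; the function name `drop_pii_columns` and the sample row are hypothetical, and the maintainers may prefer a different tool or workflow.

```python
import csv
import io

# Column names copied from the PII table in this report.
PII_COLUMNS = {
    "petNameAdd", "pet_adv", "pet_name", "petnameadArr", "petparty_name",
    "resNameAdd", "res_adv", "res_name", "resparty_name",
}


def drop_pii_columns(source, target, pii_columns=PII_COLUMNS):
    """Copy CSV rows from source to target, omitting the PII columns.

    Returns the list of columns that were kept.
    """
    reader = csv.DictReader(source)
    kept = [c for c in reader.fieldnames if c not in pii_columns]
    # extrasaction="ignore" makes the writer silently skip dropped columns.
    writer = csv.DictWriter(
        target, fieldnames=kept, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(row)
    return kept


# Made-up sample row; the real file would be opened with open(..., newline="").
source = io.StringIO(
    "court_code,pet_name,res_adv,day_pending\n"
    "1,placeholder,placeholder,45\n"
)
target = io.StringIO()
kept_columns = drop_pii_columns(source, target)
cleaned = target.getvalue()
print(cleaned)
```

Dropping columns only removes the fields listed here. Free-text or HTML string columns may still contain names, so they would need a separate review.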

📈 Improving data accessibility:

  • If possible, share all files listed under the Files available section above.
  • Include a README file with a short description of the dataset. Refer here to know more.