Data report | Correctional Facilities in Assam | Studio Nilima
apoorv74 opened this issue
Data Accessibility Report
Links |
---|
Sample Dataset |
Data Documentation |
Data Dictionary |
Files available
Files | Status |
---|---|
Data dictionary (Human readable dictionary of data contents) | ❌ |
Data License (How to use and share the data) | ❌ |
Raw Dataset (The original/first data provided) | ❌ |
Processed Dataset (Dataset used for analysis) | ✅ |
Dataset README (A Human readable description of the data) | ❌ |
Citation (How you want your data to be cited) | ❌ |
Data Cleaning & Standardisation Report
Issue | Status |
---|---|
Data does not have any PII (Personally Identifiable Information) | ✅ |
Data to be uploaded is in a machine-readable format (CSV, JSON, XLS*) | ✅ |
Other details
- Data maintainer details
Comments/Next Steps:
Worksheet: Sheet 1
- RTI responses under each indicator can be shared as a separate worksheet. For example, each of `Women and Child Health`, `MEDICAL FACILITIES`, `MEDICAL STAFF`, `EDUCATION AND HEALTH`, and `DETENTION MANUAL + FORTNIGHTLY PRISON REPORT` can be a separate sheet, as they all have information under different heads (columns).
- All date columns, such as `RTI dated on`, `Reply received on`, etc., should contain only valid date values in one consistent format, e.g. YYYY-MM-DD.
- Remove cell formatting (colours, bold, italics, etc.).
- Indicators with `RTI IGP official no.` should be a separate worksheet/file. E.g.:
Group | RTI Details |
---|---|
MEDICAL FACILITIES | RTI IGP official no. - 34 |
Women and Child Health | RTI IGP official no. - 35 |
EDUCATION AND HEALTH | RTI IGP official no. - 44 |
- All column names should be standardised (lowercase, and mostly an identifier instead of a description).
- Every column name should just be a label, with its description available in the data dictionary. For example, a column name can be `gynaecologists_available` and its description can be "How many gynaecologists are appointed or available for visits in the correctional homes of Assam? Please provide the number of such doctors and frequency of visit (of last 3 years) along with institution/hospital where they are appointed or available." (which is the actual column name in the file shared).
- A few RTI responses can be converted to quantitative data as well. For example, responses mentioning nil can be converted to 0, etc. (This depends on the use case: sometimes it is not feasible to assign numbers to text, but it should be done where possible.)
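The clean-up steps for Sheet 1 could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the maintainers' actual process. The helper names (`to_identifier`, `to_iso_date`, `nil_to_zero`) and the list of accepted input date formats are assumptions made for the example.

```python
import re
from datetime import datetime

# Assumed input formats; extend this tuple to match what the sheet really contains.
DATE_FORMATS = ("%Y-%m-%d", "%d-%m-%Y", "%d/%m/%Y", "%d.%m.%Y")


def to_identifier(header: str) -> str:
    """Lowercase a header and join its alphanumeric runs with underscores."""
    return "_".join(re.findall(r"[a-z0-9]+", header.lower()))


def to_iso_date(value: str) -> str:
    """Return the date as YYYY-MM-DD, or raise ValueError if no format matches."""
    text = value.strip()
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"not a recognised date: {value!r}")


def nil_to_zero(value: str):
    """Map 'nil' to 0 and digit-only strings to int; leave other text unchanged."""
    text = value.strip()
    if text.lower() == "nil":
        return 0
    if text.isdigit():
        return int(text)
    return value


print(to_identifier("Reply received on"))
print(to_iso_date("05/03/2020"))
print(nil_to_zero("Nil"))
```

Short headers such as `RTI dated on` convert cleanly this way. Headers that are full RTI questions still need a hand-picked label (for example `gynaecologists_available`), with the question text moved into the data dictionary.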
Worksheet: Nature of illness - details
- Share this as a CSV file
- Remove `Nature of Illness` from Cell 1, as this is the title of the file/worksheet.
- Include geographic details as a separate column.
- Each column should contain values of a single type. For example, the column titled `Monthly Average (approx)` should contain only numbers, not dates (e.g. 2019).
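A check along these lines might help catch mixed types before the CSV is shared. It is only a sketch: the sample rows are made up for illustration, the `district` column name is hypothetical, and treating whole numbers between 1900 and 2100 as likely years is an assumption that may need adjusting.

```python
import csv
import io

# Made-up sample rows for illustration only; not taken from the actual worksheet.
SAMPLE = """district,Monthly Average (approx)
District A,12
District B,2019
District C,approx 5
"""


def suspicious_cells(rows, column, year_range=(1900, 2100)):
    """Return (row_number, value, reason) for cells that are not plain numbers
    or that look like a year rather than a count."""
    problems = []
    for row_number, row in enumerate(rows, start=2):  # row 1 is the header
        value = (row.get(column) or "").strip()
        try:
            number = float(value)
        except ValueError:
            problems.append((row_number, value, "not a number"))
            continue
        # A bare year parses as a number, so flag it separately (assumed range).
        if number.is_integer() and year_range[0] <= number <= year_range[1]:
            problems.append((row_number, value, "looks like a year"))
    return problems


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
for problem in suspicious_cells(rows, "Monthly Average (approx)"):
    print(problem)
```

Flagged cells are candidates for manual review rather than automatic correction, since a large count could fall inside the assumed year range.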
❗ Important:
- Please share a link to the data dictionary (This is a CSV file which contains information about the columns present in all files under a dataset). Learn more
- Mention the license under which this dataset is to be released on the JusticeHub. Please refer to this link for learning more about open data licenses
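As a rough illustration of what a data dictionary CSV could contain, the sketch below writes one row per column. The header names (`file`, `column_name`, `description`, `data_type`) and the file name `sheet_1.csv` are assumptions for this example. The template expected by JusticeHub may differ, so the linked guidance takes precedence.

```python
import csv
import io

# Assumed layout; the official data dictionary template may use other headers.
FIELDS = ["file", "column_name", "description", "data_type"]

ENTRIES = [
    {
        "file": "sheet_1.csv",  # hypothetical file name
        "column_name": "gynaecologists_available",
        "description": (
            "How many gynaecologists are appointed or available for visits in "
            "the correctional homes of Assam? Please provide the number of such "
            "doctors and frequency of visit (of last 3 years) along with "
            "institution/hospital where they are appointed or available."
        ),
        "data_type": "text",
    },
]


def write_data_dictionary(entries, handle):
    """Write one dictionary row per dataset column to an open text handle."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(entries)


buffer = io.StringIO()
write_data_dictionary(ENTRIES, buffer)
print(buffer.getvalue())
```

When writing to a real file instead of a buffer, open it with `newline=""` so that the `csv` module controls the line endings.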
📈 Improving data accessibility:
- If possible, share all files listed under the `Files available` section above.
- Share the data as `CSV` files.
- Include a README file, which is a short description of the dataset. Refer here to know more.