NIH AIM:4 YR:2 TASK:1B | 2.4.1B | (started yr1) Resolve OAI-PMH harvesting issues
mreekie opened this issue
Continuation of: #10
Updated title from "NIH AIM:4 YR:2 TASK:1A | 2.4.1A | (started yr1) Resolve OAI-PMH harvesting issues" to "NIH AIM:4 YR:2 TASK:1B | 2.4.1B | (started yr1) Resolve OAI-PMH harvesting issues"
The following issues, identified in "4 | 1.4.1 | Resolve OAI-PMH harvesting issues | 5" (#10), remain open at this time. Issues in bold are in the NIH Backlog Queue:
- Refactor the OAI code, Step 2 #8842
- Revisit/reimplement the concept of a "Harvested file" dataverse#8629
- Harvest: Set export can temporarily be adversely affected by full (clean) reindex. #3437
- Harvest: Do not list a set as available in ListSets until it has been successfully exported at least once. #3322
- Feature Request/Idea: Sanitize languages controlled vocabulary values #8243
- Harvesting: OAI sets are not updated when datasets are deleted. #8005
- Remaining mapping problems when harvesting from a repository using ISO 639-3 language codes #8578
Perhaps we could fix this one too:
Hey @cmbz, here are the GitHub issues I was referring to, which you asked me to list here when we were Slacking with @landreev today. Below each issue, I added the related email conversations with admins of other repositories.
- https://help.hmdc.harvard.edu/Ticket/Display.html?id=324230
- https://help.hmdc.harvard.edu/Ticket/Display.html?id=329544
- https://help.hmdc.harvard.edu/Ticket/Display.html?id=349681
I think the second GitHub issue, IQSS/dataverse.harvard.edu#142, is a duplicate and more specific version of the third GitHub issue, IQSS/dataverse.harvard.edu#153, so maybe the second one can be closed?
This would also be important for harvesting (it blocks the most datasets for us).