IQSS / dataverse-sample-data

Scripts and sample data for demo purposes

Failed to Publish Dataset – The dataset could not be published because one or more of the datafiles in the dataset could not be validated (physical file missing, checksum mismatch, etc.) Please contact support for further assistance.

pdurbin opened this issue

If you try to add the dataset @philippconzett added in f3ef7ee...

$ git diff dvconfig.py.sample
diff --git a/dvconfig.py.sample b/dvconfig.py.sample
index 8f97f28..f8815c9 100644
--- a/dvconfig.py.sample
+++ b/dvconfig.py.sample
@@ -5,6 +5,7 @@ sample_data = [
 'data/dataverses/pums/pums.json',
 'data/dataverses/pums/datasets/2000pums5/2000pums5.json',
 'data/dataverses/dataverseno/dataverseno.json',
+'data/dataverses/dataverseno/datasets/tabular-sample-data/tabular-sample-data.json',
 'data/dataverses/open-source-at-harvard/open-source-at-harvard.json',
 'data/dataverses/open-source-at-harvard/dataverses/dataverse-project/dataverse-project.json',
 'data/dataverses/open-source-at-harvard/dataverses/dataverse-project/datasets/dataverse-irc-metrics/dataverse-irc-metrics.json',
@@ -26,6 +27,3 @@ sample_data = [
 'data/dataverses/king/king.json',
 'data/dataverses/king/datasets/cause-of-death/cause-of-death.json',
 ]
-
-# put this back at line 6 once https://github.com/IQSS/dataverse/pull/6924 is merged
-# 'data/dataverses/dataverseno/datasets/tabular-sample-data/tabular-sample-data.json',
HMDC-beamish:dataverse-sample-data pdurbin$ 

... it appears to publish correctly from the API output...

{'status': 'OK', 'data': {'id': 246, 'identifier': 'FK2/MTJL70', 'persistentUrl': 'https://doi.org/10.5072/FK2/MTJL70', 'protocol': 'doi', 'authority': '10.5072', 'publisher': 'Root', 'storageIdentifier': 'file://10.5072/FK2/MTJL70', 'metadataLanguage': 'undefined'}}

... but the dataset has not, in fact, been published.
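To make the mismatch easier to spot from a script, here is a minimal sketch that checks the dataset's actual version state rather than trusting the 'OK' from the publish call. It assumes the Dataverse native API returns the dataset at `/api/datasets/:persistentId/` with a `latestVersion.versionState` field that reads `RELEASED` once publication has finished; the helper names (`dataset_url`, `version_state`, `is_published`) and the base URL are hypothetical and not part of this repo.

```python
from urllib.parse import urlencode


def dataset_url(base_url, persistent_id):
    """Build the lookup URL for a dataset by persistent ID (assumed endpoint shape)."""
    query = urlencode({"persistentId": persistent_id})
    return f"{base_url.rstrip('/')}/api/datasets/:persistentId/?{query}"


def version_state(dataset_response):
    """Return latestVersion.versionState from a parsed JSON response, or None if absent."""
    data = dataset_response.get("data", {})
    latest = data.get("latestVersion", {})
    return latest.get("versionState")


def is_published(dataset_response):
    """True only if the response is OK *and* the latest version is RELEASED."""
    return (
        dataset_response.get("status") == "OK"
        and version_state(dataset_response) == "RELEASED"
    )


# The publish response quoted above carries no version state at all,
# so on its own it cannot confirm that publication succeeded.
publish_response = {
    "status": "OK",
    "data": {"id": 246, "identifier": "FK2/MTJL70", "protocol": "doi"},
}
print(is_published(publish_response))

# Fetching the real state would look roughly like this (not executed here;
# requires a running installation and an API token for unpublished drafts):
#   import json, urllib.request
#   req = urllib.request.Request(
#       dataset_url("http://localhost:8080", "doi:10.5072/FK2/MTJL70"),
#       headers={"X-Dataverse-key": "<api token>"},
#   )
#   with urllib.request.urlopen(req) as resp:
#       print(is_published(json.load(resp)))
```

Because publication may be finalized asynchronously, a check like this would probably need to be retried for a short while before concluding that the dataset is still a draft.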

If you log in and look at the dataset page, you can see the following error:

"Failed to Publish Dataset – The dataset could not be published because one or more of the datafiles in the dataset could not be validated (physical file missing, checksum mismatch, etc.) Please contact support for further assistance."

Here's a screenshot:

[Screenshot, 2022-08-11: dataset page for "Testing Tabular File Ingest - Excel and tab-separated txt"]

Here's a stacktrace as of IQSS/dataverse@1e528090e3 (develop, post 5.11.1): stacktrace.txt

I'm not sure, but possibly this is related:

(As suggested in the diff above I thought we'd be able to add this dataset into the default sample data now that IQSS/dataverse#6924 has been merged.)