sul-dlss / was-registrar-app

Rails app to organize downloaded web archiving data and trigger preassembly/accessioning when appropriate

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

redundant collection ID string in accessioning path

nullhandle opened this issue · comments

Convention for the programmatically-created path for retrieved Archive-It crawls appears to have the collection ID string twice, e.g.,

/was_unaccessioned_data/jobs/AIT_5425/2018_10/AIT_5425/694543/2018-10-25T07:00:23.344000Z

Would be great if we could remove the second instance of the collection ID, so that it would instead be, e.g.,

/was_unaccessioned_data/jobs/AIT_5425/2018_10/694543/2018-10-25T07:00:23.344000Z

The second instance of the collection id is added by the Wasapi Downloader, so would need to look there for a fix.