miku / metha

Command line OAI-PMH harvester and client with built-in cache.

Home Page:https://lab.ub.uni-leipzig.de/metha/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

decode failed due to XML header

bertsky opened this issue · comments

If I do metha-sync -format mets -set 17th-century-prints http://digital.slub-dresden.de/oai/ I finally end up with a fatal error after this:

INFO[0024] decode failed with: <?xml version="1.0" encoding="UTF-8"?> 
INFO[0025] decode failed with: <?xml version="1.0" encoding="ISO-8859-1"?> 
INFO[0027] decode failed with: <?xml version="1.0" encoding="WINDOWS-1252"?> 
INFO[0028] decode failed with: <?xml version="1.0" encoding="UTF-16"?> 
INFO[0029] decode failed with: <?xml version="1.0" encoding="US-ASCII"?> 

Is this a problem with our data, or with the parser?

Note, by instrumenting L197 I found out that the record in question simply gets cut short. When I increase the io.LimitReader size limit from 16 MB to 64 MB, then it runs through.