decode failed due to XML header
bertsky opened this issue · comments
Robert Sachunsky commented
If I do metha-sync -format mets -set 17th-century-prints http://digital.slub-dresden.de/oai/
I finally end up with a fatal error after this:
INFO[0024] decode failed with: <?xml version="1.0" encoding="UTF-8"?>
INFO[0025] decode failed with: <?xml version="1.0" encoding="ISO-8859-1"?>
INFO[0027] decode failed with: <?xml version="1.0" encoding="WINDOWS-1252"?>
INFO[0028] decode failed with: <?xml version="1.0" encoding="UTF-16"?>
INFO[0029] decode failed with: <?xml version="1.0" encoding="US-ASCII"?>
Is this a problem with our data, or with the parser?
Robert Sachunsky commented
Note, by instrumenting L197 I found out that the record in question simply gets cut short. When I increase the io.LimitReader
size limit from 16 MB to 64 MB, then it runs through.