unsupported character
NicoledeGreef opened this issue · comments
Nicole de Greef commented
Benjamin Jubb commented
it appears to be garbage data. the bytes in question are 0xc2 0x96 which don't decode to anything in UTF8 as far as i can tell.
Nicole de Greef commented
have asked DAs to look into this record.
Nicole de Greef commented
confirmed to be garbage data; Brad found a number of instances of this in the Francophone data sets. he will take it to his Chapter for investigation/resolution.