We saw and see (!) the problem when we convert from USMARC/MARC21 to
MARCXML. Obviously the MARC control chars are not a problem since these
are part of the ISO2709 structure, but sometimes you see Control chars
in MARC records.. Again, there is nothing we can do than delete them as
things stand now. It also means that we can't construct the original
ISO2709 record again.. For both string and xml encoding charset of
embedded record must be same as SRW wrapper. base64 would solve that.
base64 would solve other things too, so maybe we should consider that.
-- Adam
Mike Taylor wrote:
>>Date: Mon, 28 Jun 2004 18:09:45 +0200
>>From: Adam Dickmeiss <[log in to unmask]>
>>
>>http://www.w3.org/TR/2004/REC-xml-20040204/#charsets
>>
>>The XML spec guys really did exclude most chars in the 0-0x01f
>>range. I wonder why.
>>
>>
>
>Un-bee-LEEV-able. Here's us, all this time saying that XML is pretty
>much a generic record syntax analogous to GRS-1, and now it turns out
>that it's no such thing. More fool me for not having checked this out
>properly before, but -- What CAN they have been thinking? How in the
>name of all that is rational can it be any of XML's business what kind
>of data we choose to embed in it?
>
>
|