I've got a few misgivings about the proposal to forbid nearly all formatting
characters, as stated in point 3c from Joan's list, or Appendix C.4 of Part 2
of the 'Assessment of Options' report.
There are at least two or three reasons I can think of offhand as to why we
might want to reconsider this:
1.) CGJ (Combining Grapheme Joiner)--would be used in the German
bibliographic
system to distinguish between umlauts and diaresis.
2.) ZWS (Zero-Width Space)--for CJK bibliographic records, I don't
know that a
stable solution has been arrived at for word separation, and this character
could provide some of the functionality that's needed to demarcate word
segmentation without introducing a visual space.
3.) ZWNJ (Zero-Width Non-Joiner)--I believe this character may be needed,
especially for inputtin in Persian?
4.) ZWJ (Zero-Width Joiner)--widely used in Devanagari scripts, Arabic, and
Korean. I think it's a needed character in Tamil for correctly spelling out
the country name of Sri Lanka.
|