Print

Print


Eric Lease Morgan wrote:
> On Sep 26, 2005, at 9:55 AM, Oldroyd, Bill wrote:
> 
>> I like it.
>>
>> But the response will contain records using utf-8 which doesn't  seem to
>> display correctly. Try "lenin single works" .
>>
>>>    * http://spell.ockham.org/bl/
> 
> 
> I feel like the Ugly American because I do not know how to resolve  this 
> problem, and it appears in many of my other implementations.
> 
> What encoding should I add to my output so it works correctly?

well, whatever encoding there was used in the recods in the first place.

My experience with all this encoding problems is that you need to do 
three-four things right

1) know which encoding you are coming from
2) know which encoding you want to go to
3) apply the right encoding transformation
4) label your XML with the right encoding afterwards

It always astonishes me how many people trying to solve 3), not having a 
good grip of 1) and 2). Off course, 2) and 3) are not used if you are 
happy with the encoding you got in the first place.

In addition, Perl has it's own magic which will toss your nicely planned 
strategy over board.

I had quite many debugging nightmares cased by Perl magic. See maybe the 
following for a better explanaition:

http://www.ahinea.com/en/tech/perl-unicode-struggle.html


Marc Cromme, Index Data

> 


-- 

Marc Cromme, cand. polyt, Ph.D
Senior Developer, Project Manager

Index Data Aps
K°bmagergade 43, 2
1150 Copenhagen K.
Denmark

tel: +45 3341 0100
fax: +45 3341 0101

http://www.indexdata.com

INDEX DATA Means Business
for Open Source and Open Standards