Amy Turner writes:
[...]
> They work from the language code, lists of articles by language, lists
> of exceptional phrases "A is for ...", and specific cases of titles
> that are not in the language of the text.
Do you find that records have valid language codes for specific fields
like titles? In my experience the language codes, if they exist, can be
inconsistent.
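
To make the concern concrete, here is a rough Python sketch of the kind of
lookup described in the quoted text. It is not LTI's actual processing; the
article lists, the exception list, and the function name nonfiling_count
are all hypothetical and far too small for real use.

```python
# Hypothetical, deliberately tiny article lists keyed by language code.
ARTICLES = {
    "eng": ["the", "an", "a"],
    "fre": ["les", "le", "la", "l'", "un", "une"],
    "ger": ["der", "die", "das", "ein", "eine"],
}

# Hypothetical exceptions: leading words that look like articles but are not.
EXCEPTION_PREFIXES = {
    "eng": ["a is for ", "a to z"],
}


def nonfiling_count(title, lang_code):
    """Return how many leading characters to skip when filing the title."""
    lowered = title.lower()

    # Exceptional phrases win over the article list.
    for prefix in EXCEPTION_PREFIXES.get(lang_code, []):
        if lowered.startswith(prefix):
            return 0

    # Try longer articles first so "une" is tested before "un".
    for article in sorted(ARTICLES.get(lang_code, []), key=len, reverse=True):
        if article.endswith("'"):
            # Elided article: no space follows it (e.g. "L'etranger").
            if lowered.startswith(article) and len(lowered) > len(article):
                return len(article)
        elif lowered.startswith(article + " "):
            # Count the article plus the space that follows it.
            return len(article) + 1

    # Unknown language code or no leading article.
    return 0
```

Everything hinges on lang_code: with this sketch "Die Hard" is left alone
when the record is coded eng, but loses its first four characters if the
record is coded ger, which is exactly the failure an inconsistent code
would cause.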
> Language recognition software, rather than the language code, could be
> used (Following LTI processing, we corrected hundreds of language
> codes that erroneous non-filing fix had revealed).
On short strings, language recognition is very difficult: I haven't had
much luck with it on strings under 250 characters or so.
-tree