Dear Gary,
I get nearly 1,628,000 records with 11x fields in all, but of those how many records might call for a second $c?
Curious,
Kevin
--
Kevin Ford
Network Development and MARC Standards Office
Library of Congress
Washington, DC
> -----Original Message-----
> From: MARC [mailto:[log in to unmask]] On Behalf Of Gary L Strawn
> Sent: Saturday, January 25, 2014 10:12 PM
> To: [log in to unmask]
> Subject: [MARC] Proposal 2014-02 (repeating subfield $c)
>
> In case anyone is interested, during a meeting later in the day I made a
> dump of all of the 11X fields in my copy of the NACO authority file
> with subfield $c containing "and", and wrote a little program to
> attempt to match the text in $c to authority records. The program
> attempted to find a corporate or geographic authority record for the
> whole thing ("Bellagio Study and Conference Center"; "Sarajevo, Bosnia
> and Hercegovina") or authority records for the texts to the left and
> right of the "and" ("San Francisco, Calif. and San Diego, Calif."). If
> the program finds that "and" joins two identifiable things, then
> insertion of $c might be desired; if it finds that "and" is part of a
> single thing, then no action is appropriate. All of course to
> determine if automated retrospective changes to existing data might be
> possible.
>
> Skipping the details, I'll report that the program was able to
> determine either that "and" joined two distinct pieces (and so the
> heading might call for a second $c) or was part of a single thing (and
> so the heading must be left alone) for a bit over 81% of the headings
> tested. I found slightly different but comparable results for the
> headings in bibliographic records in my database; the fate of about 75%
> of the headings could be determined by program. I suspect that
> additional programming (more than 15 minutes' worth!) could raise the
> percentages a bit; but at least for these two files, the overall
> numbers are so small that it would probably be more expedient to deal
> with the indeterminate cases individually than to spend time developing
> and testing fancier code.
>
> Gary L. Strawn, Authorities Librarian, etc. Twitter: GaryLStrawn
> Northwestern University Library, 1970 Campus Drive, Evanston IL 60208-2300
> e-mail: [log in to unmask] voice: 847/491-2788 fax: 847/491-8306
> Forsan et haec olim meminisse iuvabit. BatchCat version: 2007.25.428
>
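
For readers who want to try something similar, the matching approach described in the quoted message might be sketched roughly as follows. This is not Gary's program, only a minimal illustration under several assumptions: the function names (normalize, classify_and) are hypothetical, the authority headings are assumed to be available as a plain set of strings in the same form as the $c text, and matching is a simple normalized string comparison rather than anything a real authority file would require.

```python
import re

def normalize(text):
    """Collapse whitespace, drop edge punctuation, and lowercase for comparison."""
    return re.sub(r"\s+", " ", text).strip(" .,;:").lower()

def classify_and(subfield_c, authority_headings):
    """Classify a $c value containing "and".

    Returns ("single", [text]) when the whole text matches one heading,
    ("split", [left, right]) when both sides of an "and" match headings,
    and ("indeterminate", [text]) otherwise.
    authority_headings is a hypothetical stand-in for an authority file lookup.
    """
    known = {normalize(h) for h in authority_headings}

    # First try the whole string: "and" may be part of a single name.
    if normalize(subfield_c) in known:
        return "single", [subfield_c]

    # Otherwise try each standalone "and" as a possible split point.
    for match in re.finditer(r"\s+and\s+", subfield_c):
        left = subfield_c[:match.start()].strip()
        right = subfield_c[match.end():].strip()
        if normalize(left) in known and normalize(right) in known:
            return "split", [left, right]

    # Neither test succeeded; leave for individual review.
    return "indeterminate", [subfield_c]

# Illustrative headings only, taken from the examples in the message.
headings = {
    "Sarajevo, Bosnia and Hercegovina",
    "San Francisco, Calif.",
    "San Diego, Calif.",
}

for value in (
    "Sarajevo, Bosnia and Hercegovina",
    "San Francisco, Calif. and San Diego, Calif.",
    "Springfield and Shelbyville",
):
    print(value, "->", classify_and(value, headings))
```

A "split" result marks a heading where a second $c might be inserted, a "single" result marks one to leave alone, and "indeterminate" cases would be the ones to handle by hand.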