Can you give us a few extracts of this big HTML finding aid?
Do you wish to convert it to just one EAD file?
Does this HTML file happen to group a number of distinct fonds or series?
I am quite familiar with conversion (any type) but I don't know much
about your data so I can only say a few generalities:
Often, programming is the best way of dealing with conversion, be it
XSLT or Visual Basic or Visual Foxpro.
Programming can get complicated if data has been treated differently
from one source file and the other.
But if there are clear rules that can be implemented like position in
the file, position on the line, appearence of a header for each type
of data, etc, then programming could be done quickly. Even wording
can be used to recognize the EAD field to transfer to. However, my
experience has shown that there is always some kind of surprise
around the corner.
Maybe a solution could be to start with hardcoded administrative data
always the same from one EAD file to the other, and then to extract
the most obvious like the title, etc, and put the rest in a temporary
field and process it manually.
In other word, build a EAD template with your own administrative data
and then extract from the source file what is most easy, and copy the
rest between temporary tags in order to recognize it later and
process it properly (manually).
Some data, like biography, administrative history, scope & content
can easily be spotted and it often constitutes the bulk of some
archival descriptions. This could be processed right away with the
title, in the first step.
This work could be distributed between your staff. Juniors could
create the new EAD file from the template, copy the easily
recognisable fields like title at the proper place, and copy the rest
of the source data to a temporary tag. Seniors would then open the
partially processed EAD file and complete the conversion.
At the end, another member of your team could test the EAD file
against its DTD or its SCHEMA.
Hope this can help.
At 17:06 2010-02-05, you wrote:
>We have a very lengthy finding aid that exists in HTML format and
>would like to convert it to EAD. We're currently using a NoteTab
>text editor template to create our EAD finding aids from "scratch."
>Has anyone on the list had experience in doing such conversion (not
>necessarily with NoteTab, but possibly with other clients), or could
>advise us on some practical ways to do this conversion? At this
>point, it seems that a lot of copying and pasting is involved, or
>complicated programming, but we're wondering if there's a more
>"expeditious" way to do this?
>Burt Altman, CA
>Tallahassee, FL 32306