On 03/10/13 12:31, Jon Stroop wrote:
> 1. ...cache it!
> 2 When we're all validating each of our 500 gazillion METS are we really
> hitting loc.gov each and every time, for every record?
1 & 2 - our automated systems access loc.gov via a caching proxy,
but the schemas are going to fall out of that proxy when the TTL
expires, which it probably has by now. We could have built a specific,
smarter proxy for our XSD calls, but hindsight is just that.
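
As a rough illustration of what a "smarter" cache for schema fetches
might do, here is a minimal Python sketch that keeps serving an
expired copy when the upstream fetch fails. It is not a description of
any existing proxy: the class name StaleOkCache and the fetch, ttl_seconds
and clock parameters are all hypothetical, and the fetch function is
supplied by the caller.

```python
import time


class StaleOkCache:
    """Hypothetical cache: serves an expired entry if the upstream fetch fails."""

    def __init__(self, fetch, ttl_seconds, clock=time.monotonic):
        self._fetch = fetch        # caller-supplied: url -> bytes, raises OSError on failure
        self._ttl = ttl_seconds
        self._clock = clock        # injectable so the behaviour can be tested
        self._entries = {}         # url -> (fetched_at, body)

    def get(self, url):
        now = self._clock()
        entry = self._entries.get(url)
        # Fresh entry: answer from the cache without touching upstream.
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]
        try:
            body = self._fetch(url)
        except OSError:
            # Upstream is unavailable: fall back to the stale copy if we have one.
            if entry is not None:
                return entry[1]
            raise
        self._entries[url] = (now, body)
        return body
```

With a fetch function built on urllib.request, network failures surface
as urllib.error.URLError, which is a subclass of OSError, so they would
take the stale-copy path above. A URL that has never been fetched still
fails, so this only helps for schemas already seen at least once.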
> 3. It's possible to validate an XML document against a different schema
> than the one it references.
Absolutely. But going into the dozens of places where your automated
systems may be validating and transforming tens of thousands of
documents a day and kludging something to point to a local copy isn't
a matter of typing something different on the command line. DNS
spoofing loc.gov or transparently proxying are also options, but I
think the community needs to look at whether there's a better way to
manage this.
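
For what "pointing to a local copy" involves at its simplest, here is
a small Python sketch that reads the xsi:schemaLocation pairs from a
document's root element and swaps known remote URLs for local paths.
The function name local_schemas_for and the /srv/schemas path are
hypothetical, and it only inspects the root element; it says nothing
about how any particular validator should then be told to use those
paths.

```python
import xml.etree.ElementTree as ET

# Attribute name in ElementTree's {namespace}local notation.
XSI_SCHEMA_LOCATION = "{http://www.w3.org/2001/XMLSchema-instance}schemaLocation"


def local_schemas_for(xml_text, local_copies):
    """Map each namespace declared in xsi:schemaLocation to a schema location.

    local_copies is a caller-supplied dict of remote URL -> local path
    (hypothetical paths here). URLs without a local copy are returned unchanged.
    """
    root = ET.fromstring(xml_text)
    # schemaLocation is a whitespace-separated list of namespace/URL pairs.
    tokens = root.get(XSI_SCHEMA_LOCATION, "").split()
    resolved = {}
    for namespace, url in zip(tokens[0::2], tokens[1::2]):
        resolved[namespace] = local_copies.get(url, url)
    return resolved


doc = (
    '<mets xmlns="http://www.loc.gov/METS/" '
    'xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" '
    'xsi:schemaLocation="http://www.loc.gov/METS/ '
    'http://www.loc.gov/standards/mets/mets.xsd"/>'
)
copies = {"http://www.loc.gov/standards/mets/mets.xsd": "/srv/schemas/mets.xsd"}
print(local_schemas_for(doc, copies))
```

The mapping itself is trivial; the cost is in wiring something like it
into every tool in every workflow that currently follows the remote URL.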
This may just be a difference of perspective when it comes to the
scale and complexity of systems and workflows. We can work around
the current unavailability, but we're all going to have to do it
individually, and we should be looking for a sustainable way to
stop it happening again.
--
Illtud Daniel
Pennaeth TGCh Head of ICT
Llyfrgell Genedlaethol Cymru National Library of Wales