We basically did it this way:
> ----------------------------------------------------------------------------------------------
> Date: Mon, 22 Jul 2013 15:49:21 +0000
> Reply-To: "Data, API, website, and code of the Chronicling America website"
> From: Michael Beccaria
> Subject: Re: Chronam Technical Metadata/LCCN requirements?
>
> Stephanie,
> Regarding the LCCN requirement, you can get around that for more "local" institutional holdings by creating your own MarcXML file for the series you have and placing it in a folder that is set in the settings.py file using the BIB_STORAGE variable (line 127 in this file: https://github.com/LibraryOfCongress/chronam/blob/master/settings_template.py)<https://github.com/LibraryOfCongress/chronam/blob/master/settings_template.py_> and then editing the batch_loader.py file to point to those files instead of the loc website.
>
> In the current version of batch_loader.py that is on line 260 (https://github.com/LibraryOfCongress/chronam/blob/master/core/batch_loader.py)
>
> You would need to make sure you name the files something consistent so it can find it using the lccn variable. For example, if your fake "lccn" number for your local holdings is 'testlccl' then name the file "testlccn.xml", set the lccn in your marc and alto metadata files to be "testlccn". You would alter the line (line 260) to then point to the BIB_STORAGE directory instead like this:
>
>
> management.call_command('load_titles', BIB_STORAGE + '%s.xml' % lccn)
>
> Something like that. Hope that gets you started.
> Best,
> Mike
>
Note, there are a couple of things you need to be sure are in place in the MARCXML record you're hosting locally-- the fake lccn should be in the 010 field where the real one would be, and you need to make sure your record has something in the 752 subfields a/b/c/d fields, because otherwise SOLR will throw an error.
Good luck,
Mary Willoughby
Digital Library of Georgia
________________________________________
From: Data, API, website, and code of the Chronicling America website <[log in to unmask]> on behalf of Jeremy Echols <[log in to unmask]>
Sent: Thursday, September 22, 2016 2:17 PM
To: [log in to unmask]
Subject: Re: Next issue
That’s the infamous “MARC must live on LC’s site” problem. In Oni we added a way to configure external MARC sources, but in chronam you’d probably have to hack the code. I know various people have found a way to load MARC records from the local filesystem, but I’ve got no experience with that. I’m certain it’s a process with which others on this list are familiar, though.
From: Data, API, website, and code of the Chronicling America website [mailto:[log in to unmask]] On Behalf Of Glenn Bunton
Sent: Thursday, September 22, 2016 10:58 AM
To: [log in to unmask]
Subject: Next issue
Thanks for the responses to our --skip-essays installation problem. You were right it involved jetty and solr which was solved so we're good on that front.
Next problem.
When we try and load a batch we get the following error message. The apache server is running. When we go to the site we get a chronam page, though it doesn't look correct (like style sheets are not being applied) - Chronicling America « Library of Congress<http://historicnewspapers.sc.edu/> : Any thoughts or suggestions?
Chronicling America « Library of Congress
ENV)chronam@historicnewspapers:/opt/chronam$ django-admin.py load_batch /opt/chronam/data/batches/batch_scu_clothleaderinformerdispatch_ver01
INFO:root:loading batch at /opt/chronam/data/batches/batch_scu_clothleaderinformerdispatch_ver01
INFO:chronam.core.batch_loader:loading batch: batch_scu_clothleaderinformerdispatch_ver01
INFO:root:attempting to load marc record from http://chroniclingamerica.loc.gov/lccn/2016226265/marc.xml
INFO:chronam.core.title_loader:loading titles from: http://chroniclingamerica.loc.gov/lccn/2016226265/marc.xml
ERROR:chronam.core.batch_loader:unable to load batch: HTTP Error 404: NOT FOUND
ERROR:chronam.core.batch_loader:HTTP Error 404: NOT FOUND
Traceback (most recent call last):
File "/opt/chronam/core/batch_loader.py", line 165, in load_batch
issue = self._load_issue(mets_url)
File "/opt/chronam/core/batch_loader.py", line 262, in _load_issue
management.call_command('load_titles', url)
File "/opt/chronam/ENV/local/lib/python2.7/site-packages/django/core/management/__init__.py", line 120, in call_command
return command.execute(*args, **defaults)
File "/opt/chronam/ENV/local/lib/python2.7/site-packages/django/core/management/base.py", line 445, in execute
output = self.handle(*args, **options)
File "/opt/chronam/core/management/commands/load_titles.py", line 105, in handle
results = self.xml_file_handler(marc_xml_source, skip_index)
File "/opt/chronam/core/management/commands/load_titles.py", line 45, in xml_file_handler
results = title_loader.load(marc_xml)
File "/opt/chronam/core/title_loader.py", line 507, in load
loader.load_file(location)
File "/opt/chronam/core/title_loader.py", line 58, in load_file
map_xml(load_record, urllib2.urlopen(location))
File "/usr/lib/python2.7/urllib2.py", line 127, in urlopen
return _opener.open(url, data, timeout)
File "/usr/lib/python2.7/urllib2.py", line 410, in open
response = meth(req, response)
File "/usr/lib/python2.7/urllib2.py", line 523, in http_response
'http', request, response, code, msg, hdrs)
File "/usr/lib/python2.7/urllib2.py", line 448, in error
return self._call_chain(*args)
File "/usr/lib/python2.7/urllib2.py", line 382, in _call_chain
result = func(*args)
File "/usr/lib/python2.7/urllib2.py", line 531, in http_error_default
raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
HTTPError: HTTP Error 404: NOT FOUND
WARNING:root:no OcrDump to delete for batch_scu_clothleaderinformerdispatch_ver01 (University of South Carolina; Columbia, SC)
ERROR:chronam.core.management.commands.load_batch:unable to load batch: HTTP Error 404: NOT FOUND
Traceback (most recent call last):
File "/opt/chronam/core/management/commands/load_batch.py", line 39, in handle
batch = loader.load_batch(batch_name)
File "/opt/chronam/core/batch_loader.py", line 194, in load_batch
raise BatchLoaderException(msg)
BatchLoaderException: unable to load batch: HTTP Error 404: NOT FOUND
CommandError: unable to load batch. check the load_batch log for clues
|