Print

Print


> I just happened to notice that the self, next and last links in the
> Atom feed [1] are pointing at the wrong place:
-- Thanks for catching this.  They've been fixed.

As for the increasingly slow response, we'll have to look into it.  We've known about it for a while now, but we've not had the opportunity to exploer some other options.  (I just now took another look at the code and do not see any obvious inefficiencies with it so I've always assumed this would require probably a fair amount of testing to correct.)

Happily, for subjects one must only go back fewer than 30 pages before hitting resources that would be in the latest bulk download.  Names, on the other hand, is entirely separate matter (you'd have to request a page north of 5500).  One of the reasons I want to update the bulk downloads.

Thanks again for the catch.

Yours,

Kevin



> -----Original Message-----
> From: LC Linked Data Service Discussion List
> [mailto:[log in to unmask]] On Behalf Of Ed Summers
> Sent: Friday, November 02, 2012 9:54 AM
> To: [log in to unmask]
> Subject: [ID.LOC.GOV] paging urls in atom feed broken
> 
> I just happened to notice that the self, next and last links in the
> Atom feed [1] are pointing at the wrong place:
> 
> <feed xmlns="http://www.w3.org/2005/Atom">
>   <title>Library of Congress Subject Headings</title>
>   <link href="http://id.loc.gov/subjects/feed/1" rel="self"/>
>   <link href="http://id.loc.gov/subjects/feed/2" rel="next"/>
>   <link href="http://id.loc.gov/subjects/feed/4117" rel="last"/>
>   <id>info:lc/authorities/subjects/feed</id>
>   ...
> </feed>
> 
> should instead look like:
> 
> <feed xmlns="http://www.w3.org/2005/Atom">
>   <title>Library of Congress Subject Headings</title>
>   <link href="http://id.loc.gov/authorities/subjects/feed/1"
> rel="self"/>
>   <link href="http://id.loc.gov/authorities/subjects/feed/2"
> rel="next"/>
>   <link href="http://id.loc.gov/authorities/subjects/feed/4117"
> rel="last"/>
>   <id>info:lc/authorities/subjects/feed</id>
>   ...
> </feed>
> 
> The same seems to be the case for the LCNAF feed [2] as well. Since I'm
> on the subject, I've noticed that digging back in time seems to take
> longer the further you go.
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/501/ > /dev/null
> real	0m3.193s
> user	0m0.004s
> sys	0m0.008s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/1001/ > /dev/null
> real	0m12.031s
> user	0m0.008s
> sys	0m0.004s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/1501/ > /dev/null
> real	0m15.623s
> user	0m0.004s
> sys	0m0.008s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/2001/ > /dev/null
> real	0m22.023s
> user	0m0.012s
> sys	0m0.004s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/2501/ > /dev/null
> real	0m24.492s
> user	0m0.004s
> sys	0m0.008s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/3001/ > /dev/null
> real	0m27.716s
> user	0m0.008s
> sys	0m0.008s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/3501/ > /dev/null
> real	0m46.800s
> user	0m0.008s
> sys	0m0.008s
> 
> ed@taylor:~$ time curl --silent
> http://id.loc.gov/authorities/subjects/feed/4001/ |less
> real	0m1.225s
> user	0m0.008s
> sys	0m0.012s
> 
> I'm not sure if some optimizations can be made there. I guess the
> silver lining is that Varnish appears to be caching the result, so it's
> just the first person to request the URL that has to pay the price.
> 
> //Ed
> 
> [1] http://id.loc.gov/authorities/subjects/feed/1
> [2] http://id.loc.gov/authorities/names/feed/1