thank you.

I never thought of using OpenRefine (I think it is derived from the
powerful tool Google bought with Metaweb), or other online tools. I
wonder how much time it would take to massage this pdf.

if you get somewhere on this, please let me know.

-brewster


On 5/22/19 5:09 AM, Feustle, Maristella wrote:
> Are you familiar with OpenRefine? It would take some work, but you could copy/paste this into a text file and wangle it thusly. The Transpose function will convert row headings into columns.
>
>
> The text file itself will require some cleanup before the cleanup in OpenRefine. A reader like Notepad++ that allows mass edits with Regular Expressions (RegEx) will help.
>
>
> Of course, I intend to mess with this on my own. If I get it to something I like, I'll share it.
>
>
> Maristella
>
> ________________________________
> From: Association for Recorded Sound Discussion List <[log in to unmask]> on behalf of Brewster Kahle <[log in to unmask]>
> Sent: Tuesday, May 21, 2019 7:45:26 PM
> To: [log in to unmask]
> Subject: [EXT] [ARSCLIST] anyone database-ized Brian Rust's: Jazz Records 1917-1934?
>
> It is a huge help that the pdf is available (thank you Mainspring!).   I
> would like to use it for automatically finding dates for records in the
> Great 78 Project and point back to the right page in the book.
>
> To do this I need it in a format like a CSV (but can convert it from any
> other database-like format)
>
>     Label, catno, matrix, performer, title, date, page-number
>
>     https://78records.files.wordpress.com/2016/06/rust_jr_free-edition.pdf
>
> I did this for Almost Complete ...
> https://archive.org/details/almostcomplete7800barr/page/115
> and American 45. and 78...
> https://archive.org/stream/american45fortyf00dani#page/38/mode/1up
> by having these hand key'ed and each month I use this for the new records.
>
>
> Does anyone have any databased version of this?   It must have started
> out in such a thing.
>
> Thank you!
>
> -brewster
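
For reference, the CSV layout requested above (Label, catno, matrix, performer, title, date, page-number) could be written out roughly as follows. This is only a sketch using Python's standard csv module: the column spellings, the `rows_to_csv` helper, and the sample row are assumptions made for illustration, and the sample values are placeholders, not data from the book.

```python
import csv
import io

# Column names adapted from the format requested in the thread
# (the exact spellings here are an assumption).
FIELDS = ["label", "catno", "matrix", "performer", "title", "date", "page_number"]


def rows_to_csv(rows):
    """Serialize an iterable of dicts keyed by FIELDS into CSV text."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for row in rows:
        writer.writerow(row)
    return buf.getvalue()


# Placeholder row for illustration only; not taken from the discography.
sample = [
    {
        "label": "ExampleLabel",
        "catno": "0000",
        "matrix": "X-0000",
        "performer": "Example Band",
        "title": "Example Title, Part 1",
        "date": "1900-01-01",
        "page_number": 1,
    }
]

print(rows_to_csv(sample))
```

The csv module quotes any field that contains a comma, so titles with embedded commas stay in a single column.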