
LAT Database Producer Archives Newspaper Website Home Pages

Interesting sideline project from Ben Welsh, a database producer at the LA Times. It’s called PastPages and has been set up to record hourly snapshots of the home pages of various newspaper websites.

There are currently about 80 publications being tracked, everything from the LAT and TMZ to Le Monde and The Guardian. Welsh has quickly exceeded his Kickstarter fundraising goal of $5,000 and tells journalism.co.uk he’s looking forward to building out his non-profit venture:

Currently, the site just takes an image snapshot of the front pages but in the future PastPages will scrape and host all the HTML, images and code running on the website. This will create an archive which is searchable by keyword and there are also plans to create an API which would allow other programmers to create new projects and mash-ups with the site’s data…

“I would view the site as a success if someone was studying the media coverage of the US election and came to me and said, ‘could you give me the database of everything you have?’ That would be really satisfying.”

Welsh, who launched PastPages earlier this month, says he got the idea last year during the Arab Spring. He’s sketching out his expansion plans here.
