September 9, 2003

Link archiving

O lazyweb, I have a request. I would like a automated spider to check the outgoing links from my blog on some regular basis and when and if any go dark, to find the most recent archived link at the Internet Wayback Machine and substitute it (thanks).

Speaking of the Wayback Machine, they're beta-testing a full-text search called Recall (hat tip to jwz), which does some groovy memewatch-type graphing of related search results over time.

[peak near 1996-7]A search for "crumlish" approximates pretty accurately my diminishing memeshare as the Web continues to broaded beyond the literate geeks who made a plaything of it back in the day.

Incidentally, all the peaks on my personal graphs correspond to publication dates of my mass-market Internet primers, demonstrating both the relevance of traditional ("old") media and the limited power of pontificating about the mechanics of a new medium. Self-reflection and media criticism continue to make sense as - if nothing else - a corrective, but handbooks for the knobs, widgets, cruft, and crome lose their appeal pretty quickly (even for their authors - trust me). That's why I envy/admire David Pogue. Writing a kewl-new-stuff column for the Circuits section of the New York Times (what, you want people to send me cutting-edge gizmos to play with so I can help ex-Wired readers figure out what netx toy to buy? what's the catch?) smells like a lot more fun than Press F7 for Dummies.

One other thing about the Wayback Project: Remember how after Google acquired Deja News, they put out a call for people with private USENET archives and asked them to donate to the collection? Shouldn't the Internet Archive put out a similar request for archived or cached web pages from before 1996? I know I'm biased but I think the Web was a pretty cool place in 1994 and 1995 and the earliest Web sites are from 1991, aren't they? Maybe they're mostly scientific papers but they're still part of the story. In this digital age there's no excuse for failing to document ourselves. Then again, if nothing ever got lost or buried, archaeologists would be out of a job.

Hey, I think I saw Brewster Kahle's name on the roster for Seybold SF, where I am right now, so maybe I'll buttonhole the poor sod and make my plea.

Posted by xian at September 9, 2003 9:29 AM

That's an extremely good idea. I could bring back quite a bit of content if I felt the need. I'm sure I am not alone.

Posted by: filchyboy at September 10, 2003 6:12 PM

Looking for info on the web in 1991, verified at a W3C site:

http://www.w3.org/History.html

"Dec 12: Paul Kunz installs first Web server outside of Europe, at SLAC."

Posted by: Scot Hacker at September 11, 2003 1:04 AM

good idea

Posted by: Video at January 18, 2004 2:45 AM
Other incoming links (via Technorati)

Hosted by Mediajunkie.

Sponsors
On this day in 2004
Long Live Uppity Negro: This was first posted at c u l t u r e k i t c h e n: Long live Uppity Negro. =============== My heart is heavy. Aaron Hawkin, editor and writer of Uppity-Negro.com has passed away. The details of how, where, why and when have not been posted yet,... (Blogosphere)
On this day in 2002
The UC Berkeley J-School Invites You (Sept 17, 2002): (via Scot Hacker): The UC Berkeley Graduate School of Journalism invites you to a Sept. 17 panel discussion on: Weblogs: Challenging Mass Media and Society Weblogs have received a lot of press lately, and journalism Weblogs are proliferating. Are Weblogs rejuvenating public discussion?. Are they an alternative to mass media? Join... (Salon Bloggers)
Unimpressing the Natives of Blogistan: This blog is semi-reviewed in A Geographical Guide to Blogistan, starting from the memewatch category (Ann Coulter does Google?), then to metablog, and finally finding RFB. The reviewer doesn't seem to understand what Radio categories are, which is something to worry about, because you can't expect someone stumbling onto one part... (Salon Bloggers)
Learning from Weblogs: In Building New Communities: Learning from Weblogs (a PowerPoint file), Tom Coates of plasticbag.org maps out the role of personal weblogs in community-building online. He has even broken down the communities surrounding a blog into three typical categores: online shared interests, geographical commonalities, and "real life" friends and family. Why am... (Memes)
Webloggers Not Qualified to Comment on Weblogs?: plasticbag.org wonders why journalists writing articles about weblogging consider webloggers to be unable to comment on our own revolution: Despite their protestations to the contrary, most mainstream publishers who say that weblogs represent a new democratising of the media still lapse into talking to figures with substantial 'authority' in the 'real... (Weblog Concepts)
Weblogged Conversation: Slow Academic Adoption of Weblogs: Seb closes the loop on an interesting multi-weblog conversation about why weblogs have not (yet?) been widely adoped by academics as a research tool: Stephen over at Blogging Alone mentions Sébastien Paquet's reasons why blogging has failed to become a widely accepted research tool among academia. I disagree with nearly all... (Memes)
Favorite Blog of the Moment: For some reason I'm really enjoying Girls Are Pretty, whose author dedicates each day to some specific thing that you, the reader, should do that day. Still blogging at a low-level. Still in New York. It's hot this morning. Soon I'll be heading over to Brooklyn to hang in my brother's... (Weblog Concepts)
Flash/Shockwave/Director Branding Confusion: Krzysztof Kowalczyk wonders about how Macromedia differentiates its various product offerings: I got interested in creating flash animations. I went to Macromedia site. I was confused. From their product description I couldn't figure out which product does what. Thanks to mostly prior knowledge, I figured out that Dreamweaver MX is for... (Information design)
Things I Know Are Broken at this Blog: I know that the list of my RSS feeds is getting a macro error. I haven't had a chance to look into it yet. I know that the headlines page has no updates since 8/21, and I need to figure out a way to put the headlines macro somewhere where it... (Weblog Concepts)