You are viewing a read-only archive of the Blogs.Harvard network. Learn more.

The Longest Now


Link distance between two articles
Friday October 15th 2004, 6:21 am
Filed under: %a la mod

Now you can find out how far removed two Wikipedia articles are from one another… more or less. Give this script two article titles and let it rip.

Of course it needs some tweaking, removing the easy links b/t articles, such as years and days (which get linked often, in a quirk of WP style), but it’s ”’wicked fun”’ to use. Sample results:

Barry Bonds →American football →Basketball →January 20 →1970s →Barry Manilow

Cheers →Alcoholism →Clich




But what is the meaning of the distance?

That those articles are underpopulated with data?

Or that those two articles are really (un)related by factor d.

You could also recompute d by giving the links weights somehow tied to the size/broadness-coverage of that article.

Ug, this is what happens after AI-Searching Algorithms class.

Comment by Omar 10.21.04 @ 3:32 pm





Bad Behavior has blocked 190 access attempts in the last 7 days.