Library of Congress photos on Flickr

On a similar theme to Google offering to host open source scientific data, the US Library of Congress has announced on its blog (( which is impressive in its own right, and appears to use WordPress too )) a project that has published over 3,000 photos from the LoC archives and seems to be going down a storm with Flickr users!

This is a pilot for what seems to be a larger Flickr initiative, which the LoC describes thus:

We’re also very excited that, as part of this pilot, Flickr has created a new publication model for publicly held photographic collections called “The Commons.” Flickr hopes—as do we—that the project will eventually capture the imagination and involvement of other public institutions, as well.

The LoC is also pretty sharp about the potential power of this, and how it may benefit themselves (and future generations), saying:

The real magic comes when the power of the Flickr community takes over. We want people to tag, comment and make notes on the images, just like any other Flickr photo, which will benefit not only the community but also the collections themselves. For instance, many photos are missing key caption information such as where the photo was taken and who is pictured. If such information is collected via Flickr members, it can potentially enhance the quality of the bibliographic records for the images.

This potential is foreshadowed by the discovery of 3 previously misidentified images of Abraham Lincoln’s second commemoration by a user of their traditional archive!

A user of our Prints and Photographs Online Catalog raised questions about the images, which sent Library of Congress curator Carol Marie Johnson sleuthing. Careful comparisons to the only other two known images from that event and meticulous combing through records led her to this discovery. My point is that if we can uncover those kinds of treasures, thanks in part to our discerning Web users, imagine what might happen after setting loose hoards of eager photo fans at Flickr.

This is why preserving our information for future generations is such an important activity, and why projects such as the National Archives of Australia push to develop open source Digital Preservation software tools is vital to ensure that our descendants have a rich picture of their history as we have of our ancestors.

Google to host Open Source scientific data sets

Now this sounds really interesting..

Sources at Google have disclosed that the humble domain, http://research.google.com, will soon provide a home for terabytes of open-source scientific datasets. The storage will be free to scientists and access to the data will be free for all.

They may also provide data viz tools..

Building on the company’s acquisition of the data visualization technology, Trendalyzer, from the oft-lauded, TED presenting Gapminder team, Google will also be offering algorithms for the examination and probing of the information.

There is more information (including about why Google intend to import data by shipping RAID arrays around the world) here and (more up to date) here.

We live in exciting times!

SCO stock price

I’ve not been following SCO’s stock price recently, it’s been pretty bad since the heady days of around $4 per share, but a Groklaw newspick from their RSS feed just piqued my attention by pointing at the Yahoo chart for SCOXQ.PK (delisted from NASDAQ, trading on minor markets) which shows SCO trading at barely over 5c a share.

SCOXQ.PK share price, January 20th, 2008.

IFPI – can we control all European Internet traffic – please ?

From Ars Technica (early December):

Imagine a world in which a single industry could control an entire continent’s access to particular web sites, force ISPs to install expensive deep packet inspection gear that would search the complete Internet data streams of millions of users, and force Internet applications to conform to its design parameters or risk being blocked. If you’re a European consumer, this might sound like a paranoid dystopia, but it’s actually a vision of paradise—if paradise were designed by the IFPI.

What are they after ? Terrorists ? Paedophiles ? Drug runners ? Not quite..

In a recent memo to European legislators, the worldwide music lobby laid out its vision of a world in which all ISPs adopted three “feasible and reasonable options” to help address copyright infringement on their networks.

Not surprisingly the EFF has something to say about this (PDF)..

Google Code Search

If you’re ever looking around for a piece of code to do something, then you should try Google’s Code Search.

For example, say I’m looking for some C code to parse RFC 2822 mail headers (which, strangely enough, I am). I go to codesearch and put in a search term of lang:c rfc2822

That gives me back a bunch of results, but say I want to look for something with a BSD license to use with Vacation, then I just extend that search with a license:bsd term, which gives me the great news that SMail (which I used to run 13-14 years ago now) has a librfc2822 directory, which deserves further investigation!

Sears – purveyors of Spyware to the masses ?

I wonder how many people using Windows have been bitten by this new spyware, as related by the Computer Associates Security Advisor Blog ?

Sears.com is distributing spyware that tracks all your Internet usage – including banking logins, email, and all other forms of Internet usage – all in the name of “community participation.” Every website visitor that joins the Sears community installs software that acts as a proxy to every web transaction made on the compromised computer. In other words, if you have installed Sears software (“the proxy”) on your system, all data transmitted to and from your system will be intercepted.

The mention of “banking logins” is to get your attention, because as this apparently hoovers up all your traffic it will get whatever you do, presumably including credit cards, etc.

They also have an interesting take on how to do privacy policies:

What I have come to learn is that if you navigate to http://www.myshccommunity.com/Privacy.aspx you could actually get one of two policies. […] If you access that URL with a machine compromised by the Sears proxy software, you will get the policy with direct language (like “monitors all Internet behavior”). If you access the policy using an uncompromised system, you will get the toned down version (like “provide superior service”). Both policies share the same URL and same look and feel – coloring, page layout, Kmart and Sears branding, etc.

In other words they have a policy that implies that it’s inoccuous prior to installation, which then springs into sharp relief once you’ve crossed the Rubicon and installed their spyware – nice touch!

(Via Bruce Schneier)

Taking the plunge (updated)

I’m about to try and upgrade this blog from WordPress 2.0.11 to 2.3.2, expect breakage for a while!

Updated

That was quick (as ever) – I’m now running 2.3.2 – now for the plugins..

OK – plugins done and theme hacked to do a better job at being a variable width theme!