30 Mar 2005 Scripting News: 3/30/2005
It's always fun watching my good friend Dave Winer develop an argument. It starts with a big contentious statement that is obviously missing the justification. In this case, this would be "The problem with the EFF position is that in order to remain consistent, they have had to say that copyright doesn't exist". This looks and smells like a traditional Usenet troll. So rather than dive in and flame Dave unmercifully, I've become used to waiting for him to get to the point and elaborate a little.

So today we get a pair of test cases ad absurdum. Both involve taking content from the EFF and Cory Doctorow and suggesting...

I'd add links to their content, and see if they object. If that isn't a problem, I'll start changing the words, and see if that works for them. Then I'll put my name on their work, I imagine that would be okay too. Why not? I'm just being creative! Then I'll change their positions to be more in tune with the entertainment industry. Somewhere in there, there's got to be a line.

So I thought I'd check the licenses on each. The main page of the EFF has an "Attribution-NonCommercial 2.0" license. So Dave is not just allowed but encouraged to produce derivative works of their content, just as long as he provides attribution and doesn't attempt to sell it. So the only problem with his suggestion is "Then I'll put my name on their work". And this is a problem if and only if he removes theirs.

Cory's Eastern Standard Tribe is licensed with Attribution-NoDerivs-NonCommercial 1.0. This again is completely clear, and it's just as clear that every suggestion Dave makes is completely against Cory's wishes. In fact, Cory goes to pains to spell out what this means well beyond the Creative Commons license.

Now hidden away in these two posts is an inkling of what might be happening and what the real issue is. "add links to their content", "Cory". So really, I think this is about Dave's dislike for Google's Auto-Link toolbar and Cory's noisy defence of it.
Over time I expect this to rise to the surface and for a real argument to emerge. So rather than argue about whether the EFF want to destroy copyright or not (I think they don't), let's go back to arguing about what happens when you put text on the web in plain old HTML.

My position is that the moment you do that all bets are off. You may want to have it seen in only one style, but by the time I've changed all the proper nouns to Wikipedia links, all ISBNs to Amazon links, changed the style sheet, used a different browser, blocked all the ads, taken the result and scraped it into RSS so that I can read it offline on my iPod, it will be completely unrecognizable[1]. And there's not a damn thing you can do about it. If you really don't like that then lock it up in PDF, at which point I'll simply stop reading you.

But even when I've said all that, why were MS Smart Tags evil and Google's Auto-Link not? Is it just that a few years have passed and we understand a bit better what's possible and what isn't?

Dave keeps talking about a line as though there is some qualitative difference between adding a link and changing a style sheet for accessibility. I keep seeing quantitative changes in bits sitting in someone else's computer and little more. But then I don't have the reverence for the written word and free speech that Americans seem to have. I prefer the situationists, the post modernists and détournement. Derivative works? Hell yes! Tag it, photocopy it, spray paint it and slap it with glue on the side of the Bastille.

[1] Yesterday had a great example of this. Start with del.icio.us, view it in Firefox, add the Greasemonkey extension, add some custom JavaScript accessing del.icio.us web services. And what you get is auto-suggest layered on top of the form that Josh wrote.

ps. To any #joiito denizens reading this, I apologise unreservedly for invoking He Who Shall Not Be Named. I'll just have to blame the Imp of the Perverse sitting on my shoulder and whispering in my ear.
[delicious-discuss] big news
del.icio.us gets funding, Josh goes full time. WOOT! But how will it ever make money?

Del.Icio.Us Auto-Complete - John Resig
Really rather cool example of using Firefox, the Greasemonkey extension, user JavaScript and some cool code to make the del.icio.us posting interface auto-suggest tags. [from: del.icio.us]

How Not To Blog - RSS + Email. The Next Stage in Messaging
See also http://hownottoblog.com/index.php/2005/03/26/rss_aamp_the_evolution_of_online_chat. There's something here but I'm not sure what it is. [from: del.icio.us]

Radio DavidByrne.com
What we need is not randomness, or recommendation systems, but better DJs [from: del.icio.us]

29 Mar 2005
I keep thinking about Last.FM's radio player and its three-button "Love, Skip, Ban" control. I want to put these buttons all over the place because they look to me like the bare minimum UI to build a rating system. So how about:-
- A Firefox status bar extension to the last.fm radio that makes these buttons always available
- The three buttons on Google AdSense displays so that the publisher can express preferences for particular ads.
- Put the buttons on a TV remote and link them to TiVo.

Which then got me thinking about Amazon and their recommendation system. To make it work, you have to go through and manually rate items you already have. But I only have a 3-state reaction to a lot of things, not a 5-way grade. I love it, it's OK or I never want to hear/see it again.

26 Mar 2005
Big Champagne track P2P downloads and produce a Billboard-esque top 10. As you'd expect the top 10 mirrors CD shipments. It would be interesting to do some long tail analysis of their data.

Key questions:-
- Total number of unique files in a week
- Total number of files that are of music that has been deleted from the music biz catalogues or is just not in commercial catalogues, and so is unavailable anywhere else.
- Total number of files that are unavailable on commercial downloading sites like iTMS, etc.
I'm sure you could think of more.

I went to San Diego last week and while I was out there bought a Fujitsu 80Gb 5400rpm drive for the laptop. I also got a USB2 laptop drive enclosure. So over the last 24 hours I've been swapping and upgrading disks across 3 laptops. Just to recap, I was starting with a recent Toshiba laptop with a 30Gb disk configured as FAT32. The disk was pretty much full with about 1.5Gb free.
I started with my own laptop. The new drive goes into the enclosure and is connected via USB. I opted to start with Norton Ghost. Two overnight attempts at this failed with a memory error after a good 5 hours of running. Not good. It seems that Ghost does a pretty good job of copying volumes across as long as you don't take the option to extend the partition to use the extra space. So what I finally did was use Ghost to copy the main volume into completely unallocated space on the new drive and not attempt to extend it. I then swapped the drives so the new one is in the laptop. Then I ran convert.exe to change it from FAT32 to NTFS. The final trick to extend the partition across all the available space was to boot the machine from a Knoppix CD and use QTParted to adjust the partition.

It may seem strange to use Linux to modify a Windows machine, but I didn't want to buy Norton Partition Manager as well as Ghost. In retrospect, there probably is a way to use Linux utilities to do the initial copy, but QTParted doesn't do partition copies and Partition Manager on Knoppix is a command line utility and not at all obvious. And I haven't got my head fully around using Knoppix and Windows/Samba networks.

So then it's on to my son's Fujitsu Lifebook which apparently has a 20Gb disk. Firing up Computer Management, Disk Management, I was amazed to see that it actually had a 60Gb disk and the person who installed XP had partitioned it as 20Gb NTFS and 40Gb of unallocated space. duh! Again, 10 minutes of booting into Knoppix and running QTParted and it had a single 60Gb partition. Result!

The third machine is a quite old Dell Inspiron 4000. This has a 20Gb disk and, out of all the work above, I had a spare 30Gb disk in the USB enclosure from my machine. Unfortunately as well as being quite slow, the Dell only had a single USB1.1 port. Trying to do volume copies across this was going to take half a day. So this time I used the network.

Ghost did a backup across the network to space on my main laptop. I then used Ghost to restore this (without changing the partition size) onto the USB enclosure using my main laptop. Now I've got a 20Gb bootable partition on the 30Gb disk with 10Gb spare. Next, swap the disk with the Dell's. Convert to NTFS. Finally, boot the Dell into Knoppix and adjust the 20Gb partition to 30Gb. That leaves the Dell's 20Gb disk in the USB enclosure, so that gets cleaned and reformatted and can be used for backups or portable storage.

So here are some lessons in all this.
- 2.5" HD USB2 enclosures are cheap and damn useful. In the UK they are about £20 ($20 in the USA)
- 2.5" 5400rpm laptop drives are now pretty cheap (< £100, < $150) and available in up to 80Gb sizes. 100Gb drives are just appearing but all seem to be 4800rpm.
- Norton Ghost will do volume copies from one disk to another or from the internal boot disk to a USB disk but seems to have problems with adjusting partition sizes afterwards.
- This may be because adjusting a FAT32 partition seems to involve moving and adjusting every file, whereas adjusting an NTFS partition only involves changing the partition table (as long as the partition doesn't move).
- So, if you're on FAT32, switch to NTFS as early as possible.
- If you get a new machine with Win XP installed, don't just assume the disk has been partitioned properly. At least have a look!
- Knoppix is an incredibly powerful rescue CD for Windows machines. It's free. You've got to find your way round a whole new OS. But its ways of doing things are not totally alien. And the utilities included are really powerful. I'm sure there's a way of doing partition copies across disks but I haven't worked it out yet. Assuming that's possible there's really no point in buying Ghost for this task. Though you may still want Ghost for its backup abilities.
- Make sure you've always got a good copy of whatever you're doing before doing anything that could potentially destroy all your work.
- If you're not comfortable with hacking around like this, don't even start. Take the laptop and new drive to a shop and pay them to do it.

I think I end up where I was when I started this. I'm somewhat surprised and disappointed that XP doesn't have the tools built in to do these sorts of tasks. Pretty much anything you do with Fdisk or Disk Management is destructive and you lose data. There's no utility to copy partitions. And backup (at least in XP Home) is pretty limited.

24 Mar 2005 Delicious Linkbacks
How very neat. Go to a web page, hit the bookmarklet and get a list of the del.icio.us pointers to it. [from: del.icio.us]

- Yahoo! Search for Creative Commons licensed content. You can search for content that is free for commercial use and/or where you are free to produce derivative works.
- New set of Search APIs so you can develop programs that interact with Yahoo's search.
- Blogging and social networking.
- Acquires Flickr for photo sharing and publishing.
- Upgrades mail service to 1Gb.
- Introduces a virtual market for ideas. Yahoo Buzz is a notional market where you can track, buy and sell, and create futures in notional ideas with notional money. Tech Buzz is the same thing for technical products and services.

If it wasn't that Google search and news were still better than Yahoo's, I'd go long on Yahoo and short on Google. And then there's all those really annoying ads on Yahoo. [from: JB Ecademy]

Tom Coates has another big idea.
plasticbag.org | weblog | Social Software for Set-Top boxes...
This is TV meets last.fm

23 Mar 2005

19 Mar 2005
When I was at school, we did a business game course to try and get us to understand what manufacturing was all about. My group decided we were going to liquidate everything and go out of business. So in the last 3 rounds we stopped manufacturing anything. Our production costs went to zero, our warehousing costs wound down, while income remained the same. We didn't win but we came second for money in the bank.

So what happens when the music biz realises they are screwed and decides to go out in a blaze of glory and with money in the bank? The plan (as explained here before) is to digitise everything they've got in the archives that is mastered or at least in more or less final form. Put it all up on an AllofMp3.com style site where you can download it in your choice of encoding and with no DRM. Set the price at around 5 cents a song. And go for broke. See if they can get at least 1 billion downloads in a year. This would bring in huge amounts of money, but it would also as a side effect flood the P2P networks with high quality, properly tagged MP3s with no DRM. So they'd be using their entire history as seed capital for whatever the next big idea is and in the process take down the whole current distribution chain of hardware CDs. Isn't a supernova better than going out with a whimper?

18 Mar 2005
Had an idea last night for a web site called ThisSucks or NoClothes or such like. It would be for people to expose the emperor with no clothes. To be able to open the window and shout "I don't care what everyone else says, This Sucks".

The plan is for people to post to del.icio.us with the tag ThisSucks/their_tag. The site would then pick these up and republish them via RSS and probably via a lazyweb style trackback as well. I'm probably not going to do this, but maybe by the power of lazyweb somebody else will. And here's my first. Google AdSense Sucks for Bloggers!

16 Mar 2005 Clay Shirky
What we think we know about categorization is wrong, because we're holding onto old, outmoded techniques for categorization.

Q: What is ontology? A: It depends on what the meaning of "is" is. The study of what exists in a domain and how these elements relate.

The parable of the Travel Agent. Travel agents exist to distribute the interface between a handful of airlines and a large number of consumers. The web replaces this so the TAs claim they add value. What's surprising is that the internet plays tried to use the same argument. They tried to recapitulate the old order rather than undermine it. It took some time for people to realise the problem had changed.

Classification schemes. Periodic table. Best classification scheme ever. Almost perfect. But there are context shifts: a whole column was labelled "gases" when that's only true in some temperature ranges.

Libraries are the commonest classification system. And have huge fundamental mistakes. eg the Dewey scheme's religion category is all Christian. The Library of Congress treats Asia and Switzerland as equivalent in size. The essence is actually "number of books" about this topic. It optimises linear shelf space. Not reality. Unfortunately librarians now are using the same approach in the digital domain where shelf space is irrelevant. The argument, as with travel agents, is that they are recapitulating what went before instead of undermining it.

Yahoo grew into a hierarchy of categories. So they hired a professional ontologist. Who built a huge tree. They said "we understand this better than you". They felt they couldn't organise the world without the shelf so they added the shelf back in. And so we get a tree structure. But the world isn't tree structured. So add a few cross links. So let's have a hierarchy with lots and lots of links. But the ontologists said "get outta here" and limited them to a maximum of 3. In reality, there are lots of links and no tree. And Google took over because there is no filing system.
There's only links. Google adopted DMOZ for its directory, but nobody used it so they downgraded it.

When does ontological organisation work well? Small corpus, formal categories, stable entities, clear edges, coordinated users, expert users, expert cataloguers, authoritative source. Note: ontologists often claim the users don't understand the categories. And see this as the user's problem. Turn it around and you have where it works badly. And that is a perfect description of the web. Huge scale, uncoordinated users, no authority.

Voodoo categorisation. Act on the model and it changes the world. Classify an SUV as a small truck and it becomes popular.

Signal Loss. Ontologists claim that synonyms fail. But actually synonyms refer to different things.

Predicting the future is hard. A. This is a book about Dresden. B. This book is about Dresden, and goes into the category "East Germany". Ooops. Countries are radically different to cities. One is an idea, the other is physical. But we can't change it because we don't have the staff to move the books. Absolutely key. Categorization requires predicting the future.

"My God, it's full of links".

Adventures in scale pt.1 Don't merge categories, merge the GUIDs. Great minds don't think alike.

Adventures in scale pt.2 del.icio.us. Power law distribution of people and numbers of tags they've done. Long tail. Classic sign of an unconstrained population behaviour. Look at number of entries for tags for one person, and it's another power law. 10% of the tags have 90% of the entries. Now look at 2 URLs and study the tags used against them. A lot of entries have very clear convergence. Some URLs have classic power law curves with less consensus. Which gives us a measure of the certainty of the popular tags.

Organic Categorization
- Market logic: individual motivation but group value.
- Merged from URLs (links), not categories
- Merges create overlap, not sync
- Merges are probabilistic not binary
- User and time are core attributes
- Signal loss comes from expression not compression
- One off categories are ignored, rather than deflected. Filtering is after the publishing. (very deep idea here).
- The semantics are in the users, not in the system.

Does the world make sense or do we make sense of the world? Objective vs subjective. Recognises that there are alternate views. (note: If you don't understand Unix, you are doomed to re-invent it. There is only World, Group and User)

In a primary school that had no server. How it works.
- The teacher runs Instiki on her iBook.
- Students find the wiki on the LAN via Rendezvous
- Security is simple. Turn the laptop off.
- Students and teachers can easily find their work.
- The app is very responsive
- This solution is not supported by the local IT department!
- Teacher and laptop need to be present. Which is tricky when she's sick and there's a temp.

The whole thing started as a weblog post. Trying to create an OSS school administration platform. All REST and built on Zope.

Clay Shirky. Teaches a course for computers and art. Aimed at artists and creatives who aren't afraid of the machines. Looking for things which ought to exist and trying to bring them into reality
The students are running point and they started to bring phones into what they were doing. Now more formal.

PacManhattan. Big Games Class. Tried using GPS but in an urban environment it doesn't work. So fell back on two way voice, over phones. A control room controls a runner.

ConQwest. Joint venture with Qwest. Combines 2d barcodes (semacodes) with cameraphones. Automated server interpreting the semacodes and sending data back to phone.

Dodgeball. First mobile social software. Problem is that you have to tell the system where you are. But it is SoSo that becomes part of real life. First problem was the ex-girlfriend bug. Your ex is still a friend of a friend so you get messages suggesting you meet!

Mobjects/Hearbeat. Bluetooth huggable piece of soft plastic. Send a hug to someone and their Mobject glows and demands a hug back. Phone hardly used as a device.

- Standard connectivity beats local flexibility.
- Only the minimum platform is widespread.
- SMS had in USA (doh!)
- Developers lack experience and tools
- Device manufacturers unfamiliar with hackishness
- OWNERZERD by the US carriers. (same in UK).
- Server infrastructure is key. Pry data out of carriers.
- Out of band (eg Flickr) allows complementary value.
- Supports CPU-intensive post-processing
- Phone # is universal primary key

Underuse of Voice
- VoIP. Mesh is coming but not soon.
- Dodgeball is social mesh on point to point links.
- Multi-network coming. Bluetooth, Wifi

Note here the differences with the UK. And the recent announcement that Broadreach WiFi hotspots would be free for Skype only. So Skype + SkypeIn and SkypeOut + wifi + PDA + Broadreach = free encrypted phone calls with presence.

I was just walking through the lobby and came across a Brit I didn't know. He's got his Apple iBook open facing outwards with a camera on the top running iChat and the remote video on full screen.
On the screen is Suw Charman in the UK (a regular on IRC #joiito) talking and listening as he walks round the conference. Hi Suw!
In the hall is a big Apple screen. It's running the feed from a chat session. The chat is full of bots that are scanning the crowd for Bluetooth devices and then looking up the IDs on Google and matching them with people's websites. It's also getting a feed from Technorati of all the blogs and photos people are using to document the conference. You should be able to see this at http://etech.inroomchat.org/chatlogs/

The Brits are a little subculture within the conference. They're constantly giggling and cracking jokes about obscure UK TV programs. During the last corporate presentation from Nokia there was a mass IRC coordinated walkout.

There seem to be very few Bluetooth headsets here compared with the UK. But everyone has a cellphone pressed to their ear.

Out on the streets, the Californians are all trim and lightly tanned. Very few *big* people compared with the rest of the US.

Clay Shirky (commentator), Stewart Butterfield (Flickr), Joshua Schachter (del.icio.us), Jimmy Wales (Wikipedia)
Wikipedia categorization started last summer. English was chaos for some weeks. It self organised quickly but took a while to rationalise.

Stewart. Tags are not necessarily a replacement for categories. 200,000 tags.

Joshua. del.icio.us started with a personal text file with 20,000 url entries. Then he started adding #tag on the end so that he could do search and replace. Then it became a web site. Then multi-user. What's interesting is community behaviour where people group round a common tag that means nothing in itself.

JB: I love this! This is how open source software gets written. It starts with a personal itch that you can't stop scratching. And often because the simplest possible tool you're using doesn't quite cut it any more.

Flickr: People using the comments attached to a photo or tag to have a conversation. So the tag or photo becomes a placemark for an on the fly discussion board.

Q from Marc Canter: Can we share tags across systems? Technorati already doing that. (Incidentally, Marc keeps asking this and I don't get what he's asking for)

There are no bad tags. As long as they are useful for the user and there is feedback they will tend to be good enough. There is still a UI problem with finding things tagged with say Java when people used JDK. Making it usable relies on clever UI around "Related" tags.

"the point wasn’t to let you find all and only pictures of elephants, it was to give people better tools for organizing their own pictures, it was a happy accident that it worked across users."
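On the "Related" tags point, here's a minimal sketch of one way such a list might be built, assuming nothing more than a set of bookmarks each carrying a handful of tags. The sample data, the example.com URLs and the related_tags function are all made up for illustration; I have no idea whether del.icio.us or Flickr do it this way. The idea is simply that tags which keep turning up on the same items as "java" are worth offering alongside it, so someone searching on Java gets nudged towards the things filed under JDK.

```python
from collections import Counter


def related_tags(bookmarks, tag, top_n=3):
    """Rank other tags by how often they share a bookmark with `tag`.

    `bookmarks` maps a URL to the set of tags people put on it.
    This is plain co-occurrence counting, an assumption for this
    sketch rather than any real site's algorithm.
    """
    counts = Counter()
    for tags in bookmarks.values():
        if tag in tags:
            # Count every other tag that appears alongside the one we want
            counts.update(t for t in tags if t != tag)
    return [t for t, _ in counts.most_common(top_n)]


# Hypothetical sample data: URL -> set of tags
bookmarks = {
    "http://example.com/a": {"java", "jdk", "programming"},
    "http://example.com/b": {"java", "jdk"},
    "http://example.com/c": {"jdk", "sun"},
    "http://example.com/d": {"elephants", "photos"},
}

print(related_tags(bookmarks, "java"))
```

With this sample data "jdk" comes out as the strongest related tag for "java" because it shares two bookmarks with it. Nothing here needs anybody to agree that the two tags mean the same thing, which fits the "semantics are in the users" line from Clay's talk.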