Twisted sprint; Google spam, round 3

Twisted sprint

Andrew and I are flying to Hobart this evening for the inaugural Australian Twisted sprint. Sprinting will be competing with tourism a bit, but I’m looking forward to both, despite this being another weekend in which I am not sponging around on my beanbag.

Google spam, round 3

Seems Google’s business development people (or their cyborg stand ins — the mail has a reference number, so it’s possible it’s all handled by The Machine) were concerned I didn’t reply to their last mail. So, they sent a followup.

I sent the following in reply:

 On Wed, Mar 30, 2005, Google Crawl Coverage wrote: > Greetings, > > I just wanted to follow up with you to confirm that you had received my > previous email. If you are not the right person for this, perhaps you > could forward it to the appropriate person within your firm. > > Thanks. I look forward to hearing from you.  I received it. I've deliberately chosen to have a number of domains I run, including this, excluded from Google's index (and from many other robot crawls). It's not a site designed to be useful to casual viewers, and it isn't a commercial site so attracting extra visitors has no commercial advantage to me.  I'm concerned about this particular business development scheme of Google's -- what if every search engine sent me repeated queries about my robots.txt file? That would be very time consuming to deal with. I know the implications of its contents and deliberately chose them.  -Mary 

And hopefully that’s the first and last time I’ll reply to spam. Invoking the categorical imperative in that reply was silly anyway: much of the idea of a business plan is to do something that no one else is doing and hope that it doesn’t become a universal law.

Note to self

<spiv> Ok, another upgrade, this time for universe. This should be the last net connection-choking thing I do today.
<spiv> Uh, weird. It self-aborted mid-download.
* spiv lets aptitude just do whatever.
<spiv> Ah, fuck.
<hypatia> ???
<hypatia> *suspense*
<hypatia> ???
<spiv> dpkg segfaulted, and now the machine has hung.
<spiv> *sigh*
* spiv sysrq reboots.
<hypatia> Ew, suck
<hypatia> Maybe you should have done the whole thing at once??
<spiv> Possibly, but I wouldn't think so. Anyway, I'm fscking to make sure it's not like the weird error I saw on trogdor.
<spiv> Anyway, file this one away for next time you feel like computers only hate you 🙂
<spiv> They hate everyone 🙂

Consider it noted.

Later:

<spiv> Ah, kernel BUG during fsck.
<spiv> I'll try without the proprietary NVidia module, then I'll try vanilla 686.

Just for reference, he’s trying to upgrade an underclocked (long story) Athelon machine with a August 2004-ish snapshot of Debian sid to Ubuntu Hoary (as of today). Um, don’t try this at home… ?

Business development

Is this really Google spamming me because LinuxChix Live‘s robots.txt file is a universal block?

This line at least does seem to imply that it came from their netblock (the public google.com IP addresses also begin with 216.239):

 Received: from 216-239-45-4.google.com (216-239-45-4.google.com [216.239.45.4]) by fuchsia.puzzling.org (Postfix) with ESMTP id 4D2DF1016C for <additions@live.linuxchix.org>; Tue,  1 Mar 2005 05:58:08 +1100 (EST) 

That’s kind of … uncool. One might almost say… unsolicited commercial email.

I did get something like this once before and forwarded it to abuse@google.com, only to get an instantaneous automated reply telling me that any spam I was reporting could not possibly have come from them.

Perhaps it didn’t and it’s a rather more competent header forgery than most spam. The forensics will be interesting in that case.

Cleansing the mind

I bring you this list of things I do to stay sane on the internet more in the spirit of catharsis than anything else. In saying it, it no longer binds my heart. I am free! And so on.

By no means is this intending to be convincing: you do not have to choose your reading material like I do. By all means read things that piss you off. Even more important, read things that piss me off. Someone has to do it. I choose… you.

That said, I bring you Mary’s rules for staying sane on the Internet:

  1. re-read Charles’ Rules of Argument monthly;
  2. do not syndicate an RSS feed if anything in it ever causes me to stand up and pace around having an imaginary argument with the author;
  3. do not syndicate an RSS feed if more than one-twentieth of the posts are commentary about weblogging as a fabulous community building tool (and that number is only one-twentieth so that Crooked Timber and Easily Distracted slip under the radar), making exceptions only for people in their first months of weblogging;
  4. unsubscribe from an RSS feed when I read less than half the content;
  5. permanently unsubscribe from all mailing lists on which I read less than half the content;
  6. temporarily unsubscribe from mailing lists when at least two posts in any one thread cause me to pace around the room having an argument with either their author or everyone who’s contributed to the thread;
  7. permanently unsubscribe from a mailing list the third time the previous condition is met (I make exceptions for the LinuxChix lists because they’re the only very active and vibrant social lists I’m on); and
  8. delete unsent about one third of all emails I compose to mailing lists.

Ubuntu’s Hoary Hedgehog release; Mutual obligation

Ubuntu’s Hoary Hedgehog release

I upgraded from the stable Ubuntu to the development version (due to be released in April as, I believe, 5.04) on the weekend. So far I haven’t actually noticed very many dramatic changes. Booting is marginally faster. There are now suspend scripts that don’t work (well, for me, there’s positive feedback about them in general). The default fonts look different. Otherwise everything seems pretty much the same.

One thing I was reminded of, particularly when upgrading my workstation which has such a slow disk that the UML server that runs puzzling.org can install packages faster, is that desktop Ubuntu systems have all the Python development libraries installed. It’s nice that they’re all supported, but I’ve developed using Python as a primary language for four years or so and I’ve still used only five of those libraries (more once Twisted splits into packages…). I’d be happy to install them as needed, just like I would gcc.

Mutual obligation

Alignments of the stars this week have me going to a meeting at SLUG on Friday to try and organise a community program that tries to get local volunteers together to work on Free Software. Why add this layer between local volunteers and Free Software, given that most projects have open development communities?

Well, there are a couple of reasons. The first is that to receive unemployment benefits in Australia you need to fulfil a ‘mutual obligation’ requirement in which you seek work, train or give back to the community. Setting up a local community group/non-profit will allow people on the Newstart allowance to work on Free Software (and docs and bug triage and mailing lists and…) as part of their mutual obligation. The second is that there is a group of somewhat nebulous size who would ‘love to help out’ but aren’t for whatever reason suited to jumping into new communities and offering their work up. Having a group set up to specifically push them into projects may help here: it will be interesting to see.

This is something that’s been in the background of my thoughts for years now: working with people who aren’t used to the standard model where you just start doing stuff; or the other standard model where you hang out on IRC for a few months until people know you’re not an idiot, and then you start doing stuff. This third group is the set who ‘need an invitation’, if you like.

Clarification re Ubuntu and compilers

Some questions imply that I wasn’t terribly clear about this, so now I will be clear: Ubuntu has supported binary gcc and g++ packages. You can install them via apt-get/aptitude/synaptic. (By the way, on Debian systems I use aptitude now because it especially marks "packages that were only installed because I installed another package that depends on it" and automatically removes them when nothing depends on them any more. Even better than deborphan.) It’s probably easier to grab the build-essential package though, which drags in those and make and a few other things as dependencies.

The ‘problem’ with the compiler is not that it is not packaged and supported, it’s just that if you whack a CD in your machine, choose the default install, and then log into your brand new Ubuntu machine, you will find that the compiler is not yet installed.

This is not something I personally consider a problem, possibly because I’m a Debian user and also because while I have tried the RPM distros I came to them rather late anyway (RH8) when apt-get like tools (yum, apt for rpm) were not far away. I’m very used to the idea that software that’s not installed yet is just sitting in an archive or on the CD somewhere for me waiting for me to use a nifty tool to download and install the package. (As for satisfying dependencies of packages, my automatic reaction on any distro whatsoever now is "what’s the apt-get equivalent on this one?" It pretty much always has an answer now too.)

It appears though that some people don’t think this way. They think "either it’s on the computer at the end of the install or it’s a major disaster involving downloading the packages myself and resolving the dependencies by hand or it’s a super-major disaster involving downloading the gcc source and trying to bootstrap a gcc compile." They don’t imagine this fourth option where it’s not on the computer yet, but hey, it’s just there on the CD waiting for me to type "aptitude install gcc". (Actually, there are people who do realise this and just hate the idea, but the entry wasn’t about them.)

And these users are not a good fit for this particular design decision of Ubuntu’s. Which is a pity actually, because I thought it was rather a good one myself.

Python in Sydney; Ubuntu among the Python programmers

Python in Sydney

Alan Green organised a Sydney Python Meetup last week. I’m pleased someone stepped into this slot: I was organising Python meetings a few years ago, ran out of energy during honours and was never really inspired to start it back up because meeting attendance never got above five or so. I hope Alan has better luck: he got fourteen for the first meeting.

Alan has given a full account of the talks, so I’ll just note one thing: the presence of Tim Churches, an epidemiologist from the New South Wales Department of Health, who was behind the open source release of NetEpi. If any of the state government’s employees would like to follow suit, it took Tim 18 months and 37 memos to get the release past the government. Beat that!

But the real reason I was particularly interested to meet Tim was that he’s the first example of a phenomenon Anthony Baxter tells me he sees all the time: the professional who just needs a dab of programming and chooses Python because it’s ideal for people who aren’t specialist programmers. Apparently it’s increasingly popular among scientists in particular.

Ubuntu among the Python programmers

I wanted a pressed copy of Ubuntu rather than having to download one, so I went to get one for free. They helpfully informed me that I might as well get a bunch since the cost of shipping massively outweighs the cost of CD production. So I got 10 i386 CDs and 5 PowerPC CDs. I was at a loss for what to do with them — I wore out my parents’ tolerance for installing stuff on their machine when I was 12, and most of my non-geek friends are now canny enough to do the "only if you install it and fix it for me whenever it stuffs up!" trick with proposed Linux installs — so Andrew took them along to the Python meetup and gave them away.

Alan was most impressed that the LiveCD worked on his laptop — alas that with the 4.10 release (Warty) of Ubuntu that’s apparently absolutely meaningless when answering the question "will the installer work?"

Tim was pretty dismissive of Ubuntu though on the "I tried it and it doesn’t even have a compiler!" principle. This isn’t actually true — gcc isn’t installed by default, but it’s supported and I believe it’s on the CD — but since it cost Ubuntu a user I’ll have a closer look at why I think he thought this.

I wasn’t really party to it (I joined the pre-release test team after they decided not to include a compiler in the base install) but as I understand it, the idea was most users won’t want a compiler in their install because most users aren’t programmers and the idea behind distros like Ubuntu is that you shouldn’t need to compile software for it. Further, and this seems to be key, people who want the compiler will know where to get it. Or I assume that was their premise, perhaps with the addition of "developers will read the documentation." (I’m not sure though, "where is gcc?" hasn’t made the FAQs.)

Anyway, it turns out that this isn’t the case. There are developers who, if the compiler is not installed by default, do not think to try and install it because they have no inkling that it might be there. My suspicions about the reasons:

  1. they aren’t former Debian users who are used to their distro having every conceivable piece of Free Software available in a centralised archive: what the installer offers is what there is; and
  2. they are not expert Linux users, by which I mean that they don’t have server sysadmin or security skills. It’s quite hard, in fact probably pointless at this time, to program on Linux without knowing a shell reasonably well, but it is possible to program on it without knowing much about how software gets installed on it. You put a CD in, perhaps select "development workstation" and it installs stuff. When you want newer software, you do a new install. When you want different software you might look at a different distro.

I might be reading too much into this, and in any case developers, whether power users, sysadmins or none of the above, aren’t the Ubuntu market, but assuming that anyone who programs using Linux has Linux expertise above that of a desktop user is probably wrong.

Fixing the Planet <h1> <h2> evil; Bad SLUG, no slug biscuit

Fixing the Planet <h1> <h2> evil

I notice that Andrew’s recent entry has triggered all kinds of evil on Planet SLUG and to a lesser extent on Planet Twisted. The nature of the evil is two-fold:

  1. RSS feeds contain an unrestricted subset of HTML tags, therefore there is a collision between tags the maintainer of the planet wants to use in their own content that goes around the blog entries, and tags the entry authors use.
  2. HTML doesn’t really do sectioning. At one point I heard XHTML2 was going to have section tags with h tagged heading embedded within them. This is a better idea than having the author of pieces of content needing to know what level of heading tag to use to fit into sites designed by someone else.

The solution I’m thinking of using on planets I maintain will apply special CSS properites to headings inside entries to make them less prominent:

 div.blogbody h1, div.blogbody h2 { font-size: large; text-decoration: none; } 

I should look up how to reduce their prominence to screen readers too. Alas, the fundamental semantic nastiness of having the following in the HTML will remain:

 <h2>Blog Entry title generated by Planet software</h2> <h2>Blog entry <em>sub-title</em> coded as h2 by content author</h2> 

Bad SLUG, no slug biscuit

I think Planet SLUG is still sending the character encoding as iso8859-1 while they’re actually sending utf8 content. (eg the character ‘ó’ in Andrew’s entry)

Testing; Online appeals; Ambition

I haven’t even being doing much tech stuff over the last two or three weeks…

Testing

Well, that’s entirely false. I have a new job. A fulltime programming job. In Python. I only just escaped signing a "we own your brain" clause which would have meant that I would have nothing to say because it would be owned. But it was modified and I continue to own my brain during off-hours. And occasionally I will continue to use them for tech stuff.

Tech-wise, the brains’s been busy with thoughts about test driven development, which I’ve never really done before. But I’ve spent large chunks of the last fortnight updating tests to work with modified code. I’m beginning to wonder whether optimally some other programmer should always write the tests for your code. But this may be unsellable: it takes long enough to write tests for your own code.

Online appeals — a review

Andrew and I gave money to the tsunami appeals: Andrew to CARE Australia and me to the Australian Red Cross. (As a side-note, why I am so susceptible to guilt trips? I keep seeing "only ten thousand people have donated through Amazon!" guilt trips around and falling for them. Note to self: failure to donate through Amazon is not any meaningful kind of failure.)

Within one second of my provision of my CC details to those nice folks at the Red Cross an ‘official receipt’ email was on its way to me thanking me for my donation and telling me how the Red Cross is intending to use the funds. Nearly a week after donating to CARE, Andrew’s still waiting for them to actually process the donation and bill his card. (I rather suspect their online billing handlers too the Christmas-New Year week off…)

Ambition

It’s time I had some. I’m close to declaring 2005 to be the year of ambition, much as 2004 was an explicit break from it.

Bouquets; Brickbats

Bouquets

I still don’t like online editing too much, but here’s why I love MediaWiki anyway:

Maturity
As far as I can tell, the software is mature enough to have moved well past problems with doubly or triply escaped ampersands and so on. This sounds fairly trivial, but a lot of wiki software is still in the land where you need to go back through a page every time you edit it and change all occurrences of &amp; to &, lest they be escaped to &amp;amp; on the next page submit! Similarly, I’ve never encountered problems with entering non-ASCII characters into a MediaWiki page.
Talk pages
Every MediaWiki page has an associated talk page, which it’s customary to use for criticisms, meta-discussions, and todo lists. (Not for first drafts though, these are wikis.) It’s amazing how productive I find the ability to make notes of my decisions, rather than silently worry about them.
Edit conflict notification without page locks.
Rather than giving an editor a time-limited lock on a page, a MediaWiki site will let two people start editing a page. However, if the page changes between you beginning to edit it and you submitting your edit, MediaWiki notices and presents you with two input boxes: your edit and the new version of the page. I can see how in some circumstances edit locks are nicer than requiring hand merges, but I generally prefer the latter.

Brickbats

I wasn’t going to do this after the last couple of times, but boo hiss to St George and Optus for having pages incompatible with my browser when they used to work. In the first case at least, it’s due to a buggy commercial browser detector.

Also, a brickbat to Australian English vowels for making it hard to spell. We use the neutral so much that I find it very hard to choose when to use ‘a’ versus ‘e’ in unstressed syllables.