Business development

Is this really Google spamming me because LinuxChix Live‘s robots.txt file is a universal block?

This line at least does seem to imply that it came from their netblock (the public google.com IP addresses also begin with 216.239):

 Received: from 216-239-45-4.google.com (216-239-45-4.google.com [216.239.45.4]) by fuchsia.puzzling.org (Postfix) with ESMTP id 4D2DF1016C for <additions@live.linuxchix.org>; Tue,  1 Mar 2005 05:58:08 +1100 (EST) 

That’s kind of … uncool. One might almost say… unsolicited commercial email.

I did get something like this once before and forwarded it to abuse@google.com, only to get an instantaneous automated reply telling me that any spam I was reporting could not possibly have come from them.

Perhaps it didn’t and it’s a rather more competent header forgery than most spam. The forensics will be interesting in that case.

Cleansing the mind

I bring you this list of things I do to stay sane on the internet more in the spirit of catharsis than anything else. In saying it, it no longer binds my heart. I am free! And so on.

By no means is this intending to be convincing: you do not have to choose your reading material like I do. By all means read things that piss you off. Even more important, read things that piss me off. Someone has to do it. I choose… you.

That said, I bring you Mary’s rules for staying sane on the Internet:

  1. re-read Charles’ Rules of Argument monthly;
  2. do not syndicate an RSS feed if anything in it ever causes me to stand up and pace around having an imaginary argument with the author;
  3. do not syndicate an RSS feed if more than one-twentieth of the posts are commentary about weblogging as a fabulous community building tool (and that number is only one-twentieth so that Crooked Timber and Easily Distracted slip under the radar), making exceptions only for people in their first months of weblogging;
  4. unsubscribe from an RSS feed when I read less than half the content;
  5. permanently unsubscribe from all mailing lists on which I read less than half the content;
  6. temporarily unsubscribe from mailing lists when at least two posts in any one thread cause me to pace around the room having an argument with either their author or everyone who’s contributed to the thread;
  7. permanently unsubscribe from a mailing list the third time the previous condition is met (I make exceptions for the LinuxChix lists because they’re the only very active and vibrant social lists I’m on); and
  8. delete unsent about one third of all emails I compose to mailing lists.

Ubuntu’s Hoary Hedgehog release; Mutual obligation

Ubuntu’s Hoary Hedgehog release

I upgraded from the stable Ubuntu to the development version (due to be released in April as, I believe, 5.04) on the weekend. So far I haven’t actually noticed very many dramatic changes. Booting is marginally faster. There are now suspend scripts that don’t work (well, for me, there’s positive feedback about them in general). The default fonts look different. Otherwise everything seems pretty much the same.

One thing I was reminded of, particularly when upgrading my workstation which has such a slow disk that the UML server that runs puzzling.org can install packages faster, is that desktop Ubuntu systems have all the Python development libraries installed. It’s nice that they’re all supported, but I’ve developed using Python as a primary language for four years or so and I’ve still used only five of those libraries (more once Twisted splits into packages…). I’d be happy to install them as needed, just like I would gcc.

Mutual obligation

Alignments of the stars this week have me going to a meeting at SLUG on Friday to try and organise a community program that tries to get local volunteers together to work on Free Software. Why add this layer between local volunteers and Free Software, given that most projects have open development communities?

Well, there are a couple of reasons. The first is that to receive unemployment benefits in Australia you need to fulfil a ‘mutual obligation’ requirement in which you seek work, train or give back to the community. Setting up a local community group/non-profit will allow people on the Newstart allowance to work on Free Software (and docs and bug triage and mailing lists and…) as part of their mutual obligation. The second is that there is a group of somewhat nebulous size who would ‘love to help out’ but aren’t for whatever reason suited to jumping into new communities and offering their work up. Having a group set up to specifically push them into projects may help here: it will be interesting to see.

This is something that’s been in the background of my thoughts for years now: working with people who aren’t used to the standard model where you just start doing stuff; or the other standard model where you hang out on IRC for a few months until people know you’re not an idiot, and then you start doing stuff. This third group is the set who ‘need an invitation’, if you like.

Clarification re Ubuntu and compilers

Some questions imply that I wasn’t terribly clear about this, so now I will be clear: Ubuntu has supported binary gcc and g++ packages. You can install them via apt-get/aptitude/synaptic. (By the way, on Debian systems I use aptitude now because it especially marks "packages that were only installed because I installed another package that depends on it" and automatically removes them when nothing depends on them any more. Even better than deborphan.) It’s probably easier to grab the build-essential package though, which drags in those and make and a few other things as dependencies.

The ‘problem’ with the compiler is not that it is not packaged and supported, it’s just that if you whack a CD in your machine, choose the default install, and then log into your brand new Ubuntu machine, you will find that the compiler is not yet installed.

This is not something I personally consider a problem, possibly because I’m a Debian user and also because while I have tried the RPM distros I came to them rather late anyway (RH8) when apt-get like tools (yum, apt for rpm) were not far away. I’m very used to the idea that software that’s not installed yet is just sitting in an archive or on the CD somewhere for me waiting for me to use a nifty tool to download and install the package. (As for satisfying dependencies of packages, my automatic reaction on any distro whatsoever now is "what’s the apt-get equivalent on this one?" It pretty much always has an answer now too.)

It appears though that some people don’t think this way. They think "either it’s on the computer at the end of the install or it’s a major disaster involving downloading the packages myself and resolving the dependencies by hand or it’s a super-major disaster involving downloading the gcc source and trying to bootstrap a gcc compile." They don’t imagine this fourth option where it’s not on the computer yet, but hey, it’s just there on the CD waiting for me to type "aptitude install gcc". (Actually, there are people who do realise this and just hate the idea, but the entry wasn’t about them.)

And these users are not a good fit for this particular design decision of Ubuntu’s. Which is a pity actually, because I thought it was rather a good one myself.

Python in Sydney; Ubuntu among the Python programmers

Python in Sydney

Alan Green organised a Sydney Python Meetup last week. I’m pleased someone stepped into this slot: I was organising Python meetings a few years ago, ran out of energy during honours and was never really inspired to start it back up because meeting attendance never got above five or so. I hope Alan has better luck: he got fourteen for the first meeting.

Alan has given a full account of the talks, so I’ll just note one thing: the presence of Tim Churches, an epidemiologist from the New South Wales Department of Health, who was behind the open source release of NetEpi. If any of the state government’s employees would like to follow suit, it took Tim 18 months and 37 memos to get the release past the government. Beat that!

But the real reason I was particularly interested to meet Tim was that he’s the first example of a phenomenon Anthony Baxter tells me he sees all the time: the professional who just needs a dab of programming and chooses Python because it’s ideal for people who aren’t specialist programmers. Apparently it’s increasingly popular among scientists in particular.

Ubuntu among the Python programmers

I wanted a pressed copy of Ubuntu rather than having to download one, so I went to get one for free. They helpfully informed me that I might as well get a bunch since the cost of shipping massively outweighs the cost of CD production. So I got 10 i386 CDs and 5 PowerPC CDs. I was at a loss for what to do with them — I wore out my parents’ tolerance for installing stuff on their machine when I was 12, and most of my non-geek friends are now canny enough to do the "only if you install it and fix it for me whenever it stuffs up!" trick with proposed Linux installs — so Andrew took them along to the Python meetup and gave them away.

Alan was most impressed that the LiveCD worked on his laptop — alas that with the 4.10 release (Warty) of Ubuntu that’s apparently absolutely meaningless when answering the question "will the installer work?"

Tim was pretty dismissive of Ubuntu though on the "I tried it and it doesn’t even have a compiler!" principle. This isn’t actually true — gcc isn’t installed by default, but it’s supported and I believe it’s on the CD — but since it cost Ubuntu a user I’ll have a closer look at why I think he thought this.

I wasn’t really party to it (I joined the pre-release test team after they decided not to include a compiler in the base install) but as I understand it, the idea was most users won’t want a compiler in their install because most users aren’t programmers and the idea behind distros like Ubuntu is that you shouldn’t need to compile software for it. Further, and this seems to be key, people who want the compiler will know where to get it. Or I assume that was their premise, perhaps with the addition of "developers will read the documentation." (I’m not sure though, "where is gcc?" hasn’t made the FAQs.)

Anyway, it turns out that this isn’t the case. There are developers who, if the compiler is not installed by default, do not think to try and install it because they have no inkling that it might be there. My suspicions about the reasons:

  1. they aren’t former Debian users who are used to their distro having every conceivable piece of Free Software available in a centralised archive: what the installer offers is what there is; and
  2. they are not expert Linux users, by which I mean that they don’t have server sysadmin or security skills. It’s quite hard, in fact probably pointless at this time, to program on Linux without knowing a shell reasonably well, but it is possible to program on it without knowing much about how software gets installed on it. You put a CD in, perhaps select "development workstation" and it installs stuff. When you want newer software, you do a new install. When you want different software you might look at a different distro.

I might be reading too much into this, and in any case developers, whether power users, sysadmins or none of the above, aren’t the Ubuntu market, but assuming that anyone who programs using Linux has Linux expertise above that of a desktop user is probably wrong.

Fixing the Planet <h1> <h2> evil; Bad SLUG, no slug biscuit

Fixing the Planet <h1> <h2> evil

I notice that Andrew’s recent entry has triggered all kinds of evil on Planet SLUG and to a lesser extent on Planet Twisted. The nature of the evil is two-fold:

  1. RSS feeds contain an unrestricted subset of HTML tags, therefore there is a collision between tags the maintainer of the planet wants to use in their own content that goes around the blog entries, and tags the entry authors use.
  2. HTML doesn’t really do sectioning. At one point I heard XHTML2 was going to have section tags with h tagged heading embedded within them. This is a better idea than having the author of pieces of content needing to know what level of heading tag to use to fit into sites designed by someone else.

The solution I’m thinking of using on planets I maintain will apply special CSS properites to headings inside entries to make them less prominent:

 div.blogbody h1, div.blogbody h2 { font-size: large; text-decoration: none; } 

I should look up how to reduce their prominence to screen readers too. Alas, the fundamental semantic nastiness of having the following in the HTML will remain:

 <h2>Blog Entry title generated by Planet software</h2> <h2>Blog entry <em>sub-title</em> coded as h2 by content author</h2> 

Bad SLUG, no slug biscuit

I think Planet SLUG is still sending the character encoding as iso8859-1 while they’re actually sending utf8 content. (eg the character ‘ó’ in Andrew’s entry)

Testing; Online appeals; Ambition

I haven’t even being doing much tech stuff over the last two or three weeks…

Testing

Well, that’s entirely false. I have a new job. A fulltime programming job. In Python. I only just escaped signing a "we own your brain" clause which would have meant that I would have nothing to say because it would be owned. But it was modified and I continue to own my brain during off-hours. And occasionally I will continue to use them for tech stuff.

Tech-wise, the brains’s been busy with thoughts about test driven development, which I’ve never really done before. But I’ve spent large chunks of the last fortnight updating tests to work with modified code. I’m beginning to wonder whether optimally some other programmer should always write the tests for your code. But this may be unsellable: it takes long enough to write tests for your own code.

Online appeals — a review

Andrew and I gave money to the tsunami appeals: Andrew to CARE Australia and me to the Australian Red Cross. (As a side-note, why I am so susceptible to guilt trips? I keep seeing "only ten thousand people have donated through Amazon!" guilt trips around and falling for them. Note to self: failure to donate through Amazon is not any meaningful kind of failure.)

Within one second of my provision of my CC details to those nice folks at the Red Cross an ‘official receipt’ email was on its way to me thanking me for my donation and telling me how the Red Cross is intending to use the funds. Nearly a week after donating to CARE, Andrew’s still waiting for them to actually process the donation and bill his card. (I rather suspect their online billing handlers too the Christmas-New Year week off…)

Ambition

It’s time I had some. I’m close to declaring 2005 to be the year of ambition, much as 2004 was an explicit break from it.

Bouquets; Brickbats

Bouquets

I still don’t like online editing too much, but here’s why I love MediaWiki anyway:

Maturity
As far as I can tell, the software is mature enough to have moved well past problems with doubly or triply escaped ampersands and so on. This sounds fairly trivial, but a lot of wiki software is still in the land where you need to go back through a page every time you edit it and change all occurrences of &amp; to &, lest they be escaped to &amp;amp; on the next page submit! Similarly, I’ve never encountered problems with entering non-ASCII characters into a MediaWiki page.
Talk pages
Every MediaWiki page has an associated talk page, which it’s customary to use for criticisms, meta-discussions, and todo lists. (Not for first drafts though, these are wikis.) It’s amazing how productive I find the ability to make notes of my decisions, rather than silently worry about them.
Edit conflict notification without page locks.
Rather than giving an editor a time-limited lock on a page, a MediaWiki site will let two people start editing a page. However, if the page changes between you beginning to edit it and you submitting your edit, MediaWiki notices and presents you with two input boxes: your edit and the new version of the page. I can see how in some circumstances edit locks are nicer than requiring hand merges, but I generally prefer the latter.

Brickbats

I wasn’t going to do this after the last couple of times, but boo hiss to St George and Optus for having pages incompatible with my browser when they used to work. In the first case at least, it’s due to a buggy commercial browser detector.

Also, a brickbat to Australian English vowels for making it hard to spell. We use the neutral so much that I find it very hard to choose when to use ‘a’ versus ‘e’ in unstressed syllables.

Bugs

I’m averaging about two bug reports every time I try and use Nautilus at the moment: in particular, I have not ever successfully gotten it to talk ssh to a remote server for more than about a minute. I get a lot of ‘seems perfectly stable to me’ replies from upstream, and a lot of ‘yeah, doesn’t everyone know that Nautilus/gmone-vfs don’t really do SSH?’ from everyone else.

But it doesn’t matter anyway, because apparently every time I fill in one of those helpful "your application has crashed, inform developers?" dialog boxes, the bug reporting tool just eats the report. And this is a good thing, because it saves them from ‘otherwise useless’ bug reports.

So I think I’ll save my breath to cool my porridge from now on. In particular, the next time I see someone talking about how Nautilus is amazing if only all these CLI bound nerds would consider using the mouse, I’ll have a quiet laugh to myself and continue using scp, rsync and unison, which I have never ever ever seen crash and file an otherwise useless bug report in my head.

Writing a good online diary

Assuming that you have good reasons for keeping an online diary, there are a few things you can do to improve your chances of making your diary readable. I’ll begin by stating the general principles, and then by reviewing a few breakable rules of thumb that, in my experience, are good indicators of an interesting diary.

The general principle of good writing is to determine your audience, and write for them. An online diarist will normally encounter some tension here — the diarists are often writing partly for themselves or their future selves, and the desire to record events that were important to them may conflict with the desire to record events in an interesting way. You will need to decide to what extent you are intending to resolve this tension in the audience’s favour.

It is the case, if I am part of your audience, that your choice of material is generally meaningless to me, and the use to which you put your material is everything, which is why most of these tips tend towards the stylistic.

Tell a story

Of the beginning, middle and end structure, online diarists struggle most with the ending, often because they don’t know it yet. The most successful stories are often trivial anecdotes. However, there may be an ongoing story that you don’t want to record only in hindsight. In this case, you will want to return to it periodically.

It is very very hard to make a story out of emotions you are still experiencing, unless you’re a brutally honest and particularly insightful person, so if you want to write a powerful emotional entry, you may be better off writing an entry that looks back a year or more.

Write long entries

A long diary entry gives you the chance to tell a story, rather than writing an instant message to your readership, and most good online diaries contain at least the odd long entry scattered in their archives.

Very few online diarists seem to be poets, and so generally very few short entries will not become the highlights of your diary.

Drama is the biggest online diary cliche

If your entry is an allusion to misery that only your three best friends in the world can comprehend, your entry will be boring. The high points of an online diary are very seldom the most dramatic entries, save in the case of diaries that resemble an emotional car crash. For the rest, you will need to hone your ability to make the prosaic interesting, because it is actually much easier to do that than to make secretive drama interesting.

Make your entries complete within themselves

Again, if your entry is full of allusions to events you cannot describe in full, and people you cannot say anything about, and feelings that you are unwilling to share, your entry will be boring. If you need to censor something that is crucial to understanding a story, you may as well censor the entire story. In other cases, tell the story in such a way that it is a complete anecdote, even if it is not totally uncensored. If your reader can tell that you’ve left part of the story out, your entry is not as good as it could be.

A subtle style will serve you well

A diary with a unique voice is often an interesting read. One of the easiest ways to achieve this is to let your spoken style influence your written style. It should be relatively sparing, but a touch of spoken mannerisms in a diary makes it more readable.