Categories
Weblogging

Weblog Links: Part 1—The Impermanence of Permalinks

Recovered from the Wayback Machine.

Webloggers may vary in their support of technologies such as comments and trackbacks and RSS, but two things most agree on: weblogs support archives for older posts, and each post is accessible individually with a URL called a permalink. In fact, it is these two items that give all webloggers pause before they switch from one weblogging tool to another — none of the weblogging tools support a standardized permalink/archive structure, and moving can result in that horror of all horrors for webloggers — broken permalinks.

 

Tool Differences

 

To demonstrate the differences between tools, Blogger uses a numeric hash value for each individual item, and archives pages are listed by date and by weblog name. For instance, Jeneane Sessum talks about getting funky in an archive page called: 2003_08_03_allied_archive.html, located on Blogspot, and the individual item is identified by the value 106011655159322454.

Compare and contrast this with a weblog posting created using Bloxsom, such as this wonderful book recommendation by Rael Dornfest. In this particular case, the individual item can be found within a hierarchical structure, and the file is named the title of the post: society/literature/tears_of_the_giraffe.html.

Marc Cantor posted photos of his lovely little girl, and you can access these through a structure of date, and a fragment identifier for the individual item, most likely based on some internal value: 2003/08/05.html#a1590.

My own Movable Type weblogs use the standard out of the box numeric identifier that’s incremented with each posting. So this individual item is…well whatever it is, because I haven’t created the page yet. Since I create individual pages for individual items, though, you won’t find a URI fragment (that thing following the pound sign ‘#’) for each item. Additionally, I don’t use a complicated archival setup, so all the pages are listed in the same sub-directory: archives.

 

Archiving

 

Speaking of archives, each of us has a preferred archival scheme — category, month, week, day, whatever. I used to support multiple archive schemes, but after reviewing a years worth of weblog access patterns of my readers, the statistics showed me that very few people use dated archives such as monthly and weekly archives; however, people do like a listing of recent items and items by category. Based on this, I support category archives, individual item archives, and provide lists of both.

Regardless of your preferences, most tools allow you to choose the type of archiving you want, and at a minimum, tools should support some kind of dated archive and individual items. Personally, I think they should also support categories, but that’s my own preferences showing.

Archive choice can impact on your permalink. For instance, if you support daily archives, and each individual weblog posting has it’s own URI fragment, the URL to access this item will be different from my own individual page items. Both wil be different for the site that supports weekly archive items only, and so on.

Not a problem — unless you’re thinking of moving.

 

Moving between tools

 

Regardless of the archived format, most tools provide functionality that allows you to export all your archives — or at least, all tools should provide this functionality. If you didn’t have this, you wouldn’t have a way to make your independent copies for personal backup. In addition, you wouldn’t be able to export your data for import into another tool.

Most tools also support imports from other tools as long as you follow given procedures, usually provided by the tool vendor. After all, it’s bad business sense to not support data importing.

In fact, moving from one tool to another isn’t that complicated, unless the tools differ drastically. For instance, if you use a weblogging tool that dynamically generates pages, and you want to move to a different tool, you’re going to have some problems — archival data is stored within an independent data store, and it’s the tool that generates the pages.

However, lets say that you’re using a weblogging tool such as Blogger, and you want to move to another weblogging tool such as Movable Type — is this doable? Sure, and the Movable Type folks provide detailed instructions on how to do this. In fact, there are people from my weblogger co-op — Reading & WritingSiblog, and Baker’s Dozin’ — who have moved from Blogger to MT recently.

However, moving the data between tools isn’t the real challenge — it’s all those links that you leave behind that cause the pain. This is made more complicated if you’re moving from a hosted environment, to your own domain.

 

The Link you lose

 

A lot of webloggers started their weblogging career out at in a hosted environment, such as Blogger’s blogspot. The advantages to this are that there is no issues of tool installation, and you don’t have to go through the hassle of finding a web site, supporting it, and so on. However, there’s a major disadvantage to hosted weblogs — when you move, your address changes, and unlike the post office, there are no little cardboard cards you send to let everyone know you’ve moved.

(Yet.)

When you have a weblog at an address such as burningbird.blogspot.com, you don’t own the primary domain, though you own the content. If you move off of Blogger, or of you move to your own server, your URL for your weblog won’t be the same. Lately, with a flurry of acitivity among weblogging tools, more and more webloggers are moving and more and more webloggers are running into the problem of lost addresses because the domain name changes for their weblogs.

For Blogger, moving to another location is an inconvenience, but it’s not impossible — you just put a link at your old weblog telling people where to go for the new one. When Mike Golby moved from blogspot to the Wayward Weblogger co-op, he used the following to let people know he moved:

 

Cheers Y’all….I’m Goin’ Away

I’m shacking up with Shelley

 

Now, contrary to what this sounds, Mike isn’t snuggled up next to me at the moment — what the words overlayed was a link to his new location on the co-op server. Still, it is a great way to point out that you’ve moved locations.

Of course, though people can still find you, there are other problems associated with this type of move. For instance, those who have you listed on their blogroll have to change their links. Any links from the buzz sheets such as Technorati won’t know where you are, and neither will webbots such as the Google Bot. You’ll exist in limbo for a time — one foot in the old home, one foot in the new — until the change propagates through known space.

It’s a hassle, but you’re not lost. This is good. But what happens if your weblog is hosted, and for some reason or another — it just gets yanked?

 

Cut Adrift

 

This happened recently with a well-known weblogger, John Robb. When John left his old employer, Userland, his Userland-hosted weblog was literally yanked from the server. No one knew where he was; no one knew how to find him. It took a combination of emails and links and word of mouth among the webloggers before we re-discovered him, and I imagine that he’s still listed as “missing in action” in more than one blogroll.

John wrote about this and added the now fairly well-known line:

 

NEVER (under any circumstances) publish a weblog to a domain that you don’t control.

 

Radio Free Blogistan christened this Robb’s Law.

Never host your weblog on a site where you don’t control the domain name. Webloggers are taking this to heart and moving their sites to doman names they controlled, moving out of hosted envinronments, if necessary. Liz Lawley recently moved her weblog, Mama’s Musings to its own domain, facing some unique challenges along the way (which we’ll cover in Part 2). Originally the site was hosted on her University’s server, and though we can assume some permanence with the location — the URL is not under her control. This was also a major factor in AKMA starting up the Desseminary domain name.

Tool makers are hearing this, or should be. When the new TypePad hosted environment released, the Six Apart company took this to heart and are building in domain name pointer support for their sites. With this, even though the weblogs are hosted in a controlled tool-driven environment, the pages are still under your domain name. This is going to be the next big trend with all hosted environments, enough so to lead me to create my own law:

 

Never publish a weblog within a hosted environment whereby the host doesn’t provide support for domain name pointers. And never brush your teeth with a salmon backbone.

 

 

But that’s not the whole problem…

 

Of course, if you have your own domain, you can move your weblog between hosts and servers and people will never lose track of you. Still, this is only part of the problem — the other is that pesky, persistent, permalink. If during a move, either to a different host or tool or archival format, your permalink structure changes, there is going to be a major disconnect between the old and the new.

Even though you’ve moved on to other things, any links to your old weblogging pages will either go to your old site, or go into that well-known void, the terror of Web surfers everywhere — the 404, Page Not Found. So what can you do, besides leaving broken links strewn in your path, like leaves in the Fall?

End of Part 1.

Categories
Just Shelley Weblogging

Caricatures and Shadows

Recovered from the Wayback Machine

I’m thinking more about the concept behind discarding weblog archives. I’ve been re-reading some old posts, some nice and some not so nice. Those posts that are a year or two old aren’t even recognizable.

I’ve earned the sobriquet “Burningbird” honestly, as I’ve been nothing if not hot in much of my writing. Passionate, yes, and there’s nothing wrong with passion. But there’s also a lot of anger and pettishness and I cringe to see me in these words.

If I wanted to grow my popularity, I would feed the fire because this attracts the links, the comments, and the discussions. The more petulant the tone, the more vicious the words, the more noise and flurry of activity—flies around shit. Played correctly, I could even become an A-lister someday, until I burn up and become nothing but a cinder, driving away all that’s important.

Lately, life intrudes and does so significantly, and I just don’t want to feed the flames. Being passionate about causes, yes; more now than ever. Being passionate about truth, yes; the truth is threatened daily. But fighting with other webloggers—the nit nitting, the pick picking— it’s getting old. It gets older, with each level of dust layered on the history of this weblog.

If you all met me in person, you’d be disappointed. I’m not the person in my words, in these pages that stretch back like too long a road. They are a caricature of me, and I am only a faint shadow in them.

Categories
Weblogging

Weblog Graveyard

Recovered from the Wayback Machine.

I finally managed to get the For Poets sites up and running and have moved a couple of existing articles over, to Internet for Poets and Semantic Web for Poets. You can check the sites out following the links in the sidebar on the main page of Burningbird. Please let me know if the template I used for all the sites is causing breakage in your browser. I’ve tested in Mac OSX and Windows, with a variety of tools, but you know how it is.

All the new weblogs are using Movable Type. I did manage to install several other weblogging tools, but when it came time to try and skin them — I threw in the towel, and decided that what I’m writing is more important than what I’m using to publish it. Either I continued to fuss with the tools, or I started writing again.

Writing won.

In addition to moving the older articles over, I also published the first part of a multi-part series on Permalinks, Weblog links. Part 1, “The Impermanence of Permalinks”, discusses the problems of permalinks and moving weblogs or changing tools, something we all know all too well. The rest of the segments of the article are as follows:

 

Part 2: “Re-weaving the broken web”

What are some of the techniques you can use to point your old permalink addresses to your new page locations.

Part 3: “Architectural Changes for Friendly Permalinking”

What should tools provide, at a minimum, to prevent permalink problems.

Part 4: “Start fresh by sweeping out the old webs”

Sometimes you may want to break the permalinks, and sometimes you may want to deliberately throw out archive pages. This last section challenges the premise behind persistent archives, and the myth of the permalink.

Part 2 will cover techniques you can use to redirect from old pages to new locations, including using htaccess, ErrorDocument, Redirect, and so on.

Part 3 is going to focus on what weblogging tools should support to minimize the broken permalink problem. I plan on tying this back to the Pie/Echo/Atom effort, and see where the requirements outlined in the article would fit into the current work with the new initiative. In addition, I’ll write on how these same requirements will impact on existing weblogging tools, and what we can do to propagate changes more quickly through all the services we’re all so tied into now.

The last part is going to be the fun one. The scenario basically is this — throw out the rulebook, and break the permalinks, throw away the archives. Yes, violating the two strongest taboos of weblogging in order to make a move to a new site, or, more importantly, to redefine what your weblog is, and what you want it to say about you.

Sometimes you may find yourself caught up in weblogging patterns that you find, over time, don’t suit you. You may want to change what you write about, or change how you write, or even your subject matter. You look back through your old archives of what you wrote a year ago, two years ago, and it isn’t compatible with what you want to write now. So what do you do? Do you keep the old baggage — dusty old permalinks to dusty old writing? Or do you start fresh?

Somewhere along the way we built into Law that thou shalt not break permalinks, thou shalt not drop archives. We leave them lying, permanent reminders for all time and either just change direction in our writing (causing some confusion between the old and the new), or start a new weblog.

I envision a future Web, littered with the carcasses of old, old weblogs, long discarded, long forgotten, ghosts that haunt the threaded void. What exactly do you do with an old, old weblog that refuses to die? Can we take it out into the wilderness somewhere, and leave it by the side of the road? Will it manage to find its way back home? Can we pretend at some future time, that we don’t know it?

I think it’s time, and past, to put this law to the challenge. Part 4 talks about this, and also talks about how you can ‘break’ the laws with style, grace, and elegance.

Categories
Weblogging

For Poets

Completed the four new Burningbird Network weblogs – Semantic Web for Poets, Internet for Poets, Linux for Poets, and Weblogging for Poets – as shown in the list to your left. This is all part of a major rework of the entire Burningbird Network, something I’ve been wanting to do for some time and am finally getting around to it.

Currently I’m moving a couple of older posts over to the new sites, but I’m also working on a Semantic Web for Poets entry having to do with recent W3C TAG discussions, semiotics of photography, and the ultimate New Age question:

Who am I?

Couple of days. First I have an overdue essay for Linux for Poets, and one on Weblogging for Poets.

Fun. By the way, check out the poem for the Semantic Web for Poets weblog – see if you can catch the allusion to RDF.

Categories
Weblogging

There was a tree. There was a sock. There was a man of God.

Nothing I can say in this post will make any sense outside of the context, so all I can do is point you to AKMA’s most recent trip report from Oxford, no less.

You see, there was this tree, with this sock in it…

(BTW AKMA, I am glad you’re okay, and I hope you don’t mind the giggles. But…I can’t help it.)