Categories
Technology

Don’t link to this

Never attempt to write anything as long as my previous post, directly in the weblogging tool. If you do, you will go mad, and most likely cause harm to your computer or cat, whichever is closer.

I did want to point out that after my experiment in the earlier post–and did you all have to click that link that said “don’t click”? Is that the key to popularity? Name your weblog, “Don’t Read Me”?–I did confirm what Kevin Marks wrote earlier, in that you can use your own URL as the feed for a Technorati tag, rather than have to point to Technorati directly.

As long as the rel=”tag” attribute is in the link, Technorati pulls the filename from the URL and uses this as the tag name. This should reassure folks who are concerned about putting too much juice into Technorati, because any tool, now, can do the same: look for the attribute and derive the tag name from the file name, and create it’s own ‘tagback’ page. And since I point the link at my URL, even if the Technorati tagback page disappears, my link is still valid; I control the data, and the tagback still exists.

Kevin also confirmed that search bots are being denied access to the tags page, through the use of the META tag. Google honors this, so no pagerank. No pagerank, no reason for spamming.

Categories
Technology

Tagback seed is sprouting

As you can see from the initial , created for yesterday’s post, several people have added weblog posts that tagback to the original item. In addition, a new del.icio.us tag, tagback, was created, and since neither the original del.icio.us bbintroducingtagback and tagback tag entries are being pulled into the Technorati tagback page (anyone know why?), I used furl to add links to both delicious tag pages, as reference. And others have added the Technorati tagback page to the del.icio.us tagback page, as cross-reference.

Now when you access the page, you’ll find weblog posts that respond to the original post, my funky photos, as well as cross-references to related but not directly linked material, including material from a rival bookmarking site.

There has been considerable, and good, discussion about using a tag, or even the name I used yesterday and I’m going to cover these in more detail in my long awaited – you are waiting for it, still, I hope–sequel to tags and folksonomies, which should be out late tonight. However, before I expanded on the concept of tagbacks, I did want to see Technorati’s reaction, first. After all, I am proposing to utilize more, perhaps considerably more, of the organization’s resources. However, from Dave Sifry’s early response, the company is cool with the concept.

Speaking of such heavy utilization of Technorarti, Kaf asked the question in my comments about whether I am changing my mind about centralized services. After all, Technorati is centralized, and trackback is distributed. My answer is that once a resource has been corrupted by outside interests, as trackback and comments have been, then I would rather centralize that resource in the care of skilled technicians who are motivated to keep the resource clean, then put the burden on all the poor souls who don’t know SQL and don’t understand XML-RPC and pagerank, or who don’t have the tools to easily clean up their sites.

There is a risk that Technorati may go away someday, or put up a costwall between us and the data, especially if investment companies urge this. However, by making use of many resources, such as del.icio.us, furl, even flickr, (and other tag based entities sure to pop up), and cross referencing the material, we should be able to pick up the threads if need be. And I am making an assumption about Technorati: that the organization doesn’t intend to cause harm. They might put ads into the tagback pages, but we’ve seen ads embedded in all of the facilities we use, and they don’t cause harm. Still, to repeat: we are backing up the threads by using cross-references in other tag-enabled tools, no offense Dave and Kevin and other folk at Technorari.

In addition, if I understand the documentation with Technorati tags correctly, the URIs we use don’t have to be to Technorati, though I’m not sure how this works yet, especially in regards to tagback–still experimenting around.

Another personal refinement is that I decided not to generate new tagbacks automatically with each post, because some posts, such as this, are an addition to one published previously. I’ll use the original tagback for all posts on the same thread. In addition, not every one of my posts needs a tagback page, though if I don’t add one, with tag systems such as delicious and furl, flickr, and other systems sure to spring up, as well as webloggers ready to wield that mighty link to create a tagback page, someone can always create one for me if they disagree.

The tag for this post is bbintroducingtagback. To add an item to the discussion surrounding this post, you can use this tag with a flickr photo or as a del.icio.us or furl bookmark tag. You can also include the following Technorati tag in your post: .

Categories
Technology Weblogging

Bad Webloggers. Bad.

Recovered from the Wayback Machine.

As you can see, I’m still getting pingbacks, even with removing the link to the pingback server from my page header. The reason for this, most likely, is because in the WordPress code somewhere, my site is responding affirmatively to an XML-RPC request, and the pingback is then sent. I’ve since moved the xmlrpc.php file elsewhere, though this means I can’t remotely post for now. But I rarely do anyway.

The pingbacks are from a post that Jonathon Delacour wrote on the recent trackback and nofollow issues, over at Writable Web, the new weblog he’s writing in conjuction with Marius Coomans. In this writing, Jonathon provides a nicely done comparison of pingbacks and trackbacks and how the two have become somewhat synonymous in most webloggers minds, primarily because of trackback autodiscovery. He also covers the new nofollow attribute, automatic addition of in weblog tools such as TypePad has led the spammers this last week to basically hit webloggers across the nose with a rolled up newspaper, going “Bad, webloggers. Bad.”

In the meantime, here’s a surefire method of preventing comment spam:

Open up robots.txt, or create one, and add the following two lines:

User-agent: *
Disallow: /

It could take a couple of months, but eventually you’ll find you’ll have no more comment spam. Of course, you’ll have no Google or other search engine pagerank, either. But why bleed pagerank out of the weblogs slowly with nofollow, when we can do it quickly with robots.txt?

Seriously, bite the bullet, cut the cord, and be comment spam free. Isn’t this what everyone wants?

Categories
Technology

Daily hits via Technorati

Through Technorati I found a post where Roland Tanglao referenced my post on trackback being dead. There was a discussion in comments about Technorati opening up Watchlists and API queries.

Hmmm.

I then created a watchlist of my base URL, http://weblog.burningbird.net, which you can access directly with this URL. This returns an RSS feed of the watchlist for the entire weblog — a watchlist being all links to my weblog on any specific day.

I took my old Backtrack application, which used to backtrack trackbacks and print out who else has trackbacked a specific post, and modified it to consume the RSS that Technorati provides, instead. I then posted a link to this at the top of my sidebar, and you can also check it out here.

If you want to do the same, create a watchlist for your weblog, copy the source code for Backtrack, and then modify the look and feel to match whatever you want. You’ll want to leave the PHP bits in the body alone, except to replace my watchlist URL with your own.

This will give you a list of links to your weblog, tracked by Technorati, on a daily basis. The question remains, though, how this alternative to trackbacks will scale, because Technorati is a centralized service, and one that can get sloggy at times.

Update: to add Technorati and Bloglines links to your posts

I’ve added Technorati and Bloglines links to each of my posts.

For WordPress, the Technorati link is:

<a href=”http://www.technorati.com/cosmos/search.html?rank=fresh&url=<?php echo get_permalink() ?>” >Technorati Links</a>

If you’re not using WordPress, you’ll need to replace the function call to print out the permalink with whatever your tool supports. Just see what the tool uses for your permalink and copy this into the placeholder of the Technorati link.

For Bloglines citations (thanks to Dare for pointing this out):

<a href=”http://www.bloglines.com/citations?url=<?php echo get_permalink() ?>&submit=Search” >Bloglines Citation</a>

Again, replace the WordPress permalink function call for whatever your tool uses.

These will return the links, in Technorati or in Bloglines, for a specific post. Now, Bloglines was just bought out by Ask Jeeves, so who knows how long this functionality will last. And I’m sure someone somewhere is about to buy out Technorati, so ditto. But might as well make use of the functionality for now.

Categories
Technology

Throttling the Trackback

Recovered from the Wayback Machine.

I was hit with 781 trackbacks last night, all of which went into moderation, but all of which triggered my comment throttle (trackbacks are stored in the same table as comments in WordPress), so if you tried to comment and couldn’t you’ll know why.

I added throttles now to the trackback code–only allows ten trackbacks in a minute, 30 in a day. My site is using customized code, but I created a customized wp-trackbacks.php file for WordPress 1.22, which you can access here. Note, I’ve not done a thorough job testing the throttle code on trackbacks (it has been in use for months at Burningbird for comments) so use at your own risk. If someone spots a bug, let me know.

Search in the code for the Burningbird throttle comment, and change the 10 or 30 to whatever value you want.

I imagine that this is notice being given by the comment spammers that nofollow won’t stop them. Contrary to what you read in the Register though, pagerank is the primary reason for comment spams, not click through. While I am not making the issue into a religion, as Scoble asserts, I don’t agree that nofollow is going to be a solution for comment spam. However, I’m also not going to ignore spammer FUD: I imagine the only reason that “Sam” agreed to the interview with the Register was to cast additional doubt on nofollow. This isn’t because he’s concerned about nofollow driving him out of business, but because he knows he’ll have to send that much more spam to make up for sites that are using nofollow.