Categories
Social Media Weblogging

I own Stuff

Recovered from the Wayback Machine.

Technorati has a new beta feature: blogs with authority on topics. I, of course, checked out my site on certain topics to see if I am an ‘authority’.

I am the second highest authority on photography, after Tim Bray and ahead of Heather Champ. A big surprise there.

I am eleventh in technology, after Doc and Meg and Scoble but before Dave Sifry, himself. There’s something rather poetic about that one.

I am second in Writing after Neil Gaiman. Who is Neil Gaiman, I think to myself. Exploring, I find a post talking about the Satanic Tomato. Of course.

I also own feminist but there’s only two of us. And I own women though there are more women. Just so there’s no confusion about my position, I also own goddess. And I’m fifteenth in Politics, but third on Bush. Oh my, I could have fun with this.

But none of this matters, because I own Stuff. When you own Stuff, then you know you’ve arrived. Oh, and it helps when you know how to work with metadata.

Ooops! Hold the presses! As of this afternoon, I am 12th in Technology and Dave Sifry is now one place ahead of me. Got to keep up in this metadata crazy world.

Categories
Weather Weblogging

We are not the Red Cross

Recovered from the Wayback Machine.

DailyKos is running a board for folks needing shelter. There’s also something over at LiveJournal and a hurricane blog (links via Rogers Cadenhead).

This is all cool and I love seeing people helping each other.

Having said that…

What the hell do you people think you’re doing? How do you know if that person contacting you is a true refugee or someone wanting to rob you blind? And are you ready to take a person in for possibly weeks? Months?

Come to think of it, we know what you own, how many kids you got, and that you at least have a computer since you’re weblogging. What a great way to do one’s early Christmas shopping if one was of a mind in this direction.

And for those of you who are thinking of getting caravans of stuff together to take down, what the hell do you think the well-trained and highly prepared Red Cross, not to mention FEMA, is for?

You want to open your home to a weblogger? Great. Make sure it’s someone you know and can live with for some time.

Trying to arrange a ride in New Orleans? It’s too damn late. Get to one of the ten shelters.

Going to stick it out and blog it, like a good little journalist soldier? Don’t want to miss the adventure? Not worried about it because your kitty cats are sitting calmly in the window and everyone knows animals can predict weather? Thanks for adding to the burden on the infrastructure put into place to provide support for those who have no option but to ride it out.

It’s frustrating to see people suffer, and we want to help, and that’s a goodness, and you should be admired for that. If you truly want to help, then donate to the Red Cross. They’ll need money, and not your old clothes and expired cans of food. They are the first line of a civilian help force, and should be the focus for early contributions. You might also consider donating to the Salvation Army, because they’re also experienced at giving help in times like this. Later, there will be other, sanctioned organizations that will provide effective, and targeted help, to which you can donate time, money, and goods. You might also consider donating blood. Even if it’s not needed for Katrina, it’s still needed.

As for those who have no choice but to ride it out, you’re in my heart, and that’s about all I can do for you right now.

There’s a fine line between providing effective help, and being a busybody nuisance. If you want to insert your butts into the emergency process, fine. Just make sure you don’t make more of a mess of it than it already is.

Categories
Weblogging

No such thing as a quiet marketer

Recovered from the Wayback Machine.

I don’t know what it is, but I’m really tired today. And since I don’t want to post variations of “Oh, No!” every hour for the next 24, I think now is a good time to focus on finishing some work for folks, and the new code for this site.

I’ve decided to package the photo code up in such a way that it can be used by people regardless of the weblogging system they use and whether images are stored at Flickr or not. By doing so, and making it both fun and easy to use, I’m hoping I can encourage more people to use it. A by-product of this use, then, is that it provides easily accessible rich, structured, metadata that can benefit all of us.

This is just going to revolutionize our lives. I am not joking — the next generation of the web is here, and I’m just so excited! It is going to be big, babies! Big! I am so going to punk the web.

And it started here, first! With me!

I need to call Dave Winer. I know he’ll want to be in on this.

Whoa. Deep breath now.

No, I haven’t been bitten by one too many ticks. I’m trying to find a way to inspire you all with my enthusiasm, without me being there to grab you by the shoulders and look you intently in the eye. I’ve used some of the same words you may have read elsewhere in the last month or so, for some new innovation or other. But where the words can fall naturally off of some folks’ tongues, like hail in a storm, they don’t feel like me.

I was inspired in this momentary exercise, in part, by Kathy Sierra’s latest humorous and well written post where she says we’re all marketers:

The late (and brilliant) comedian Bill Hicks was an early adopter of the “all marketing is evil” meme:

“By the way, if anyone here is in advertising or marketing, kill yourself. No, this is not a joke: kill yourself . . . I know what the marketing people are thinking now too: ‘Oh. He’s going for that anti-marketing dollar. That’s a good market.’ Oh man, I am not doing that, you f***ing evil scumbags.” (asterisks are mine)

I was about to protest, “Dammit Jim, I’m a programmer, not a marketer!”

But that would be a lie. In this new open-source/cluetrain world, I am a marketer. And so are you. If you’re interested in creating passionate users, or keeping your job, or breathing life into a startup, or getting others to contribute to your open source project, or getting your significant other to agree to the vacation you want to go on… congratulations. You’re in marketing. Now go kill yourself.

Kathy has a valid point, and one that isn’t lost on me. I’ve not been a particularly good marketer: of my skills, my projects, or the technology I use (RDF comes to mind). I mean, look how I started the post: “Hey, kinda tired lately”. What kind of marketing is that? It may be true, but it doesn’t sell people on an idea or a person.

It wasn’t as if leaving these words off would be a lie. We can choose not to say something, and in doing so turn a quiet post into one that has zim and zingle. My problem, sorry, challenge, though, is that I’m not a zim and zingle type of person. Oh, I can get angry, and I can get passionate, but when I’m creating something important to me–be it software, writing, photos, or even a relationship–the more important it is, the closer it is to me, the quieter I get.

Later in her post, Kathy writes:

Remember — when people are passionate about something, and in a state of flow–and you have contributed to that by helping users/members learn and grow and kick ass–these are some of the happiest moments in their lives.

I agree with this, too–it is wonderful when you’ve helped someone, or someone likes your application (or photo or book or you). The thing is, you can be passionate about something, but quietly so and that’s what separates out the true marketers from all the people who love what they do.

Loren Webster writes on flashy flowers and one’s own garden in a post full of subtle innuendo–but how does that translate into RSS and hold up under an aggregator? You know, bright lights and lots of noise make it hard to hear a lover’s whisper; and if you’ve sandpapered your fingertips, it’s going to be hard to feel the veins of a leaf.

So where is the middle ground between the quiet corner and the jumping up and down we see so much in certain unnamed-weblogs-but-you-know-who-they-are? Is being passionate enough? Or must we exaggerate that passion–emphasize it so it can be seen at a distance: paint with bigger brushes, more gadgets in the code, zoom in with larger lenses, use more exclamation points when we write, and scream more during sex?

More, brighter, louder. No wonder I feel tired.

Categories
Weblogging

Minor syndication tweak

I have tweaked the syndication feed to filter out the photos. I figured that these are either adding a burden to the feed readers, or if they make it into an aggregator, are morphed out of shape.

If you want to look at photos, you’ll have to click through.
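
For anyone wanting to try the same kind of tweak outside of a weblogging tool, here is a minimal sketch in Python. It is an illustration only, not the code used on this site: the function name `strip_images` is hypothetical, and a regular expression is a rough instrument for HTML that will miss unusual markup.

```python
import re

# Matches a whole <img ...> tag, self-closing or not, in any letter case.
# Assumption: the feed content is ordinary HTML without '>' inside attributes.
IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)


def strip_images(html):
    """Return the item HTML with every <img> tag removed (hypothetical helper)."""
    return IMG_TAG.sub("", html)
```

The function would be applied to each item's content just before the feed is written out, leaving the post on the site itself untouched.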

If you’re a WordPress user and provide full feeds and don’t want your pictures flowing into the feed, let me know and I’ll whip up a plugin for you to use.

Categories
Weblogging

Links not wanted

Feedster released its own version of a link ranking system, Feedster 500. It matches previous lists, but also has a number of surprises.

Unlike other lists, or even link aggregators, Feedster has been very forthcoming about how it derives its list and, more importantly, how it finds the incoming links it uses as the key component of its list: it finds them in syndication feeds. This will explain why there are some unexpected results in this list. First, blogrolls are left out of the calculation, as they are not part of syndication feeds, or at least, not traditionally part of syndication feeds. Second, and this is the kicker, if you publish a syndication feed that doesn’t provide full content, then your links are not being picked up by the service and used in its calculations.

My links weren’t picked up. In fact, when working with my Linkers tool, and the more sophisticated Talkdigger, I have found that none of my links to other sites are being picked up by any of the services. And when I went looking for how the services work, none of the tools, other than Feedster, publishes its process to find links and/or other searchable material.

This is frustrating because if I don’t care about lists and ranks, I do care about letting people know that I’ve written something about their posts. Since I don’t support trackback anymore, the only way another weblogger will know I’ve made comments on their work is if they read my weblog regularly, someone else tells them about my post, I put a link into their comments, or they see my URL show up in their referrer logs. And with abuse of referrers, these are less than useful nowadays, or even unavailable for some webloggers.

Besides, I don’t want just the weblogger to know I’ve written about their posts–I want others to know, too.

Now I know how Feedster works and that if I want links to show up in that service I have to provide full content. I don’t want to do this; I’ve never wanted to do this. But either I decide to blow off inter-weblog communication, or I provide full feeds. The question then becomes: what about the other services?

Supposedly Technorati uses the syndication feed if this provides full content; otherwise it grabs the main page and scrapes the data. By accessing only the front page, if I use the -more- link to split a larger post into a beginning excerpt with a link to the individual page, the links in this split apart page are then not included. If I then want to have my links picked up from a post, I either have to make sure they show in the very first part of the post, or not use the -more- capability.

Even when I don’t use -more- capability, my links are not showing up in Technorati. Nor in IceRocket, nor in Bloglines, nor in any of the other services as far as I can see. Now, I’m beginning to suspect that most services now use only the syndication feeds, which means I’ll have to use full content for them, also. As a test, I’ve set my site to provide full feed for now, and I’m linking to several sites in and at the end of this post to see which service, if any, picks up the links.

Other factors that could influence the feed being picked up include me repeating my permanent link to a post in the title and at the bottom of a post; publishing links to webloggers’ URLs in my comments (which could trigger spam filters); not pinging weblogs.com or blo.gs; perhaps even the fact that I only support one feed type (RDF/RSS). Without knowing how each of the services processes links, your guess is as good as mine.

If I’m frustrated with the services, I also know how difficult it is to collect ‘good’ data from a site, as separated from ‘bad’; how to determine which links are coming from the outside (a commenter’s URL) versus ones from the site author; and a static link (blogroll) from a dynamic one (one included in a page). I can respect the challenge involved even as I am critical of the results.

What would I do if I were creating a service like this?

First, I wouldn’t scrape weblogs off of the global services, such as weblogs.com. These are mined by spammers so badly now as to make them useless. What I would do is provide a ping service that a person could trigger manually, or through their tool if it provides this facility.

I would access the syndication feed, and if full content is provided, I would process this for data and URLs. Otherwise, I would access these URLs directly to pick up links. By doing this, I’ll also be accessing URLs in comments and anything in the sidebars, which is why most services don’t want to access the individual entries, but I’d rather be more liberal than not when it comes to gathering data.
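
The feed-first, page-second approach just described could be sketched roughly as follows. This is a hedged illustration in Python, not any service's real code: the item layout (a dict with a `url` and an optional `content` key holding full HTML) and the names `LinkCollector`, `extract_links`, and `links_for_item` are all assumptions made for the example. Fetching is passed in as a callable so the sketch itself never touches the network.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return all link targets found in a chunk of HTML, in document order."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links


def links_for_item(item, fetch_page):
    """Use full feed content when present; otherwise fetch the entry page.

    item: hypothetical dict with 'url' and, for full feeds, 'content'.
    fetch_page: callable taking a URL and returning that page's HTML.
    """
    content = item.get("content")
    if content:
        return extract_links(content)
    # Excerpt-only feed: fall back to the individual entry, which will also
    # yield comment and sidebar links, as noted above.
    return extract_links(fetch_page(item["url"]))
```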

I would also like to send a bot once a day to access the main page, just to make sure updates haven’t happened that haven’t been reflected in the feed, and to access the blogroll and other more static data.

At this point in time, we have a lot of data. Pulling blogrolls and other static links out of content isn’t that hard if you have the storage to maintain history and can compare if a link provided today was also provided yesterday. About the only time I would refresh this in the database is if the link changed in some way– it was there one day, not the next. Or the content in which it occurred changed (and this could require a way of annotating context of a link, which could be pricey in storage and computation).
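
The history comparison might look something like this minimal Python sketch. It assumes one set of link URLs is stored per site per day; the function names and the `min_days` threshold are hypothetical, and treating "present several days running" as "static" is only a heuristic, since a post that lingers on the front page would look the same.

```python
def static_links(snapshots, min_days=2):
    """Links present in every one of the last min_days daily snapshots.

    snapshots: list of sets of URLs, oldest first (an assumed storage format).
    """
    if min_days < 1 or len(snapshots) < min_days:
        return set()
    recent = [set(day) for day in snapshots[-min_days:]]
    return set.intersection(*recent)


def changed_links(snapshots):
    """Return (appeared, disappeared) between the two most recent snapshots."""
    if len(snapshots) < 2:
        return set(), set()
    previous, current = set(snapshots[-2]), set(snapshots[-1])
    return current - previous, previous - current
```

Only the links reported by `changed_links` would need refreshing in the database; everything else can be left alone.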

One interesting way of looking at this is to remove duplicate links when it comes to aggregation for lists, but to refresh the item in the most recently updated queue if it shows in fresh content at the site being scanned. With this you don’t need to have much context, and if a person is interested in finding out who is talking about a specific post, these top-level links won’t show.

As for links for comments — here is where the vulnerability to spam enters, but using an algorithm to find and discard multiple repeated URLs could help to eliminate these. Looking for domains that have been determined to be spamming is also another approach. Sometimes, though, we have to accept that some crap gets through. I’d rather let a little crap through than to discard ‘good’ stuff–just because I feel I’m in some kind of war with the spammers.
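
As a rough illustration of those two filters, here is a Python sketch. The function name, the `max_repeats` threshold, and the idea of passing in a ready-made set of blocked domains are assumptions for the example; where such a domain list would come from is left open, as it is above.

```python
from collections import Counter
from urllib.parse import urlparse


def filter_comment_links(urls, blocked_domains=frozenset(), max_repeats=3):
    """Drop comment URLs that flood the page or sit on a known spam domain.

    urls: comment URLs in page order.
    blocked_domains: lowercase hostnames already judged to be spamming.
    max_repeats: a URL seen more often than this is treated as spam.
    """
    counts = Counter(urls)
    kept, seen = [], set()
    for url in urls:
        host = (urlparse(url).hostname or "").lower()
        if host in blocked_domains:
            continue  # known spammer
        if counts[url] > max_repeats:
            continue  # repeated too often to be a real commenter
        if url in seen:
            continue  # keep one copy of a legitimately repeated link
        seen.add(url)
        kept.append(url)
    return kept
```

The thresholds are deliberately forgiving, in keeping with the preference for letting a little crap through rather than discarding good links.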

It could help to annotate links for blogrolls and links for comment URLs and so on. Not that abysmal ‘nofollow’, but with something meaningful, like ‘commenter URL’ or ‘blogroll link’ or something of that nature. We do something like this with tags, and though I don’t care much for tags in weblog posts, I don’t agree with Bloglines’ Mark Fletcher that tags generally suck–especially when it comes to effective uses of microformatting to annotate links.

(Speaking of which, what kind of a post is: “I was going to blog something about how tags are bad, evil horrible bad, and highlight the failure of existing search technology, but I couldn’t muster the energy. High level message: tags suck and are unnecessary except in cases where no other textual data exists (like photos, audio or video). Discuss amongst yourselves.” How’s this: Bloglines is indulging in evil censorship of my communication because it doesn’t pick up the links from my posts. Discuss among yourselves.)

Unfortunately, microformats generally require some technical expertise on the part of the person using them, and to base any kind of measurement on this is irresponsible.

Once I have data that is reasonably clean and fresh, if I were to create a list, I would do one based on popularity versus influence, and I would differentiate these by the number of blogroll links for a site, as compared to the number of dynamic links. A person that has a large number of dynamic links compared to static blogroll-like links to me would be a more influential person (hi Karl) than one who has a fairly even ratio between the two. I wouldn’t mind seeing this ratio in a list rather than the counts — we could then find who is influential within groups, even if the groups are smaller. Regardless, I would also provide the raw data to others, and let them derive their own lists if they want.
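
The ratio idea can be sketched in a few lines of Python. The names here are hypothetical, and the handling of a site with no blogroll links at all (treated as infinitely "influential" if it has any dynamic links) is an assumption the paragraph above does not address.

```python
def influence_ratio(dynamic_count, static_count):
    """Dynamic (in-post) links divided by static (blogroll-like) links."""
    if static_count == 0:
        # Assumption: no blogroll links at all counts as pure influence.
        return float("inf") if dynamic_count > 0 else 0.0
    return dynamic_count / static_count


def rank_by_influence(sites):
    """Order site names by ratio, highest first.

    sites: dict mapping a site name to a (dynamic_count, static_count) pair.
    """
    return sorted(sites, key=lambda name: influence_ratio(*sites[name]), reverse=True)
```

Because the ranking uses the ratio rather than raw counts, a small site that is cited often in posts can place above a much larger one that is mostly sitting in blogrolls.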

Why give away precious data? Because by keeping the source of the data and algorithms open, I establish credibility. In addition, flaws will be found and smart people will provide suggestions for improvement. Most importantly, I give those who would be critical of any of my processes nothing to hook on to — the algorithms are public, and mutable; the data is available to all. I have, in effect, teflon coated myself with Open Source. I agree with Mary Hodder a hundred percent on the advantages of openness when it comes to data gathering techniques and processing, and providing access to raw data–but not just for ranking.

As for a business model: well, knowing the algorithms and having access to the data is one thing; being able to use these effectively, consistently, and in a manner that scales is the bread and butter of this type of technology. Google never would have been Google if it was slow.

Additional links:

Joseph Duemer is teaching a class in weblogging today. Welcome to weblogging, Joe’s colleagues. Just as an FYI, I’m on the Feedster 500 list, which makes me a weblogging princess. If I were in the top 100, I would be queen. If I were in the top 10, well, I would be a lot wealthier than I am now.

Someone who is in the top 100 is the Knitty Blog. Now, this site ably demonstrates the nature of influence over popularity — it’s not that it’s linked statically by a lot of sites; but it is referenced in a large number of posts. That, to me, is influence.

Dare Obasanjo just uploaded 50 photos from his recent trip home to Nigeria. What I want to know, Dare, is why you took so many photos of billboards?

Fulton Chain carries the best b-link bar there is: with links to stories that cover a range of topics, such as a praying mantis eating a hummingbird, and how to build your own homemade flamethrower. Then there’s the Ode to Rednecks. Come on down and visit me in the Ozarks. Hear?

And that’s about enough about linking.