Categories
RDF Writing

Inhale

Recovered from the Wayback Machine.

I went to bed at 3 and got up at 6 so have had 3 hours of sleep, and feel much better. Today was also my last day on the contracted job. The people at the place where I worked were very likable, the consulting company that arranged the gig less so. However, that’s not unusual. Am currently working next chapter of story of life.

I have to finish the proofs for Practical RDF for the editor tomorrow or risk the wrath of a production editor on deadline. Believe me, that’s a not someone you want to antagonize. However, I wanted to clarify that my comment in the last post about the Advice emails isn’t directed at my regular readers. You all can give me any advice you want – as long as you accompany the email with a nude photograph of yourself.

I think that’s a fair deal. Don’t you?

One day after the fooflah yesterday, and lots of good reasons to like wikis, but I still dislike them. The frenzy of activity yesterday accomplished a lot, I will admit – but it was like ants scurrying about an ant hill, as the number of edits headed into the hundreds, and pages were changing by the minute, much less the hour.

ants1.jpg

However, my dislike of wikis has nothing to do with Sam or his decision to use a wiki. I like ants! Ants are good!

Not caring for a technology does not mean that I’m slamming the people who use it, ants analogy aside. That’s equivalent to saying that anyone who doesn’t like RDF must not like me because I wrote about it and promote it’s use. Such silliness.

Speaking of RDF…

Categories
RDF

Harvard Support?

Recovered from the Wayback Machine.

Dave Winer uses his Harvard weblog, and we assume the clout and prestige of his Harvard position, to push the weblogging industry into backing his versions of both RSS and a Weblogging MetaAPI.

There’s already been discussion about Winer taking on ‘co-creator’ claims with RSS — something I and others dispute. Now, he is doing the same types of power manipulation with the MetaWeblog API:

The same philosophy dictates an end to the disagreement over RSS. If they want respect for the formats and protocols they implement, they must do RSS exactly as UserLand does. The thing that Blogger and MT currently call RSS is not only not what UserLand does but it isn’t even an improvement over what UserLand does. Lose-lose.

Sure, other people encourage this nasty thing, but that doesn’t make it right. I’ve written to all the parties privately on this. It’s important, if the blogging API world is to come together in a rational way, we must have basic aggreement on RSS. It’s time to settle this argument now. This is the nasty stuff the big companies do. Let’s get over it and get some principles in place now.

About APIs, I request others support the MetaWeblog API without reservation. If you want me reorganize and move the docs to a neutral place and put an IETF-like disclaimer on it, I’m happy to do so. Maybe this is something Harvard could help with. I ain’t going with MIT, they’re the competition.

Smiley faces aside, these closed door discussions by private phone between “The Big Three” of weblogging, the re-writing of history about the tools, the use of a respected institution to add credibility for what is a dispute based on personality differences — Congratulations, weblogging: you’ve just become a true American industry.

Ford, Chrysler, and GM would be proud.

What’s next — a Userland RDF?

Categories
Semantics

The meaning web

You may not be aware of this, but elements of the semantic web are already in place. For instance, if you’re not sure how to spell ‘algae’, you can search for a variation of the word, such as ‘algie’ along with a few characteristics, such as ‘slime’, ‘green’, and ‘water’, and Google responds with Do you mean algae slime green water? This may not seem like much, but there’s a great deal of putting together related words into a context, and then making some assumptions when a match isn’t found just to obtain a result of Do you mean….

The semantic web isn’t going to result from gigantic strides in science and technology — it’s going to result from efforts of people like you and I. From simple steps, just as with Google and the search for algae.

Joseph Duemer begins a discussion of poetry and the semantic web. He zeros in on that aspect of poetry that inspired my current effort in something such as the Poetry Finder. Joseph writes:

Poetry is the most human form of language, then, not because it is the most humane & not to valorize the term, but because poetry is a way of using language that takes maximal advantage of the notion that a word or phrase might “mean somewhat different things.” Somewhat. Some what. Poetry occupies the space between some & what. So how do we make our human machines grab onto human grammar? It seems just possible to me that metadata & metametadata & so on out to infinity might be used to create at least a semblance of human meaning that could move freely between machines & between machines & humans.

I agree with Joseph in that we can consider the use of metadata to create a connection between human meaning and machine understanding, but it won’t be the stuff of artificial intelligence. The important first step is to begin recording the data, and then once we have it, we can do interesting things with it, just as Google is doing interesting things with data it scrapes from web pages just as unannotated words.

Joseph also references the earlier work in creating this bridge between man and machine, through efforts such as Bertran Russell and Ludwig Wittgenstein. However, these earlier pioneers did their work without the concept of the interlinked network that is the web, which changes everything. To them, the effort that the machines needed to take was nothing less than heroic — true thinking machines. But as we see with Google, most of what we need is a way of recording meaning as statements, a simple model of how these statements are related, and a straight forward text-based format that can be utilized by any tool, in any environment. From there, we can build meaning exponentially greater then a very smart spellchecker.

Joseph ends his essay with:

 

I’m going to continue reading about the bones & nerves of the web & in coming days look at the ways those structures hook up with my own literary & philosophical knowledge. The only way I’ve ever been able to learn anything is a) that I need to know it & b) that I could hook it on to stuff I already knew. (I know, there has the be a foundation–at least some say so–but for now I’d just say it’s turtles all the way down. I hope to begin working my way down the stacked turtles of the web in coming weeks; there is also the (I think related) project of investigating academic dishonesty. Good thing I don’t have a class to teach until July.

Yes, a very good thing. Which means I need to focus on my RDF for Poets writing, and help with the effort of knocking down that stack of turtles.

Categories
RDF

RDF: Binary XML

Recovered from the Wayback Machine.

Tim Bray has had a couple of essays on RDF, including an offer of a domain, RDF.net, to someone who invents an RDF tool he would like to use. I wonder if he likes poetry? I wonder if I’m interested in yet another domain?

Tim’s pushback isn’t against RDF, which he seems to like. His pushback centers on the serialization technique for RDF — RDF/XML.

He writes:

 

RDF has ignored what I consider to be the central lesson of the World Wide Web, the “View Source” lesson. The way the Web grew was, somebody pointed their browser at a URI, were impressed by what they saw, wondered “How’d they do that?”, hit View Source, and figured it out by trial and error.

This hasn’t happened and can’t happen with RDF, for two reasons. First of all, the killer app that would make you want to View Source hasn’t arrived. Second, if it had, nobody could possibly figure out what the source was trying to tell them. I don’t know how to fix the no-killer-apps problem, but I’m pretty sure it’s not worth trying until we fix the uglified-syntax problem.

It’s not surprising that there isn’t a killer app for RDF — though the specification has been around a long time, it’s also been in committee and under development almost the entire time. Now that we’re finally heading into what should be a stable RDF specification, I think we’ll start to see more and more applications such as MIT’s DSpace, RDF Gateway, Siderean Software’s Seamark, Plugged In’s Tucana KnowledgeStore, Mozilla and other applications that are based in some part on RDF.

As for the syntax being unreadable, which he reiterates today (“RDF/XML syntax, blecch”), Tim makes the statement that the concept of “View Source” is essential for any web-based specification or application to succeed. He writes:

 

The Web grew because people could generate it by hand, and did.

By this I assume he means that a person inexperienced with the technology can learn about it by viewing the source, as people learned about web page creation through viewing HTML.

I have to disagree with Tim on this. Sure, View Source is how people learned to create web pages, or to manually create simple repetitive XML such as that found in RSS files — but the concept falters when you look at the back-end mechanics that have made the Web truly viable. There is no “View Source” for Perl/CGI, or JSP, or ASP, or PHP pages. There is no “View Source” for the code for the Apache web server, or the protocols, or the browsers.

The web page is only one part of the web — the user interface. So much of the web is either compiled code, or text that requires training and experience in order to read and understand.

Did I learn HTML from View Source? Yup. However, when I work with JSP, I rarely view the generated Java classes because even though they support the application they aren’t meaningful for me. I do know Java, so I can look at them –it’s just that I have better things to do with my time. Did I learn how to code JSP by looking at existing JSP pages? Sometimes, but much of the knowledge came from looking at the documentation.

In addition, is the data in a Berkely DB or MySql database in a format easily read or written manually? Or do we use applications to access the data?

We need to stop treating RDF/XML as yet another variation of tags similiar to HTML and start looking at it as a form of virtual binary code — machine generated and consumed, but output in plain text. Until we do, we’ll never get to the point of creating that killer app that Tim wants.

My advice to Tim is reap the benefits of RDF from the applications I’m sure he’ll be inundated with, and treat the serialization format as another incomprehensible machine language that just happens to use angle brackets.

(Thanks to Sam for link to Tim’s articles. Also see Danny’s take.)

Categories
Connecting Insects RDF Technology

What didn’t work

Recovered from the Wayback Machine.

The RDF Poetry Finder was more than a personal interest of mine — it was an attempt to see if a project could go from idea stage to implementation through the efforts of people who participated purely on interest — no formal group formation whatsoever. Additionally, it was a project that would, I hope, bring together members from both the technical and humanities communities, in such a way that each would contribute equal expertise to the project. The type of project would force this because it really did require a deep understanding of the mechanics of technology and poetry.

The latter combination of the technologists with the community was the real interest to me. I didn’t do the Poetry Finder because I needed practice with technology — I have close to 20 years of practice, I think I get it.

Along the way, though, I made several significant mistakes. Since we learn from our mistakes, I thought I would share these:

1. Antagonizing the Tech community

I started this project out by doing a deliberate pushback against RSS and FOAF, mainly RSS. By doing this, I had hoped I could attract the attention of the people who work with RSS, particularly the RDF/RSS people, but without tying RSS into the project. In addition, I hoped I could break through a growing belief that RSS 1.0 is representative of all RDF semantic web efforts.

What I did do was antagonize the RSS 1.0 fans, and they are nothing if not loyal. This ended up losing me key technology people, as well as spinning off a lot of energy back into the RSS vortex. Not just the RSS vortex, but the ongoing personality differences that have plagued RSS for too long.

2. Not providing the right hooks for the poetry enthusiasts

I think I did better attracting people from the poetry community. However, I didn’t give them the necessary hooks in how to participate in this project.

We’ve trained the ‘user’ in the weblog community and elsewhere to be passive — the technologists will generate the idea, develop the specs, write the implementation, and help you use the technology. If you don’t implement their new technology quick enough, you’ll hear about it. If the techs aren’t moving fast enough, there’s the LazyWeb.

Well, this is good. No, I take that back. This is not good.

We’ve made the ‘user’ in our community, the weblogger or other web site owner who doesn’t have a strong technical background, into a spectator; and we’ve turned much of our technical activity into spectator sport.

Scenario: Roman arena. Two participants. The issue is RSS. Get the picture?

This was a mistake. It was one I tried to rectify with Poetry Finder, but wasn’t as effective as I needed to be. I’m not quite sure how to do this better.

3. Tied project into RDF/XML from the start

Of course, this is a technology and a specification that I’m comfortable with so it’s not surprising I would focus the project on RDF/XML. However, this triggered much of the same glazing of eyes phenomena that always occurs with RDF/XML. I did expect a little of this and hoped that not jumping into the technology right away would work around this. It did, somewhat. However by not focusing on technology from the start, I lost more of the techies.

Additionally, focusing on the use of RDF as the underlying technology from the start also brought in contention from those enthusiastic about competitive technologies, which I wasn’t expecting.

4. Not starting effort out with a prototype

I had hoped to take this project from idea to implementation in the public eye, based on public participation, but I think that the idea of a “poetry finder” is too nebulous for a unstructured group of participants. I should have started with something, no matter how light and unscalable, rather than just, “I have an idea”.

It’s too easy to shoot down “just an idea”, and too difficult to engage people behind “just an idea”. I didn’t have to provide David, but I at least needed to provide the marble.

5. Fragmented discussions

I used more than one technology to encourage group participation in Poetry Finder. Originally the technologies included this weblog and a Yahoo Discussion group, Bloggers Unlimited. There was then bleed through to emails, other discussion groups, and other weblogs.

The Bloggers Unlimited discussion descended quickly into exchanges based on RSS and FOAF, and other pure technology issues. I pulled back, trying another group, Renaissance Web, and this group has been extremely good at keeping the focus at a higher level, and avoiding too much detailed technical conversation.

A challenge with all of this communication: the discussions in the Yahoo groups, and in weblog postings and comments are good, but there is no way of tying them all together other than linking to specific messages, and through the use of Trackback. Of course, if we had something like Threadneedle, or ThreadsML, this problem would be solved because the technology would link everything together. Right?

I used to think so but after this experience, I’m not so sure. I’m finding that the connectivity between the discussion threads is not as much of a factor as the format of the threads, themselves.

For instance, I don’t think I’m the only person that got overwhelmed by trying to follow the discussions at the Yahoo group, Renaissance Web, excellent as they are. Even when sorted by thread, not date. I had a very hard time finding who said what at any given point, especially with all the embedded quoting and nesting and so on.

Trying to connect the Yahoo group discussions into the weblog was difficult because the discussion group entries have such a different style and emphasis compared to weblog posts. Weblog postings, even when focused on responding to other people, have a more persistent quality to them than discussion threads. There is a different feel to each type of discussion; trying to blend them all together in a meaningful way would be like trying to make orange juice by squeezing together six oranges and two apples. And a banana.

At least, it felt this way to me. Maybe I’m weird.

Then there is the problem of censorship in the Yahoo groups. The one and only time I deleted a message in Blogging Unlimited was a mistaken email that was sent to the group. The only time in Renaissance Web was one of my responses. However, during this time I was censored in another group — and the group was not informed that I was censored, or that this type of censorship had occured. This wasn’t directly related to the Poetry Finder, but it is a problem with Yahoo groups and even weblog comments.

Still, I liked the forum/email/discussion group because there is no ‘ownership’ of the topics, anyone could add new topics, and there was a great deal of good commentary, especially in Renaissance Web. Much more in this format than the weblogs.

However, rather than having a conversation about Poetry Finder in just my weblog, I was now having conversations in many places. This meant that I had to respond in multiple places, to multiple threads, some of which may have started out on the Poetry Finder but morphed into something else.

And how to bring all this together? How do I get the Yahoo Groups people to read my weblog entries on Poetry Finder? A link won’t necessarily do it. How do I get my weblog readers to follow the many discussion threads at Yahoo? Some people are more comfortable with email lists, some more with weblogs. How do we establish a communication across the different venues? Sure we can hack together threads between IM (Instance Messaging) and weblog and Yahoo group — but goodness, it would be like trying to hold a church service during half-time at a baseball stadium, simultaneously using a semaphore to signal the service to the blimp passing overhead, while there’s a flock of geese in the way.

And some of the geese don’t like each other.

Gah!

Within all of the many threads, I had to work at generating enthusiasm for this crazy idea in order to attract the group participation the project needed, in multiple venues. Tthis was getting exhausting, particularly in light of the pushback — not the direct pushback, which I was expecting; but the indirect pushback, the subtle and not so subtle putdowns, which I’ve not particularly adept at dealing with. The honest, intelligent, and concerned opinions from people that I’m spinning my wheels, and that his baby just ain’t gonna fly. Something else I’m not particularly adept at dealing with.

I wish I was Audrey Hepburn. She’d know what to do. And probably look good while doing it, too.

It’s not particularly easy to continue a ‘crazy, impossible, half-baked’ idea when you have a feeling that some people think you’re blowing smoke out your ass. In multiple venues.

Summary

What next.

Well, I have a tick bite, which isn’t healing. The housing complex pest specialist says the head is still in, but the infection should clear eventually. Creeps me out a bit, though — I have tiny insect chompers in my ankle. Had another thunderstorm this morning, and now it’s getting humid. My cat likes my Titanium PowerBook because she can use it for a head rest. I need to do laundry, but I’d rather take another walk. Somewhere covered in asphalt. I have some web pages and PHP I need to do for a friend. I think I’ll make chicken for dinner tonight. Better yet — BLT pizza. With beer.

Yeah: Huh?

I don’t want to abandon Poetry Finder, not the least because I don’t like people thinking that I’m ‘giving up’. Dammit, and I also think the idea has merit. But I do need to figure out how to correct my mistakes, and how to get this moving again in a constructive manner. Or, more likely, just let it die, another smoke filled idea.

In the meantime, weblogging as usual. Pics, writing, cat.

zoelovetibook.jpg