Categories
RDF Writing

Acks

Recovered from the Wayback Machine.

Practical RDF is heading into the production process and when next I see her, it will be as proofs. Talking with Simon, it sounds like the book will hit the streets in July.

July is a good time to release a book. Better than now.

The tech book industry has been taking some severe hits lately. The book publisher WROX went into bankruptcy leaving authors unpaid. In fact, many authors only found about the closure of the company through online lists and weblogs, though it sounds like the WROX staff made efforts to notify them the day before the doors closed.

O’Reilly itself has had to do some downsizing recently in some of its divisions. This is particularly difficult for O’Reilly because the people that work there are a very close knit group.

Anyway, I thought I would publish the dedication section of my acknowledgement because several of you are mentioned. Hopefully you’ll all be pleased. And Tim Tams and Godiva Chocolates would be a suitable thank you.

(Just joking.)

Books don’t get written in a vacuum and this book is no exception. I’d like to thank some special friends for their support and encouragement during the long, long period this book was in development. This includes my best friend, Robert Porter, as well as ++AKM and Margaret Adams, +Jonathon Delacour, Simon St. Laurent, Allan Moult, Chris Kovacks, *Loren Webster, Jeneane Sessum, Chris Locke, **Dorothea Salo, and others I’ve met in the threaded void known as the Internet. Thanks friends. It’s finally done.

*Who doesn’t like all this sappy, mushy stuff. Heh.
+Who likes cherries.
**Who should houseboat the Mississippi!
++ Who has neat new digs – but what’s with the chicken logo?

Categories
RDF W3C

We the monkeys

Recovered from the Wayback Machine.

Such a long time since I’ve written about technology. Feels odd – like wearing clothes from the 80’s that still fit, but you’re not sure about leaving the house with them on.

Today I hope to finish the last of the final edits for the Practical RDF book, which means no longer putting off the edits that, frankly, frustrate the hell out of me. Two years ago when I began my interest in RDF, I felt that the concept was sound but the discussion about it was obscure. I still feel that way now, hence the frustration. It also doesn’t help that this obscurity is matched with what can only be called intellectual elitism and its attendant arrogance, both of which drives away any potential audience and interest in RDF.

Apropros, Sean McGrath just wrote about RDF and the dangers of abstraction and losing one’s audience. Specifically he talks about RDF as the focus of the W3C’s Semantic Web effort, and how this is forcing ‘the masses’ into basically tuning out, losing interest in what could be a Good Thing. Bad he says:

I think it’s time for the Semantic Web proponents to stop trying to teach us all to think at their level of abstraction. We can’t (or won’t). Instead, the Semantic Web proponents should look at mapping transparently from the RSS 0.91, XFML 1.0 specifications that 94% of us are happy with, into the more abstract, generalized models that the other 6% need, for the applications they are all dying to take advantage of.

In other words, let everyone do their own XML thing and just transform the bloody mess into RDF/XML. Everyone can have their chocolate and eat it, too. Unfortunately, he uses that misbegotten, old, tired, and basically inconsequential, as well as absolutely boring RSS debate as the basis of his argument.

Sean is a wiz at transformation, and a bright light in the XML world; but he’s missed the boat with this one. The problem doesn’t lie with the specification and the W3C’s effort to promote it. The problem lies within the W3C, itself.

If you’ll forgive me a little digression, I want to talk a moment about another data model that faced stiff opposition. Years ago, a man named Codd wrote a paper proposing a new way of viewing data – the relational data model. Considering that he worked for IBM at the time, which was making considerable profit from non-relational data storage mechanisms, one would have expected that this paper, and the concept, would face stiff opposition, and it did. But the concepts Codd proposed of a standardized model and view of data that would allow one to focus on the essential of the business domain rather than the implementation of physical storage was a sound one. DARPA became interested and so did some folks at UC-Berkeley who created a system called Ingres, which formed the inspiration for the beginning commercial databases in use today. Commercial databases created by people who not only knew about technology, but knew how to sell that technology.

They succeeded. You can’t swing a dead cat without hitting a relational database in the world today.

Yet, Codd’s data model can be considered very esoteric for the average person. Very “abstract”. However, rather than abandon the abstraction necessary to ensure that data is consistent, valid, and can be merge with data from other systems, the creators of relational databases provided tools and technologies to handle most of the implementation details of relational databases, allowing company technologists to focus on their own specific business needs.

I want to build a claims system for an insurance company. Okay, I start by mapping the business domain data to the relational data model and then have the DBAs implement it. That’s my start. It saves me a great deal of time because without the relational data model and the database implementation, I first of all would have to decide what model of data I’ll use, and then figure out the most optimum implementation of that data store, and then build a prototype, test it, cross my fingers and hope it doesn’t result in invalid data – all before actually building something that actually meets the needs of the business.

The concepts underlying RDF are basically the same as those underlying the relational data model – a model for data that supports multiple business domains in such a way that the data from the domains can be merged and manipulated, consistently and efficiently. As an added bonus, both come with lots of tools that support that data management and manipulation so you don’t have to build your own.

Now, tell me: what’s so hard to understand about that?

You might be thinking that I’m supporting Sean’s assertions with my analogy comparing RDF to the relational data model and the implementors hiding most of the detail, but I’m not. One key factor in all of this is that people today design systems for the relational data model. They don’t throw the data out using their own unique variation of data store and then tell the DBA’s and programmers to map the data to the database.

In other words, a decision is made to follow the relational data model from the beginning, using whatever tools, technologies, and experts necessary to use the data model correctly. There ain’t no free ride. If you want the job done right, do it right from the beginning. Don’t give me no Excel spreadsheet and tell me to slap it into Oracle and expect the database to support 10,000 people. It don’t work that way. I know. You might need to transform the data from an old system into the new system in the beginning – but you don’t try and support both at the same time. Not and expect it to scale.

Sure you can transform RSS 2.0 to RSS 1.0 and back. But RSS is basically a brain dead business model. You have a source, the source publishes items, here are the items, in this order. Even my mama can figure this one out. Of course you can make it more complicated, which the dear hearts associated with RSS do at the spill of a latte; but the underlying business model is the same. RSS is not a good ‘example’ on which to make a stand either for or against RDF.

I agree with Sean in that the W3C shouldn’t be forcing pure RDF model theory on the masses; I disagree when he says to continue to use whatever, transform it, and just bung it in when it suits us to map to RDF. If we want to do the job right, let’s do it right, from the beginning. Which means that at some point, we’re going to have to understand how to map that data of the domain to the RDF data model. RDF must be made accessible.

Unfortunately, the W3C is its own worst enemy when it comes to promoting RDF and the Semantic Web, and understanding the concerns of just plain folks when it comes to ‘abstraction’. Why? Because there are no street smarts at the W3C.

The W3C has representatives from some of the best research labs in the world. They come from the best universities, the most prestigious R & D centers at the largest corporations, and the most influential standards organizations in the industry. In many industries.

However, few, if any of the members, have been woken up with a call in the middle of the night by the SysOp because a database system failed during a quarter systems run, and then had to try and debug the problem over the phone for a non-programmer. Or looked into the face of a customer service rep who is trying to figure out how a multi-screen application is going to make their jobs better, when before they had a simple one page form.

They’ve never been faced with a business manager who tells them to do the application using XML. Why? Doesn’t matter. This manager knows that XML is the Big Thang – therefore shut up and use it. And, oh by the way, use Python for the application, she’s heard that it’s the language to use. Why? Doesn’t matter, just shut up and use the language. Oh, by the way, here’s the specs. These two pages are specs? Sure, we’re using that new iterative approach to development. Don’t need all the requirements up front. Improvise.

And then, when you’re done creating that ultra-modern Python/XML, extreme, iterative, ultra-hip application that’s guaranteeed tested, bug free, on budget and on time, using GoF object patterns, and UML, and Rational Rose, and CVS, and what not, document it so a monkey can run the application. However, make sure the monkeys aren’t made to feel like monkeys.

Street smarts. In a more formal parlance – accountability to the using community. Somewhere along the way, the W3C has forgotten its accountability to the using community. The monkeys. Us.

Mark Pilgrim touched on this relatively recently when he said the following of XHTML 2.0 and its lack of backwards compatibility:

Standards are bullshit. XHTML is a crock. The W3C is irrelevant.

Now see – a monkey can understand this.

What’s interesting is that the W3C’s XHTML 2.0 reminds me of Oracle when it changed it’s underlying database foundation from partitions to tablespaces long ago. Sure it was the right thing to do, but it still almost caused the company to fail, the customers were that irate. You can’t just tell people to throw out their hard work because you have a ‘better’ way to do things. Not an maintain any credibility. Or customers. If you do have a ‘better’ way of doing things, its up to you to meet the community, not have the community meet you.

Two years I’ve worked with RDF in one form or another. After all this time, I still don’t understand half of what the RDF Core Working Group says in their little semantic debates. Is that a shocking thing to say since I’m writing a book on RDF? Could be. My editor is probably slapping his forehead right now as he reads this. However, when you consider that no two members of the group seem to either understand or agree with each other, either, I find myself in good company.

I’m not knocking Tim Berners-Lee and the RDF Core Working Group or the other W3C folks. They’re people who believe in what they do, have a vision, and the smarts and the drive to try and implement this vision. They genuinely love this stuff, and want to see it work. But somewhere along the way, they seem to have forgotten about us. The monkeys.

Categories
RDF Writing

Book review finished

Recovered from the Wayback Machine.

The public book review period is over.

I wanted to thank all the people who were kind enough to provide comments and feedback about the book. I received more feedback than originally anticipated, overwhelmingly so at times. This does demonstrate the interest in the community.

This wasn’t an easy book to write, not the least of which is re-writing some sections more than once due to changes in the RDF specifications and technologies – moving targets at the best of times.

I hope that the book reflects the different views of RDF within the community, and is comprehensive, as well as living up to its title, “Practical RDF”. We will see.

Thank you again.

Categories
RDF Writing

Mardi Gras Blues

I had hoped to attend the Captain Morgan Mardi Gras parade, but this flu seems to be quite partial to my body, and has decided to take an extended vacation among my various crevices. Right now, I believe it’s paying a protracted visit to my lungs and throat. However, I have high hopes of making it to the traditional Mardi Gras parade Tuesday night, flu or not.

I am determined to pick up one strand of genuine Mardi Gras beads. Just one. Hopefully I won’t have to flash a breast to achieve it.

The enforced stay at home has been productive from a book stand point. I’m finishing up the first round edits in preparation for the chapters being reviewed one more time by a ’subject matter expert’, and then to my editor for the final edits and pre-production readiness.

If you’re interested, I just posted a note about the process in more detail, as well as an HTML version of the first chapter, in the book weblog. The chapter still needs some editing but it’s getting there.

In the edited introductory chapter, I put more focus on the purpose of the book, as well as the section on when to use RDF/XML and when to stay with straight XML. I also incorporated information that explains my interest in RDF/XML.

If I have any edge at all with writing about RDF/XML, it’s that I come from a politically neutral position in the battle between the semantic web folks on one end and the markup folks on the other. I’m neither semantic web nor markup; I’m a data person, with many years of experience working with data at different levels and for different companies. Because of this neutrality, I think I can safely represent all interests or biases in the material. Or perhaps a better way of saying this is: I piss everyone off equally.

But, you know, I can live with this.

Besides, people haven’t necessarily been beating down the doors to write books on RDF. If you go out to Amazon and search on the term ‘RDF’ only a few books get pulled up, and they haven’t been exactly flying off the shelves. Of course, this will change when my book comes out and all of you go out and buy your copies of the book. And, no, Amazon still hasn’t corrected the author list, removing Ray as a co-author. I told you his name would be there forever. Grrr.

As usual when I mention the book, I have to again send kudos to Simon St. Laurent, my editor. This last week he helped me deal with the rather detailed criticism of the book I received from some of my reviewers – something that’s not always easy for an author to absorb.

Simon is what is known in our industry as and a Good Man and a Very Cool Dude. I owe him a box of my favorite chocolates – as soon as I get my advance and can pay for it. Or maybe I should send him a box of Tim Tams?

There’s a funny story associated with the list of books at Amazon on RDF. If you look at this one you’ll see that it’s ‘authored’ by Dan Brickley. The story behind this is that the publisher grabbed the W3C specifications, which were, I believe, public domain or at least allowed this type of re-publication, and then did nothing more than reprint them in the book, exactly. They plunked Dan Brickley’s name on it since he was one of the co-authors of the original specification. This action pissed Mr. Brickley off quite a bit, as you can imagine.

Cheesy thing to do? Damn right it was. But also an excellent example of what can happen to material once it’s in the public domain.

Categories
Copyright RDF Writing

Checking in

Thanks for well wishing. The suggestion of tea was a good one, but unfortunately I can’t drink any acidic juice such as OJ, as it hurts my throat more than a little.

Doing a bit of catch up. There were a couple of items of RDF I had to respond to over at Practical RDF, both of them related to postings from editors on the book. My only comment in addition to my two postings is this: I have a great deal of respect for the RDF Working Group. They worked, hard, to reach Last Call status on the newest RDF specification documents. All that’s left is a few odds and ends, and they can call their job done. It would be a real shame if the group took all that hard work and drop kicked it off a cliff in a burst of tired arrogance at the end of the day.

Liz joined the fun on Creative Commons with a challenge to Jonathon and myself to provide reasons for why not to use the licenses:

 

How ‘bout a “non-shithouse” version of why people might choose not to use the license, that can live side-by-side with the CC discussion of why they should?

Well, you only have to search on “creative commons” among my archives to see my comments, though I’m not sure about their ‘shithouse’ status. I look to Jonathon to provide a better answer to Liz, if he wishes, as the RDF posts took my time tonight, and I’m to bed. However, it seems to me that if Creative Commons is to be effective, it’s up the members of the CC to detail the problems associated with the CC licenses as well as the advantages. I’ve pointed out to a couple of members the writing that Tim Hadley has done; hopefully they’ll consider writing a post or two on these issues to go with all the postings about this artist or that blogger that has attached a CC license to their work.

I was more interested in responding to the discussion Liz and Dorothea are having about about academia. Specifically, I wanted to pursue the thread off this conversation that Baldur started:

 

Everybody speaks the same, in the same way, about the same thing, with little to no variation. We could easily turn into the lightspeed version of the same unsubstantiated bullshit of postmodern academia, shedding even the pretense of giving ideas space and scope for discussion.

What killed the author and poisoned academia is trying to return through the violated corpses of a horde of ’blogger-zombies spouting inane commentary on the links of the day.

But as the popularity of weblogging increases, the number of meme-victims will rise and the blogdex top fifty will not only describe the fifty most popular subjects amongst webloggers…

It will describe the only subjects.

What I’ve tried to say in a thousand words, Baldur said in a few. I wanted to write in response, but lacked the energy to respond well. I couldn’t do justice to Baldur’s words.

But when I’m well, and have the energy to respond, to do Baldur’s writing true justice, should I?