Categories
Documents

Flying high in the (Document) clouds

DocumentCloud is an online document management system for journalists. It provides a way to upload and organize documents, making them easier to share with the public and other team members. In addition, DocumentCloud also provides a set of tools enabling a host of functionality, including the ability to search among all of the uploaded accessible documents. (example from the Washington Post)

Currently, my documents are all uploaded to my own server. They’re easily available, true, but if I were to someday be nibbled to death by ducks, the documents would vanish from the internets. In addition, there’s no functionality available for searching among the documents, and they’re isolated from other comparable documents, making them a little less than useful for generalized researching. Yes, they are accessible via Google, but documents need a bit more.

They need persistence not dependent on a frail human body.

DocumentCloud was kind enough to give me an account, once I demonstrated the intent of my document web site, and how I’ve used my documents as source material for writings. Now I’m in the process of figuring out how best to organize them for uploading, which includes determining what I’m going to name the document files.

The RECAP folks had legitimate criticism of my file name scheme, which consists of folders containing documents named “document1.pdf” and so on. The names are meaningful in context, but meaningless out of context. This is especially true for my court documents.

I could use the RECAP naming scheme, which consists of court system, specific court designation, case number, document number, and attachment number, if any. An example is “gov.uscourts.dcd.129639.1.0.pdf”, which is the US federal court system, the DC court, case number “129639”, document 1, and the main document, not an attachment.

The FOIA Project uses something similar, but it actually spells out the name for the document. An example is “dc-1-2010cv00883-complaint-attachment-1.pdf”. This name breaks down to the DC court system, the year and court type (“2010” and “cv” for Civil), as well as the case number, in addition to the document type and attachment. (As a side note, the FOIA Project also uses DocumentCloud.)

PACER assigns each case a unique PACER number within the system, but that’s only useful when accessing the documents via PACER.

I’ll have to live with whatever I decide, because some of the cases I follow have hundreds, even thousands, of court documents. I’m going to do the “rename and upload to DocumentCloud” thing once.

You can provide file name suggestions on Google+.

Categories
Documents Web

Harvard Business School: it will cost you to link to us

Discovered via Facebook, Harvard Business School’s extraordinarily parsimonious attempts to milk every last penny out of its material:

No one ever charges people for the act of curating and directing attention. That is our job. It is our mantra. But that is precisely what HBSP are doing. To be sure, HBR looks like any other magazine and is already paid for under an institutional license by the UofT library. What HBSP are charging for are the links to those articles made by academics in the ordinary course of teaching. While our Dean was clear that he wanted us to continue to treat ideas as ideas and give the best to our students, he also wanted us now to be aware that it was costing money each time we did so. I don’t know how much it costs but my guess is that it is $5 or so per article per student. So if I have a class of 40 students and in the process of being thorough, add 10 HBR articles to my reading list or class webpage or, if the students do the same on bulletin boards for the class, we have to pay $2,000 for the privilege. I want to adhere to norms but that is enough to cause me to think twice.

Note, this isn’t direct access to the article content, this is just linking to, or citing the article.

I guess Harvard must be hard up for cash.

Categories
Documents

Lavabit Court documents

The company Lavabit shut down rather than give the government encryption keys that would expose all private communication related to its email server. This is the same server used by Edward Snowden.

The Lavabit founder, Ladar Levinson, was finally able to get the court documents related to this action unsealed, and they were posted yesterday. I was fairly sure that Kevin Poulsen of Wired’s Threat Level would post an actual copy, rather than just discuss them, and I was correct.

I’m also posting a copy in the interest of dissemination. I recommend, though, that you read Poulsen’s overview of the case.

Categories
Documents

Open Feynman Lectures

At Quantum Frontiers:

Last Friday the 13th was a lucky day for those who love physics — The online html version of Volume 1 of the Feynman Lectures on Physics (FLP) was released! Now anyone with Internet access and a web browser can enjoy these unique lectures for free. They look beautiful.

Categories
Books Critters

Sharing photos

Ringling Brothers: The Greatest Show in Court book coverThe photo for my newest book comes from Shutterstock. It’s not a perfect photo. It’s a little dark, a little blurry and out of focus. But no other image worked for the book. When I saw it, I knew this was the image I wanted for my cover. Authors get funny that way, which is why publishers rarely let us anywhere near the cover.

Thankfully, O’Reilly’s Director of Brand Management and expert on all things book covers, Edie Freedman, kindly volunteered to help me pummel the photo into shape. She also helped educate me on what makes a good book cover. For instance, I didn’t know about needing to leave space on all sides of the cover page. I also wasn’t aware that when you’re a relatively unknown author, as I am, you want to put your name at the top of the page; get a little name recognition going. She helped polish away many of the photo’s distractions, and find a font that, I think, really makes the cover snap—especially in smaller sizes, which is what shows up on Amazon pages.

The cover image is probably the only photo I’ll be using from Shutterstock in my book. Most of the images will come from the court case and investigations the book covers. The others are coming from photos at Flickr made freely available for use with a Creative Commons license. You can use a photo in a book, as illustration, if the CC license permits noncommercial use.

Some of the photos are from folks who have attended the Ringling Brothers circus or the associated animal walks. Others, though, come from the Circus collection of the Boston Public Library. This wonderful institution has not only uploaded extraordinary graphics and photos to its Flickr account, it kindly allows people like me to use the photos in a non-commercial setting (such as within a book for editorial or illustrative purposes). My favorite set of theirs is, of course, the one related to the circus.

I’ve always been reluctant about the Creative Commons license, not the least of which, the licenses are a bit confusing. For instance, it took me the longest time to figure out that using a photo as illustration within a book that isn’t focused on selling said photo is not a commercial use of the photo. Or at least, that’s the interpretation I’ve seen most frequently given, and the one I’m sticking with.

I can now see, though, why having a licensing scheme such as the Creative Commons is so helpful. It wasn’t necessary to have older photos and circus posters in the book…but the added color and history makes it more lively.

Old circus poster

I was so grateful to the Boston Public Library that I decided to upload all of my photos to my new Flickr account and offer them for use. The CC license I picked is very open, other than I restrict commercial use because I don’t have model releases for people and buildings and don’t want to hassle with the potential content copyright issues.

I’ve already had one of my photos used in a Missouri Department of Tourism pamphlet, for illustrative purposes. I don’t claim to be the best photographer in the world, and most of my photos are ordinary. But you never know when one of your photos might help someone, so I just uploaded them all, let folks use them or not.