Huffpost Politics
THE BLOG

Featuring fresh takes and real-time analysis from HuffPost's signature lineup of contributors

Alan Rosenblatt Headshot

Creating Permanent Links to Online Government Documents Starts Now

Posted: Updated:

Last summer I wrote about a great idea to make government documents permanently citable. As it stands now, it is impossible to cite online references to government documents, from the executive, as well as proposed legislation, because the URLs for these references change over time. That is simply unacceptable. If you don't believe me, ask the nearest librarian.

Fortunately, a simple idea was offered to solve this problem at Citability.org.

This coming weekend (April 9-10), Citability.org is hosting a codeathon to create a working prototype for making online government documents permanently citable. And the timing is brilliant, as The Public Online Information Act was recently introduced in Congress.

Coders interested in participating in the codeathon can register at DCcodeathon.eventbrite.com. On Sunday, April 10, from 4-6pm, government officials and the press are invited to see the demos of what the coders build on Saturday. The codeathon and demos will be held at Microsoft's offices in Friendship Heights, MD (5404 Wisconsin Ave, Suite 700).

While the goal of creating a fully public online system of government documents may seem daunting, creating permanent citations as laid out by Citability.org provides a impressively straight forward foundation to make it a reality.

In order to provide a more complete picture of this project, I am posting below a guest piece from Citability.org's founder, Silona Bonewald:

Stop Fishing and Start Feasting: How Citable Public Documents Will Change Your Life

Putting government documents and data online is a great step towards making our government process more transparent to the people it serves, but in many ways simply making the material available is like serving someone dinner by giving them a pond full of fish. The pond is huge and the poor dinner guest doesn't have any tools. Worse, they're only looking for one particular bass, and every time someone sends them to where they last saw the fish it's long gone.

The recent healthcare bill was more than 1,000 pages long. The budget can often be half again that big. Commenting on these types of documents as they are currently implemented is extremely challenging. Pointing a finger at that big pond and telling someone that you swear you saw a fish isn't very effective. It's even worse when someone swears they saw a fish that isn't really there and it is effective because no one is willing to refute them. No one has time to wade around themselves and so they take it on faith. The recent "killing grandma" scare is an excellent example.

Citations, first, are a way of pointing at the fish. A simple paragraph level of granularity for references should be enough. This promotes ease of implementation and use and provides a tight enough zoom to bring someone right to the material being discussed.

The next problem is that fish move. If you're trying to point out a moving fish, and show it to someone later, you need to have a photograph with a timestamp. That line in the budget about forcing our children to manufacture chemical weapons might have moved to page three the next day, or a wily senator may have changed the wording and put it under a different heading. Proper citability requires an archived snapshot of the online material that maintains the integrity of any reference links.

Lastly, for someone to believe you about this fish, you need to have a way of pointing out where you saw it at the specified time. They'll want to know it was the same pond.

Making it possible to create timestamped permalinks at a paragraph level of granularity would be a huge leap forward in increasing government transparency through its online documents. The same principles apply when producing citable government data. When recovery.org decided to display visual representations of the data coming in about recovery money around the nation, it quickly became clear that some amount of data was erroneous. When the errors were reported and the data was later modified, there wasn't any way to go back and compare the two versions to see what changes had taken place. A blogger, reporter, statistician or scientist should be able to run a query against any specific collection of government data, as it was published, for a given version or moment in time.

WHAT WE'RE DOING

The nonprofit, nonpartisan League of Technical Voters has proposed a simple, easy to build and implement citability solution. Open source software development is underway and a wide range of government institutions are already on board. If you would like to help with this effort, consider being part of our upcoming codeathon or create your own codeathon. Microsoft is a platinum sponsor of this open source event.