If websites, databases and other content are the landscape of the virtual world, then search engines are the maps. Without search engines, the landscape is confusing and getting lost a certainty. With them, finding one's way through the dense forest of information is possible if occasionally made difficult with unexpected detours and dead-ends.
Disappearing from the results of dominant search engines leads to invisibility. And if one has a website, a blog, an ecommerce site, or a database that no one knows exists, it is useless. Given how critical maps are to successful navigation, having accurate, affordable maps that fulfill the variety of needs of a diverse population is key. So, how would we all feel about giving one for-profit company the exclusive right to map, say, New Jersey or Mexico? If no one else could produce a map of New Jersey, there would be no market incentive to produce the best maps that met all the various needs of the population (shortest route to Delaware from New York, coffee with baby-changing stations). If the mapper wanted to direct traffic to its stores in Toms River, there would be no incentive to produce maps that showed the most direct route to Delaware instead of detouring through Toms Rivers.
Yet giving just such exclusive rights to some important internet territory is one of the key issues involved in a proposed settlement between Google and all the book publishers and authors in most of the English-speaking world.
Briefly -- Google undertook a project to digitize millions of books in the libraries of several major universities such as the University of Michigan and Stanford. Google copied books in their entirety that are in the public domain, as well as those still in copyright. A handful of US publishers and the Authors' Guild, a not-for-profit organization representing US book authors, sued Google for copyright infringement. Just a few weeks ago in federal court in downtown Manhattan, the judge listened to a day's worth of objections and support for a proposed settlement agreement that runs over 300 pages.
This complex agreement accomplishes several things that would be beneficial to the public, authors, and the scholarly community. Under it, digitized books that are part of Google's database would be made available in snippets as search results, and, unless the publisher or author objected, the entire book could be part of paid-for library subscriptions or various kinds of ebooks. Previously buried and obscure works would suddenly see the light of day. And, because Google would facilitate text-to-speech functions for this database, all of these some 17 million books (Google has given varying estimates of the numbers digitized) would become available to those who have sight disabilities.
Why would Google spend all that money -- millions to digitize, more millions to litigate the case it had to know would come, and more millions to settle that case -- for what will amount to a library lending and ebook business? Keep in mind that Google's revenue alone last year was $23.6 billion. This is more than half the $40.3 billion in total revenue generated in the United States by more than 100,000 publishers. And not one dollar of Google's revenue came from publishing books. It came from the enormous ad revenues generated by Google's search and Ad Sense business. With a profit margin of approximately 25%, search in 2010 is far more profitable than publishing.
If the settlement is approved by the Court, Google will be the only search engine that will serve up search results that include the contents of some 5-10 million books -- the books whose authors, publishers, copyright holders can't be found or don't want to be found. Because of the intersection of copyright and class action law woven together by the proposed settlement, no one else will be able to do that. What does that mean for Google? It means that the results and experience from a Google search, as opposed to the results from any other search engine, will be richer. It means that Google's ability to refine its algorithms for search results and its analysis of consumer behavior, interests, and needs will have a depth and a range that no one else can match.
A recent article in Ars Technica described Google's current practice of keeping consumer data for 9 months, much longer than any other major search engine, because it uses the data for a variety of important (and profitable) business needs: "Search data is mined . . . by watching how users correct their own spelling mistakes, how they write in their native language, and what sites they visit after searches. That information has been crucial to Google's famously algorithm-driven approach to problems like spell check, machine language translation, and improving its main search engine."
Google's exclusive ability to map these books, and to observe how consumers interact with that map and the content that these books represent, would give Google a significant competitive advantage in the most profitable internet related market in which it is already dominant. Not surprisingly, the Department of Justice has announced that it is investigating.
Google has publicly proclaimed that without this settlement these out-of-print books will remain buried in libraries with no ability for most people to find them. But is that necessarily true? If it is indeed a public good for these books to be accessible, then shouldn't it be public institutions, perhaps with private cooperation and funding where appropriate, that accomplish that result?
Couldn't the Library of Congress start to assemble a digital database that would be used (perhaps for a fee) by all search engines? After all, US copyright law currently requires that two copies of every work registered be deposited with the Library of Congress, unless exempted by regulation. Why not have one of them be digital with appropriate safeguards? Couldn't (and shouldn't) Congress finally enact some kind of safe harbor or compulsory license scheme so that digital copies of past work are made available for limited uses such as search with compensation to rightsholders where appropriate?
After all, if the goal is to create a library for benefit to the public then a private database won't cut it. If this settlement is approved and actually starts to operate, Google's insuperable advantage may well prevent all the other possible players, both public and private, from helping to create something truly public and accessible to all.
So yes, search is important to Google, and to Google's rivals Microsoft and Yahoo, who want the same copyrighte
There are also other libraries and universiti
The Google scanning project has made no attempt to identify which are "orphan works" or to trace the rights holders. The Settlement covers every book published in the US, the UK, Australia, and Canada before January 5, 2009. Millions of books were scanned that are not only copyrighte
The Settlement contains use and payment provisions applying to the entire rest of the book's copyright term, however many decades that may be. The Google database of scanned books is not publicly available, but one party said in a letter to the court that it is a total mess, with bestseller
No attempt has been made to locate authors of "orphan works." Google merely asserted the rights to fully use every book not opted out before a tight opt-out deadline, and made a scant effort to locate authors even en masse.
The Google Settlement contains clauses saying that if a copyright holder opts into the Settlement but Google uses the books without that person's permission
Any truly orphaned US work is by definition published after 1923. As a book collector, I know that most truly rare books are old enough to already be in the public domain. Furthermor
In fact, Google's whole "orphans" PR is a shuck, designed to disguise as altruism a massive grab to control the publishing industries of several countries. The language of the Settlement indicates plans to _sell_ those books, not give them away to the public. Google has announced that they will launch their e-bookstor
Other search engines such as Microsoft and Yahoo are merely disguising as altruism a desire to use copyrighte
THis reminds me of something that happened in San Francisco a few years back. Google offered to build a city-wide wifi system which would provide broadband wireless Internet to everyone in the city for free. The City was initially interested
Talk about biting off your nose to spite you face...
Imagine an interweb based on true democracy. It would be possible to directly ask the population to select their preferred choice. By posing a question, and responding to the majority vote. Thus any route forward we might take, could be rapidly reassessed and revised.
If it became clear that a particular organisati
“If websites, databases and other content are the landscape of the virtual world, then search engines are the maps”
But the prime motive force remains the human brain.
Some people don't like change: are you one of those?
I suspect there are several agendas going on. First, Google (and the Internet tubes in general), have take away royality free, no copy right hassle reprints. Who is going to buy Shakespear
Second, it seems that some writers and publishers seem to be arguing for permanent copyright for themselves
As for the map analogy--c
It is incorrect to credit Google with the "exclusive ability to map these books." Anyone digitizing books -- or producing them originally in digital form (most books are sent from the publisher to the printer as pdfs) -- can index the text using, say, the open-sourc
What Google has done is to deploy a small army of scanners at two university libraries -- but libraries all over the world are doing this for themselves -- and many publishers are doing the same in their production of eBooks. Scholastic
So, while I agree with the sentiment of your post, the "insuperab
If the settlement is with a class represente
If the Authors' Guild claims to represent these authors who "cannot" be found -- well, perhaps they can claim to be the widow of the unknown soldier as well.
Google might find a more winning strategy in attacking the basis of the class, rather than touting their rights based on some public good which should more rightfully be extended to publishers and libraries.
And, what happens to the advertisin
The widow of the unknown soldier indeed.