Publishing Matters
What's on your mind?
 Thursday, April 03, 2008
Note: The impact of Google on the way we do business is really a by-product of much more significant culture change in the evolution of human society. Michael Cook, a Managing Director at AG Asset Management, a money management firm in New York City, who is also an essayist, gave me permission to share his thoughts with you. He can be contacted at mcook@ag-am.com.
—Eugene G. Schwartz, Editor at Large


Life as we know it depends on DNA to transmit information from one generation to the next. Until the appearance of the human race, this was the only way favorable adaptations were retained. Thus, only those adaptations that were genetic in nature drove the progress of evolution. With the invention of language, however, a new type of evolution could occur—what Julian Huxley termed “psycho-social” evolution. The DNA of this evolution is language, and with language came the ability for humans to transmit information from one generation to the next linguistically, as well as genetically. This meant that adaptations innovated by individuals not only could be continued and built upon, but also that individual learning could accumulate from generation to generation. This sped up the pace of evolution immeasurably.

The accumulation of social knowledge brought with it new dilemmas. After a period of time, the traditions and knowledge of the human species became so vast that storing it efficiently became difficult. Oral tradition depends upon memory, which is limited. The art of memory systems was developed by the Greeks to extend the range of human memory, and the poetry of Homer used rhythms, rhymes, and other patterns to aid the memory so that it could retain vast amounts of cultural information. But these techniques were limited: ultimately the problem of storing what we could loosely refer to as the psycho-social “genome” became serious. This problem was solved by the invention of writing systems.

However, to be useful, information must not only be stored, it must be retrieved. Fairly recently in human history it was possible to have every book ever written on your bookshelf. The invention of the printing press was a watershed event in the technology of writing, which ensured that this could not remain true for long! Nevertheless, the retrieval of information from the general store was still something that could be done in a fairly straightforward manner. Of course, centers of learning—monasteries, universities, libraries – developed to manage the growing base of human knowledge. But at some point, it started to become clear that the problem of information retrieval was becoming a roadblock to the continuing development of knowledge. It also became clear that computer technology was well suited to addressing the retrieval issue.

In 1965, J.C.R. Licklider wrote Libraries of the Future, which summarized a project he had undertaken at Bolt Beranek and Newman. In his book, Licklider predicted that all human knowledge would be available on a “fast, random access computer” by the year 2000. His vision seems to be coming true. In December 2004, Google announced a project in which the libraries of five of the world’s leading academic institutions are to be digitized and made available for search and reading online.

But still, even if everything is “available” online, how can relevant information retrieval be effectuated? This is the key problem that Google addressed, and its successful solution to it, although just a beginning, essentially created the “search” industry. Google’s initial solution is called the PageRank algorithm. It was the breakthrough that started delivering search results that are relevant to the user’s search. Before Google, this had really not been the case. Their insight was to use the link structure of the web—the fact that web documents “point” to other web documents - to measure how popular sites were, and to then trust the “wisdom of crowds” by using a site’s popularity as a measure of its relevance. This, in conjunction with the appearance of search terms on the site, proved to be a surprisingly effective ranking mechanism, and the first algorithm that consistently gave users results they found useful.

At present the search industry is evolving very fast—everybody seems to have incorporated Google’s insight into their algorithm, and the race is on to understand what users mean, and what they are intending with their searches. Google’s PageRank algorithm does not address semantic content: indeed, this is part of the genius of the solution—the way it neatly sidesteps this very difficult problem. The next generation of Web Search is yet to come! But the major breakthrough that made search results relevant was invented and engineered by Google.

So here’s the progression as I see it—the thumbnail sketch of the evolution of life on earth: DNA, language, writing, printing, computers, the Internet, Google’s search algorithm.

This is why I say that the future of search is the future of life on earth, and that Google’s algorithm represents a watershed event, analogous to the invention of writing, or the invention of the printing press.

Am I overstating my case? Perhaps. But I don’t think so.

—Michael Cook

Posted by: Eugene Schwartz, Editor-at-Large

Thursday, April 03, 2008 3:03:25 PM (Eastern Daylight Time, UTC-04:00)
Michael's argument, as usual, is quite sound. What jumped out at me as I read it was the compression factor of our evolution. How many thousands of years passed between the time language evolved and written communication was developed to store our language? Then, how many thousands of years passed between the evolution of written language and the invention of the printing press? Then, only hundreds of years passed between the printing press and computers. Decades then elapsed between computers and the Internet. Now, only a few years were required for Google to add the search algorithm. What's the next evolution in this chain? Whatever it is, it probably has invented - or it will be soon.
Michael Davis
Comments are closed.