Very Large Databases – Googzilla Being Coy

August 31, 2009

I read Technofeel’s “VLDB09 Part Two” and noted another Google head fake. Technofeel points out that Google’s paw prints were all over the conference from his point of view. MapReduce and Hadoop (an open source semi MapReduce) presentations caught his attention. In my opinion, the most interesting comment in the write up was:

Finally, I ended my visit at VLDB09 with two presentation of Google Interns about data mining to get structured result sets out of semi unstructured pages with lists and tables.

These two Google papers are important. You can get links to them from Technofeel’s article. Let me make two or three observations:

The use of “interns” is a way for the Google to reward bright folks while keeping the big guns off the podium. The experience of the Google Books product manager makes this use of interns prudent.
The content of the papers is not intern grade. When you work through the two documents, you will learn that Google has made significant advances in methods for working out issues in manipulating Google-scale structured data and discerning context.
The traditional world of relational databases is on a collision course with Googzilla. Big data are part of the Google core competency.

Those are some interns because their co authors are among Google’s most sophisticated researchers and academic colleagues. Technofeel’s instincts are good. He may want to check the bios of the secondary and tertiary authors of these Google papers. The interns are not the hubs on these wheels.

Stephen Arnold, August 31, 2009

Written by Stephen E. Arnold · Filed Under Business strategy, Database, Google, News, Technology, Text analytics, Text processing

Comments

One Response to “Very Large Databases – Googzilla Being Coy”

Getting Closer to Real Time With Hadoop | Digital Asset Management on September 21st, 2009 1:49 am

[…] Very Large Databases – Googzilla Being Coy (arnoldit.com) VN:F [1.6.5_908]please wait…Rating: 0.0/10 (0 votes cast)VN:F [1.6.5_908]Rating: 0 (from 0 votes) Spread the DAM news: […]

Search the site
Subscribe to Beyond Search
Feature archive
News archive

Stephen E. Arnold monitors search, content processing, text mining and related topics from his high-tech nerve center in rural Kentucky. He tries to winnow the goose feathers from the giblets. He works with colleagues worldwide to make this Web log useful to those who want to go "beyond search". Contact him at sa [at] arnoldit.com. His Web site with additional information about search is arnoldit.com.