Chasing Non Swimmers from the Data Lake
November 22, 2014
If you are one of the Big Data believers, you will find “Clearing Up Muddied Waters in the ‘Data Lakes’” a reminder about the plasticity of concepts and their connotations. The write up addresses a clever phrase used to describe a pool of stored data. In the words of the write up:
You store raw data at its most granular level so that you can perform any ad-hoc aggregation at any time. The classic data warehouse and data mart approaches do not support this.
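For readers who want the distinction in concrete terms, here is a minimal sketch of the idea in that passage. The sales records, the field names, and the `aggregate` helper are all made up for illustration; nothing here comes from the write up itself. The point is only that granular rows can be rolled up along any field after the fact, while a pre-computed summary cannot be un-rolled.

```python
from collections import defaultdict

# Hypothetical raw records kept at the most granular level: one row per sale.
raw_events = [
    {"day": "2014-11-20", "store": "A", "product": "widget", "amount": 10.0},
    {"day": "2014-11-20", "store": "B", "product": "gadget", "amount": 25.0},
    {"day": "2014-11-21", "store": "A", "product": "gadget", "amount": 25.0},
    {"day": "2014-11-21", "store": "A", "product": "widget", "amount": 10.0},
]


def aggregate(events, key):
    """Sum the 'amount' field, grouped by whichever field the caller names."""
    totals = defaultdict(float)
    for event in events:
        totals[event[key]] += event["amount"]
    return dict(totals)


# "Lake" style: any roll-up can be computed later from the raw rows.
by_store = aggregate(raw_events, "store")
by_product = aggregate(raw_events, "product")

# "Mart" style (illustrative): only a daily summary was kept.
daily_summary = aggregate(raw_events, "day")
# The per-product question can no longer be answered from daily_summary,
# because the product field was discarded when the summary was built.

print(by_store)
print(by_product)
print(daily_summary)
```

If only `daily_summary` had been stored, the by-product question would require going back to a source that no longer exists in this sketch. That is the limitation the quoted passage attributes to the classic warehouse and mart approaches.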
The write up points out that the original notion of a data lake has been prodded, stretched, and pulled. Not surprisingly, after the verbal chiropractic, data lake is just not its old self.
Who are the perpetrators of this conceptual improvement? A “real” journalist and—no big surprise—several Big Data experts laboring away at a mid tier consulting firm.
So what? The coiner of the phrase points me and other readers to the original write up about data lakes here. Worth revisiting? Are the “real” journalist or the mid tier consultants likely to read the source document? I would guess not.
Stephen E Arnold, November 22, 2014