Indiana University and a Big Data Set
January 20, 2013
Short honk: If you are looking for a big data set to show off your Big Data system, Indiana University can help. “Click Dataset” says:
To foster the study of the structure and dynamics of Web traffic networks, we make available a large dataset (‘Click Dataset’) of HTTP requests made by users at Indiana University. Gathering anonymized requests directly from the network rather than relying on server logs and browser instrumentation allows one to examine large volumes of traffic data while minimizing biases associated with other data sources.
There are some caveats, but for the firms with sci-fi-type Big Data analytics systems, the issues should be irrelevant. “Truthy” in advertising? For companies with real-world systems, the caveats are important.
Stephen E Arnold, January 20, 2013