Do Math In Apache Hadoop With H20
May 2, 2013
If you want to do math in Hadoop, this information on Oxdata/h2o from GitHub is for you. Apache Hadoop, the software library designed for the processing of large sets of data is run by H20 to do math over BigData. The vision for the introduction involves using the primary execution framework for whatever algorithm is presented. The program also reads and writes from and to HDFS, S3, NoSQL and SQL. It is even able to pass and evaluate R-like expressions. The article explains,
“H2O keeps familiar interfaces like R, Excel & JSON so that big data enthusiasts & & experts can explore, munge, model and score datasets using a range of simple to advanced algorithms. Data collection is easy. Decision making is hard. H2O makes it fast and easy to derive insights from your data through faster and better predictive modeling. H2O has a vision of online scoring and modeling in a single platform.”
The targeted users are mainly data analysts. H20 hopes to vitalize the community of invested software engineering enthusiasts and provide everyone concerned with the tools to hack data with math and algorithms. If you are interested in being a part of this community, join the Google group h20stream.
Chelsea Kerwin, May 02, 2013
Sponsored by ArnoldIT.com, developer of Augmentext