news4geeks.net
30Jan/120

Pentaho open sources ‘big data’ integration tools under Apache 2.0

Posted by vica

BI vendor Pentaho is open sourcing a number of tools related to big data in the 4.3 release of its Kettle data-integration platform and has moved the project overall to the Apache 2.0 license, the company announced Monday.

While Kettle had always been available in a community edition at no charge, the tools being open sourced were previously only available in the company's commercialized edition. They include integrations for Hadoop's file system and MapReduce as well as connectors to NoSQL databases such as Cassandra and MongoDB.

Those technologies are some of the most popular tools associated with the analysis of "big data," an industry buzzword referring to the ever-larger amounts of unstructured information being generated by websites, sensors, and other sources, along with transactional data from enterprise applications. Read more...