Hadoop Summit 2015 — As the big data universe evolves and expands to the cloud, Pentaho, a Hitachi Data Systems company, today announced support for integration with two popular big data technologies, Amazon Elastic MapReduce (EMR) and SAP HANA. Available today, Pentaho 5.4 offers new capabilities that build on a pragmatic and future-proof platform for big data orchestration and analytics at scale, further empowering organizations to drive value with Pentaho’s Big Data Blueprint use case designs.
Big Data Deployment in the Cloud, Amazon EMR Support
In Pentaho 5.4 customers can now use Amazon EMR to natively transform and orchestrate data as well as design and run Hadoop MapReduce in-cluster on EMR. IT organizations now have new and powerful ways to operationalize a cloud-based data refinery architecture for on-demand governed delivery of data sets.
Amazon Web Services and Pentaho customer, The Lucky Group, consolidates, refines, and analyzes several Terabytes of data to drive a deeper understanding of retail customer segmentation, acquisition, and retention. According to Jay Khavani, Sr. Manager Business Intelligence, The Lucky Group, “Pentaho was indispensable in transitioning to a cloud-based data architecture, with an adaptive big data layer that ensures business continuity when incorporating new technologies.”
Blended Data Delivered, SAP HANA Support
Enterprises can now leverage SAP HANA’s high-performance capabilities on a wider variety of data. Pentaho 5.4’s integration with SAP HANA enables governed data delivery across multiple structured and unstructured sources.
Scaling Hadoop in a Big Data Environment
Enterprises running Hadoop find that data variety and volumes increase over time, making reliable performance and scalability mission-critical priorities. Pentaho recently executed a controlled study that demonstrates sustained processing performance of Pentaho MapReduce running at scale on a 129-node Hadoop cluster. The results build on the value of the Pentaho platform, delivering high performance processing at enterprise scale in big data deployments.
Additional capabilities and enhancements in Pentaho 5.4 include:
Integration of Pentaho Data Integration (PDI) with Apache Spark, enabling orchestration of Spark jobs
New APIs to simplify embedding analytics into business applications and processes
A refreshed, modern look for Pentaho Data Integration
Ability to localize Pentaho in French, German and Japanese
“We continue to deliver on our vision to help organizations get value out of any data in any environment with Pentaho 5.4,” said Christopher Dziekan, Chief Product Officer, Pentaho. “Our open and adaptable approach means customers choose the best technology for their businesses today without the worry of being locked-out in the future.”