Pentaho BI
Pentaho is the world’s most popular open source business intelligence suite. It offers a complete range of business intelligence (BI) capabilities, including reporting, analysis, dashboards, data integration, and data mining.

Pentaho allows users to select individual products for their use. You can choose the whole BI suite or individual products, depending on your requirements.

The Pentaho BI Suite includes the Pentaho Reporting, Pentaho Analysis, Pentaho Dashboards, Pentaho Data Integration, and Pentaho Data Mining components.

Pentaho Reporting

Pentaho Reporting consists of Pentaho Report Designer, Metadata Editor, Design Studio, Pentaho BI Server, the reporting engine, and the Pentaho Administration Console. The Pentaho Reporting solution provides the following key features:

  • Rich report designer for creating pixel-perfect reports
  • Metadata creation via the Metadata Editor to provide web-based ad hoc query and reporting for business users
  • Fine-tuning of reports created with Report Designer and ad hoc reporting through Design Studio
  • Broad data source support including relational, OLAP, or XML-based data sources
  • Popular output format support including PDF, HTML, XLS, RTF, or plain text
  • A complex scheduling subsystem that enables users to set reports to execute at given intervals
  • The ability to email a published report to other users
  • Cross-platform support for both client and server components (Mac, Linux/Unix, Windows)
  • 100% Java-based product, providing portability, scalability, and ease of integration
  • The ability to securely view, schedule, share, and deliver reports over the web




Pentaho Analysis
Pentaho Analysis consists of the Mondrian ROLAP engine, Schema Workbench, and Aggregation Designer components. The following are the key features of the Pentaho Analysis solution:

  • Providing speed-of-thought response times to complex analytical queries
  • Presenting data multi-dimensionally and letting users select which dimensions and measures to explore (e.g. sales by region, sales by time period, etc.)
  • Making it easy for users to freely explore business information by interactively drilling into and cross-tabulating data
  • Offering complete integration with other products in the Pentaho BI Suite
  • Providing a user-friendly design tool (Schema Workbench) for creating OLAP schemas
  • Providing an Aggregation Designer tool for improving the performance of ROLAP queries
  • Supporting all popular open source and proprietary databases, as it is based on a ROLAP architecture
  • Supporting both web-based and XLS-based access
  • Building on a J2EE architecture, with support for both JDBC and JNDI connectivity
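
To make the ROLAP idea above more concrete, here is a minimal sketch in Python of how a multi-dimensional question such as "sales by region" can be answered with a SQL GROUP BY against a relational fact table, and how a precomputed summary table can stand in for repeated aggregation. It is only an illustration of the concept and not Mondrian's implementation; the table name sales_fact, its columns, and the helper functions are hypothetical.

```python
import sqlite3

# Hypothetical fact rows: (region, quarter, amount). Names are illustrative only.
ROWS = [
    ("North", "2023-Q1", 100.0),
    ("North", "2023-Q2", 150.0),
    ("South", "2023-Q1", 80.0),
    ("South", "2023-Q2", 120.0),
]

# Dimensions a caller may group by; also guards the SQL string below.
ALLOWED_DIMENSIONS = {"region", "quarter"}


def build_connection():
    """Create an in-memory database holding the hypothetical fact table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales_fact (region TEXT, quarter TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales_fact VALUES (?, ?, ?)", ROWS)
    return conn


def total_by(conn, dimension):
    """Return {member: summed amount} for one dimension via SQL GROUP BY."""
    if dimension not in ALLOWED_DIMENSIONS:
        raise ValueError(f"unknown dimension: {dimension}")
    sql = (
        f"SELECT {dimension}, SUM(amount) FROM sales_fact "
        f"GROUP BY {dimension} ORDER BY {dimension}"
    )
    return dict(conn.execute(sql).fetchall())


def build_aggregate_table(conn, dimension):
    """Precompute a summary table so later queries can skip the fact table."""
    if dimension not in ALLOWED_DIMENSIONS:
        raise ValueError(f"unknown dimension: {dimension}")
    conn.execute(
        f"CREATE TABLE agg_{dimension} AS "
        f"SELECT {dimension}, SUM(amount) AS amount "
        f"FROM sales_fact GROUP BY {dimension}"
    )
```

Calling total_by(conn, "region") and then total_by(conn, "quarter") on the same connection mirrors a user switching the dimension being explored. The summary table built by build_aggregate_table is similar in spirit to the aggregate tables that a tool like Aggregation Designer is meant to help define.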




Pentaho Dashboards
The Pentaho Dashboards solution provides the following key features:

  • Rich, interactive displays including Adobe Flash-based visualizations so that business users can immediately see which business metrics are on track, and which need attention
  • Self-service dashboard designer that lets business users easily create personalized dashboards with zero training
  • Integration with Pentaho Reporting and Pentaho Analysis so that users can drill to underlying reports and analysis to understand what factors are contributing to good or bad performance
  • Portal integration to make it easy to deliver relevant business metrics to large numbers of users, seamlessly integrated into their application
  • Integrated alerting to continuously monitor for exceptions and notify users to take action
  • J2EE-based architecture and AJAX provide scalability, portability, integration, and ease of use



Pentaho Data Integration (Kettle)
Pentaho Data Integration is a powerful, metadata-driven ETL tool designed to bridge the gap between business and IT. Pentaho Data Integration provides the following key features:

  • Rich transformation library with over 100 out-of-the-box mapping objects
  • Broad data source support including packaged applications, over 30 open source and proprietary database platforms, flat files, Excel documents, and more
  • Advanced data warehousing support for Slowly Changing and Junk Dimensions
  • Proven enterprise-class performance and scalability
  • Integration with the Pentaho BI Suite for Enterprise Information Integration (EII), advanced scheduling, and process integration
  • 100% metadata driven
  • 100% Java, with cross-platform support
  • Extensible architecture that makes it easy to develop custom connectors for legacy data sources
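
The Slowly Changing Dimension support mentioned above can be illustrated with a small sketch. The Python below shows one common approach, usually called Type 2, in which a changed attribute closes the current dimension row and appends a new version so that history is preserved. This is a simplified illustration and not how Pentaho Data Integration implements it; the function apply_scd2 and the row fields (customer_id, city, valid_from, valid_to, current) are hypothetical.

```python
from datetime import date


def apply_scd2(dimension, incoming, today):
    """Apply a Type 2 slowly changing dimension update in place.

    dimension: list of dict rows with keys customer_id, city,
               valid_from, valid_to, current (all names hypothetical).
    incoming:  dict with customer_id and the latest city value.
    today:     date used to close the old row and open the new one.
    """
    for row in dimension:
        if row["customer_id"] == incoming["customer_id"] and row["current"]:
            if row["city"] == incoming["city"]:
                # Attribute unchanged: keep the current row as it is.
                return dimension
            # Attribute changed: close the current version.
            row["valid_to"] = today
            row["current"] = False
            break

    # Append a new current version (also covers brand-new customers).
    dimension.append(
        {
            "customer_id": incoming["customer_id"],
            "city": incoming["city"],
            "valid_from": today,
            "valid_to": None,
            "current": True,
        }
    )
    return dimension
```

For example, if a customer stored with city "Paris" arrives with city "Lyon", the "Paris" row is closed with today's date and a new current "Lyon" row is added, so reports can still attribute older facts to the earlier value.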


Pentaho Data Mining
Pentaho Data Mining is based on the Weka project and provides powerful data mining capabilities. Some of the key features of Pentaho Data Mining are as follows:

  • Comprehensive set of machine learning algorithms from the Weka project including clustering, segmentation, decision trees, random forests, neural networks, and principal component analysis.
  • Integration with Pentaho Data Integration to automate the process of transforming data into the format the data mining engine needs.
  • Algorithms can either be applied directly to a dataset or called from Java code.
  • Output can be viewed graphically, interacted with programmatically, or used as a data source for reports, further analysis, and other processes.
  • Filters are provided for discretization, normalization, re-sampling, attribute selection, and transforming and combining attributes.
  • Classifiers provide models for predicting nominal or numeric quantities. Learning schemes include decision trees and lists, instance-based classifiers, support vector machines, multi-layer perceptrons, logistic regression, Bayes’ nets, and other advanced techniques.
  • The data-mining engine is also well suited for developing new machine learning schemes, enabling customers to incorporate their own models.
  • Inputs and outputs can be controlled programmatically, enabling developers to create completely custom solutions using the components provided.
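
As a rough illustration of what the normalization and discretization filters listed above do to a numeric attribute, the following Python sketch implements min-max normalization and equal-width binning. It does not use or reproduce Weka's API; the function names normalize and discretize are hypothetical and the logic is a simplified assumption of the general technique.

```python
def normalize(values):
    """Rescale numeric values to the range [0, 1] (min-max normalization)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: map everything to 0.0 to avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def discretize(values, bins):
    """Assign each value to one of `bins` equal-width intervals (0-based index)."""
    if bins < 1:
        raise ValueError("bins must be at least 1")
    lo, hi = min(values), max(values)
    if hi == lo:
        # A single distinct value falls into the first bin.
        return [0 for _ in values]
    width = (hi - lo) / bins
    # The maximum value would land one past the last bin, so clamp it.
    return [min(int((v - lo) / width), bins - 1) for v in values]
```

Normalization puts attributes with different scales on a common footing, while discretization turns a numeric attribute into a nominal one, which some learning schemes require as input.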


For more information, please visit www.pentaho.com.