Synopses & Reviews
Your one-stop resource for open source BI and data warehousing solutions
Pentaho is a full-featured, open source Business Intelligence suite that lets you build data warehouses and rich, powerful BI applications at a fraction of the cost of a proprietary solution. This book gets you up and running with Pentaho within minutes: right from the start you'll be running example reports, dashboards, and OLAP pivot tables while you learn about Pentaho concepts and architecture. Using a practical case study, you'll learn what dimensional modeling is and how to apply it to design a data warehouse. You'll create and populate your data warehouse with Pentaho data integration tools. Finally, you'll learn how to build your own BI applications on top of your data warehouse using Pentaho reporting, analysis, dashboarding, and data mining tools.
Understand important Pentaho concepts, including action sequences and the solution repository
Apply the key concepts of dimensional modeling and construct a data warehouse using star schemas
Use Pentaho data integration tools to build ETL applications
Explore advanced PDI features including remote execution and clustering
Design and deploy reports and charts using Pentaho Report Designer
Leverage OLAP and create interactive pivot tables with drill up/drill down using Pentaho Analysis Services
Concentrate and compact BI content for business users with comprehensive dashboards
Discover and explore patterns in your data using Pentaho data mining
The book covers all components and related products that make up the Pentaho BI Suite. For each component (and where applicable), installation, usage and maintenance are discussed and illustrated. Background theory is given as needed to provide context for those readers with no prior BI knowledge or experience.
When people have read this book they will have learned the following things: The components and products that form the Pentaho Business intelligence suite (and hows these products and components fulfill particular BI needs) How to install and configure Pentaho, and how to connect it to a database/datawarehouse How to design a datawarehouse using Pentaho and related open source tools How to build and load a datawarehouse with pentaho data integration/Kettle How to manually create JFree (pentaho reporting services) reports using direct SQL queries How to set up a metadata layer to allow ad-hoc and selfservice reporting (not involving direct SQL queries) How to create Mondrian (pentaho analysis services) cubes, and attach them to a JPivot cube browser How to deploy reports, cubes and metadata to the pentaho platform in order to distribute BI solutions to end-users How to set up scheduling, subscription and automatic distribution
Your all-in-one resource for using Pentaho with MySQL for Business Intelligence and Data Warehousing
Open-source Pentaho provides business intelligence (BI) and data warehousing solutions at a fraction of the cost of proprietary solutions. Now you can take advantage of Pentaho for your business needs with this practical guide written by two major participants in the Pentaho community.
The book covers all components of the Pentaho BI Suite. You'll learn to install, use, and maintain Pentaho-and find plenty of background discussion that will bring you thoroughly up to speed on BI and Pentaho concepts.
- Of all available open source BI products, Pentaho offers the most comprehensive toolset and is the fastest growing open source product suite
- Explains how to build and load a data warehouse with Pentaho Kettle for data integration/ETL, manually create JFree (pentaho reporting services) reports using direct SQL queries, and create Mondrian (Pentaho analysis services) cubes and attach them to a JPivot cube browser
- Review deploying reports, cubes and metadata to the Pentaho platform in order to distribute BI solutions to end-users
- Shows how to set up scheduling, subscription and automatic distribution
The companion Web site provides complete source code examples, sample data, and links to related resources.
About the Author
is an application developer focusing on open source Web technology, databases, and Business Intelligence. He is an active member of the MySQL and Pentaho communities, and you can follow his blog at http://rpbouman.blogspot.com/.
Jos van Dongen is a seasoned Business Intelligence professional and well-known author and presenter. He speaks regularly at conferences and seminars. You can find more information about Jos at http://www.tholis.com.
Table of Contents
Part I Getting Started with Pentaho.
Chapter 1 Quick Start: Pentaho Examples.
Chapter 2 Prerequisites.
Chapter 3 Server Installation and Configuration.
Chapter 4 The Pentaho BI Stack.
Part II Dimensional Modeling and Data Warehouse Design.
Chapter 5 Example Business Case: World Class Movies.
Chapter 6 Data Warehouse Primer.
Chapter 7 Modeling the Business Using Star Schemas.
Chapter 8 The Data Mart Design Process.
Part III ETL and Data Integration.
Chapter 9 Pentaho Data Integration Primer.
Chapter 10 Designing Pentaho Data Integration Solutions.
Chapter 11 Deploying Pentaho Data Integration Solutions.
Part IV Business Intelligence Applications.
Chapter 12 The Metadata Layer.
Chapter 13 Using The Pentaho Reporting Tools.
Chapter 14 Scheduling, Subscription, and Bursting.
Chapter 15 OLAP Solutions Using Pentaho Analysis Services.
Chapter 16 Data Mining with Weka.
Chapter 17 Building Dashboards.