Table of Contents
HgIS applies BI tools (Pentaho platform) and GIS for the groundwater domain. Technologies and tools used in HgIS are listed here.
Since Hitachi acquired Pentaho, the development of the Pentaho platform has stagnated, and the platform's website is confusing. The original author of PDI (Matt Casters) created Apache Hop, a fork of PDI that finally has elegant architecture and is being intensively developed. PDI transformations can be imported into it.
Pentaho platform is a set of business intelligence tools.
It contains the following components (download from SourceForge):
- Pentaho Data Integration (PDI, also known as Kettle)1) It is an ETL tool. It can read data from diverse databases, resources (e-mail, local computer, FTP, HTTP), and files (MS Excel, MS Access, CSV, TXT, ESRI SHP, XML, JSON, YAML, RSS, XBase, or ZIP). Official resource lists all steps of transformations and jobs. Geospatial data can be handled with PDI GIS Plugins. See more info in Pentaho Data Integration Cheat Sheet.
- Pentaho Metadata Editor – creates an abstract data layer. See a figure:
Adaptive Execution Layer (AEL) runs transformations not only locally or on the Pentaho Server, but also on Spark. Pentaho also supports cloud platforms like Amazon Web Services, Google Cloud Platform, or Microsoft Azure (Cloud agnostic analytic pipeline). Alternative to AEL is Apache Beam.
Pentaho platform is built on
PostgreSQL / PostGIS
Other GIS tools
For detailed information, go to the Czech version of this site.