Core Concepts
Providing a federated database architecture using knowledge graphs to query heterogeneous data across distinct repositories.
Abstract
Modern applications face challenges managing diverse datasets with different models, requiring specific tools and techniques. In the oil reserves discovery scenario, workflows process raw data files independently, storing metadata in various data stores without direct relationships. The proposed polystore architecture aims to provide a seamless interface for users to access heterogeneous data stores by creating a global conceptual schema and local conceptual schemas for each external store. Provenance is used to link the consumed and generated data, allowing users to formulate queries based on the global schema transparently.
The architecture was implemented as a RESTful web service in a microservice approach, simulating an Oil & Gas industry case. It was compared against a relational multidatabase system based on foreign data wrappers, showing reduced query complexity and minimal increase in query processing time. The HKPoly service component diagram includes HKBase services for managing domain metadata and provenance data storage.
Stats
"The results demonstrated that the proposed architecture allows query writing two times less complex than the one written for the relational multidatabase system."
"Adding an excess of no more than 30% in query processing time."
Quotes
"A single data store to manage heterogeneous data using a common data model is not effective in such a scenario."
"The proposed architecture allows query writing two times less complex than the one written for the relational multidatabase system."