File(s) under permanent embargo
Performance-based ontology matching: a data-parallel approach for an effectiveness-independent performance-gain in ontology matching
journal contribution
posted on 2023-05-18, 17:07 authored by Muhammad Bilal AminMuhammad Bilal Amin, Khan, WA, Lee, S, Byeong KangByeong KangOntology matching is among the core techniques used for heterogeneity resolution by information and knowledge-based systems. However, due to the excess and ever-evolving nature of data, ontologies are becoming large-scale and complex; consequently, leading to performance bottlenecks during ontology matching. In this paper, we present our performance-based ontology matching system. Today’s desktop and cloud platforms are equipped with parallelism-enabled multicore processors. Our system benefits from this opportunity and provides effectiveness-independent data parallel ontology matching resolution over parallelism-enabled platforms. Our system decomposes complex ontologies into smaller, simpler, and scalable subsets depending upon the needs of matching algorithms. Matching process over these subsets is divided from granular to finer-level abstraction of independent matching requests, matching jobs, and matching tasks, running in parallel over parallelism-enabled platforms. Execution of matching algorithms is aligned for the minimization of the matching space during the matching process. We comprehensively evaluated our system over OAEI’s dataset of fourteen real world ontologies from diverse domains, having different sizes and complexities. We have executed twenty different matching tasks over parallelism-enabled desktop and Microsoft Azure public cloud platform. In a single-node desktop environment, our system provides an impressive performance speedup of 4.1, 5.0, and 4.9 times for medium, large, and very large-scale ontologies. In a single-node cloud environment, our system provides an impressive performance speedup of 5.9, 7.4, and 7.0 times for medium, large, and very large-scale ontologies. In a multi-node (3 nodes) environment, our system provides an impressive performance speedup of 15.16 and 21.51 times over desktop and cloud platforms respectively.
History
Publication title
Applied IntelligenceVolume
43Pagination
356-385ISSN
0924-669XDepartment/School
School of Information and Communication TechnologyPublisher
Kluwer Academic PublPlace of publication
Van Godewijckstraat 30, Dordrecht, Netherlands, 3311 GzRights statement
Copyright 2015 Springer Science+Business Media New YorkRepository Status
- Restricted