eCite Digital Repository

Performance-based ontology matching: a data-parallel approach for an effectiveness-independent performance-gain in ontology matching

Citation

Amin, MB and Khan, WA and Lee, S and Kang, BH, Performance-based ontology matching: a data-parallel approach for an effectiveness-independent performance-gain in ontology matching, Applied Intelligence, 43, (2) pp. 356-385. ISSN 0924-669X (2015) [Refereed Article]

Copyright Statement

Copyright 2015 Springer Science+Business Media New York

DOI: doi:10.1007/s10489-015-0648-z

Abstract

Ontology matching is among the core techniques used for heterogeneity resolution by information and knowledge-based systems. However, due to the excess and ever-evolving nature of data, ontologies are becoming large-scale and complex; consequently, leading to performance bottlenecks during ontology matching. In this paper, we present our performance-based ontology matching system. Today’s desktop and cloud platforms are equipped with parallelism-enabled multicore processors. Our system benefits from this opportunity and provides effectiveness-independent data parallel ontology matching resolution over parallelism-enabled platforms. Our system decomposes complex ontologies into smaller, simpler, and scalable subsets depending upon the needs of matching algorithms. Matching process over these subsets is divided from granular to finer-level abstraction of independent matching requests, matching jobs, and matching tasks, running in parallel over parallelism-enabled platforms. Execution of matching algorithms is aligned for the minimization of the matching space during the matching process. We comprehensively evaluated our system over OAEI’s dataset of fourteen real world ontologies from diverse domains, having different sizes and complexities. We have executed twenty different matching tasks over parallelism-enabled desktop and Microsoft Azure public cloud platform. In a single-node desktop environment, our system provides an impressive performance speedup of 4.1, 5.0, and 4.9 times for medium, large, and very large-scale ontologies. In a single-node cloud environment, our system provides an impressive performance speedup of 5.9, 7.4, and 7.0 times for medium, large, and very large-scale ontologies. In a multi-node (3 nodes) environment, our system provides an impressive performance speedup of 15.16 and 21.51 times over desktop and cloud platforms respectively.

Item Details

Item Type:Refereed Article
Keywords:ontology matching, heterogeneity resolution, multithreading, parallel processing, parallel programming, semantic web
Research Division:Information and Computing Sciences
Research Group:Computer Software
Research Field:Computer System Architecture
Objective Division:Information and Communication Services
Objective Group:Computer Software and Services
Objective Field:Application Tools and System Utilities
Author:Kang, BH (Professor Byeong Kang)
ID Code:106685
Year Published:2015
Web of Science® Times Cited:2
Deposited By:Computing and Information Systems
Deposited On:2016-02-17
Last Modified:2017-11-13
Downloads:0

Repository Staff Only: item control page