eCite Digital Repository

SemanticTwig: a semantic approach to optimize XML query processing


Bao, Z and Ling, TW and Lu, J and Chen, B, SemanticTwig: a semantic approach to optimize XML query processing, Database Systems for Advanced Applications, 19-21 March 2008, New Delhi, India, pp. 282-298. ISBN 978-3-540-78567-5 (2008) [Refereed Conference Paper]

Copyright Statement

Copyright 2008 Springer

DOI: 10.1007/978-3-540-78568-2_22


Twig pattern matching (TPM) is the core operation of XML query processing. Existing approaches rely on either efficient data structures or novel labeling/indexing schemes to reduce the intermediate result size, but none of them takes into account the rich semantic information residing in the XML document and the issued query. Moreover, to fulfill the semantics of an XPath/XQuery query, most of them require costly post-processing to eliminate redundant matches and group matching results. In this paper, we propose an innovative semantics-aware query optimization approach to overcome these limitations. In particular, we exploit the functional dependency derived from the given semantic information to stop query processing early; we distinguish the output and predicate nodes of a query, then propose a query breakup technique and build a query plan such that, for each distinct query output, we avoid in most cases finding redundant matches that have the same results as the first match. Both I/O and structural join costs are reduced, and far fewer intermediate results are produced. Experiments show the effectiveness of our optimization.
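The distinction between output and predicate nodes can be illustrated with a small sketch. This is not the paper's algorithm; it is a toy example under stated assumptions: the query is `//book[author]/title`, where `title` is the output node and `author` is a predicate node, and a hypothetical functional dependency says each `book` has at most one `title`. The document, the function names, and the counters are all invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, invented for this illustration.
DOC = """
<lib>
  <book><title>A</title><author>x</author><author>y</author><author>z</author></book>
  <book><title>B</title></book>
  <book><title>C</title><author>w</author><author>v</author></book>
</lib>
"""

def naive_matches(root):
    """Enumerate every (title, author) match, then deduplicate the titles."""
    matches = [
        (t.text, a.text)
        for book in root.iter("book")
        for t in book.findall("title")
        for a in book.findall("author")
    ]
    titles = []
    for title, _ in matches:  # post-processing step: drop redundant matches
        if title not in titles:
            titles.append(title)
    return titles, len(matches)

def early_stop_matches(root, title_is_unique=True):
    """Treat author as an existence check and stop at the first witness."""
    titles, witnesses = [], 0
    for book in root.iter("book"):
        # Predicate node: one author is enough, the rest are never visited.
        if book.find("author") is None:
            continue
        witnesses += 1
        for t in book.findall("title"):
            titles.append(t.text)
            # Assumed functional dependency book -> title: stop after the first.
            if title_is_unique:
                break
    return titles, witnesses

root = ET.fromstring(DOC)
print(naive_matches(root))       # (['A', 'C'], 5)
print(early_stop_matches(root))  # (['A', 'C'], 2)
```

Both functions return the same titles, but the naive version materializes five intermediate matches while the early-stopping version needs only one predicate witness per qualifying book.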

Item Details

Item Type: Refereed Conference Paper
Keywords: structured query processing, semi-structured data
Research Division: Information and Computing Sciences
Research Group: Data management and data science
Research Field: Data management and data science not elsewhere classified
Objective Division: Information and Communication Services
Objective Group: Information systems, technologies and services
Objective Field: Information systems, technologies and services not elsewhere classified
UTAS Author: Bao, Z (Dr Zhifeng Bao)
ID Code: 92215
Year Published: 2008
Web of Science® Times Cited: 2
Deposited By: Information and Communication Technology
Deposited On: 2014-06-10
Last Modified: 2015-02-12
