File(s) under permanent embargo
Breaking out of the MisMatch trap
conference contribution
posted on 2023-05-23, 08:43 authored by Zeng, Y, Bao, Z, Ling, TW, Jagadish, HV, Li, GWhen users issue a query to a database, they have expectations about the results. If what they search for is unavailable in the database, the system will return an empty result or, worse, erroneous mismatch results.We call this problem the MisMatch Problem. In this paper, we solve the MisMatch problem in the context of XML keyword search. Our solution is based on two novel concepts that we introduce: Target Node Type and Distinguishability. Using these concepts, we develop a low-cost post-processing algorithm on the results of query evaluation to detect the MisMatch problem and generate helpful suggestions to users. Our approach has three noteworthy features: (1) for queries with the MisMatch problem, it generates the explanation, suggested queries and their sample results as the output to users, helping users judge whether the MisMatch problem is solved without reading all query results; (2) it is portable as it can work with any LCA-based matching semantics and is orthogonal to the choice of result retrieval method adopted; (3) it is lightweight in the way that it occupies a very small proportion of the whole query evaluation time. Extensive experiments on three real datasets verify the effectiveness, efficiency and scalability of our approach. A search engine called XClear has been built and is available at http://xclear.comp.nus.edu.sg.
History
Publication title
Proceedings of 30th IEEE International Conference on Data EngineeringEditors
IEEEPagination
1-12Department/School
School of Information and Communication TechnologyPublisher
IEEEPlace of publication
USAEvent title
30th IEEE International Conference on Data EngineeringEvent Venue
Chicago, USADate of Event (Start Date)
2014-03-31Date of Event (End Date)
2014-04-04Rights statement
Copyright 2014 IEEERepository Status
- Restricted