eCite Digital Repository

Semantics based intelligent search in large digital repositories using Hadoop MapReduce


Idris, M and Hussain, S and Ali, T and Kang, BH and Lee, S, Semantics based intelligent search in large digital repositories using Hadoop MapReduce, Lecture Notes in Computer Science 8867: Proceedings of the 8th International Conference on Ubiquitous Computing and Ambient Intelligence (UCAmI 2014), 2-5 December 2014, Belfast, UK, pp. 292-295. ISSN 0302-9743 (2014) [Refereed Conference Paper]

Copyright Statement

Copyright 2014 Springer International Publishing Switzerland

DOI: 10.1007/978-3-319-13102-3_48


Large digital repositories contain billions of documents in various formats, which makes it difficult to retrieve the desired information. Techniques are needed that are accurate and fast enough to find that information in the haystack of online digital repositories. On the one hand, keyword-based systems and techniques offer high recall and performance but low precision. On the other hand, semantics-based systems offer high precision and good recall, but their performance decreases as the data grows. Therefore, to improve both precision and performance, we propose a semantics-based search framework that uses Hadoop MapReduce to process data at large scale. We apply semantic techniques to extract the required information from digital documents, and we use the MapReduce programming model to apply these techniques. Applying semantic techniques through the MapReduce distributed model will result in high precision and good performance for user query results.
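The abstract does not describe the framework's internals, so the following is only a minimal sketch of the general idea, not the authors' method. It simulates the map, shuffle, and reduce phases in a single Python process rather than on Hadoop, and it stands in for "semantic techniques" with a tiny hand-written term-to-concept table. The names `CONCEPTS`, `map_document`, `reduce_concept`, `build_index`, and `search`, as well as the sample documents, are all hypothetical.

```python
from collections import defaultdict

# Hypothetical concept vocabulary (an assumption for illustration; the
# paper's actual semantic techniques are not given in the abstract).
CONCEPTS = {
    "car": "vehicle",
    "automobile": "vehicle",
    "truck": "vehicle",
    "physician": "doctor",
    "doctor": "doctor",
}


def map_document(doc_id, text):
    """Map phase: emit (concept, doc_id) for every recognised term."""
    for token in text.lower().split():
        concept = CONCEPTS.get(token.strip(".,;:!?"))
        if concept is not None:
            yield concept, doc_id


def shuffle(pairs):
    """Shuffle phase: group emitted values by key, as a framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_concept(concept, doc_ids):
    """Reduce phase: build a sorted, de-duplicated posting list."""
    return concept, sorted(set(doc_ids))


def build_index(documents):
    """Run map, shuffle, and reduce over a dict of doc_id -> text."""
    pairs = [pair
             for doc_id, text in documents.items()
             for pair in map_document(doc_id, text)]
    return dict(reduce_concept(k, v) for k, v in shuffle(pairs).items())


def search(index, query):
    """Resolve query terms to concepts and intersect their posting lists."""
    concepts = [CONCEPTS[t] for t in query.lower().split() if t in CONCEPTS]
    if not concepts:
        return []
    hits = set(index.get(concepts[0], []))
    for concept in concepts[1:]:
        hits &= set(index.get(concept, []))
    return sorted(hits)


# Invented sample documents, used only to exercise the sketch.
DOCS = {
    "d1": "The automobile was repaired.",
    "d2": "A physician bought a truck.",
    "d3": "Keyword search has high recall.",
}
INDEX = build_index(DOCS)
```

In this sketch, `search(INDEX, "car")` matches documents that mention "automobile" or "truck" even though neither contains the word "car", which is the kind of match a purely keyword-based lookup would miss. On a real cluster the map and reduce functions would run in parallel across document splits; here they run sequentially only to keep the example self-contained.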

Item Details

Item Type: Refereed Conference Paper
Research Division: Information and Computing Sciences
Research Group: Information Systems
Research Field: Information Systems not elsewhere classified
Objective Division: Information and Communication Services
Objective Group: Information Services
Objective Field: Information Services not elsewhere classified
Author: Kang, BH (Professor Byeong Kang)
ID Code: 98419
Year Published: 2014
Deposited By: Information and Communication Technology
Deposited On: 2015-02-13
Last Modified: 2018-01-16
