University of Tasmania
Browse

File(s) under permanent embargo

Recommending Environmental Big Data Using Semantically Guided Machine Learning

chapter
posted on 2023-05-24, 04:37 authored by Dutta, R, Morshed, A, Jagannath Aryal
In information technology, Big Data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data-processing applications. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as compared with separate smaller sets with the same total amount of data. Scientists regularly encounter limitations due to large data sets in many areas, including meteorology, genetics, complex physics simulations, and environmental research. Wireless technology-based automated data gathering from the large environmental sensor networks have increased the quantity of sensor data available for analysis and sensor informatics. Next-generation environmental monitoring, natural resource management, and agricultural decision support systems are becoming heavily dependent on very large scale multiple sensor network deployments, massive-scale accumulation, harmonization, web-based Big Data integration and interpretation of Big Data. With large amount of the data availability, the complexity of data has also increased hence regular maintenance of large-scale sensor are becoming a difficult challenge. Uncertainty factors in the environmental monitoring processes are more evident than before due to current technological transparency achieved by most recent advanced communication technologies. The other challenges include capture, storage, search, sharing, analysis, and visualization. Data availability from a particular environmental sensor web is often very limited and data quality is subsequently very poor. This practical limitation could be due to difficult geographical location of the sensor node or sensor station, extreme environmental conditions, communication network failure, and lastly technical failure of the sensor node. Data uncertainty from a sensor network makes the network unreliable and inefficient. This inefficiency leads to failure of natural resource management systems such as agricultural water resource management, weather forecast, crop management including irrigation scheduling and natural resource-based crop business model systems. The ultimate challenge in environmental forecasting and decision support systems, is to overcome the data uncertainty and make the derived output more accurate. It is evident that there is a need to capture and integrate environmental knowledge from various independent sources including sensor networks, individual sensory system, large-scale environmental simulation models, and historical environmental data for each of the independent sources). It is not good enough to produce efficient decision support system using a single data source. So there is an urgent requirement for on demand complementary knowledge integration where different sources of environmental sensor data could be used to complement each other automatically.

History

Publication title

Large Scale and Big Data: Processing and Management

Editors

S Sakr, MM Gaber

Pagination

463-494

ISBN

978-1-4665-8150-0

Department/School

School of Geography, Planning and Spatial Sciences

Publisher

CRC Press

Place of publication

Florida, USA

Extent

19

Rights statement

Copyright 2014 Taylor & Francis Group, LLC

Repository Status

  • Restricted

Socio-economic Objectives

Expanding knowledge in the environmental sciences

Usage metrics

    University Of Tasmania

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC