eCite Digital Repository

An automated model to score the privacy of unstructured information - social media case


Aghasian, E and Garg, S and Montgomery, J, An automated model to score the privacy of unstructured information - social media case, Computers and Security, 92 Article 101778. ISSN 0167-4048 (2020) [Refereed Article]

Copyright Statement

© 2020 Elsevier Ltd. All rights reserved.

DOI: doi:10.1016/j.cose.2020.101778


One of the common forms of data which is shared by online social media users is free-text formats including comments, posts, blogs and tweets. While users mostly share this unstructured data with their preferred social groups, this textual data may contain sensitive information such as their political or religious views, job details, their opinions and emotions and so on. Hence, sharing this unstructured data can escalate privacy risks and concerns for social media users. Analyses the privacy of unstructured data occurred from textual information comes with difficulties as understanding the calculation metrics are challenging. Although there are various studies on privacy evaluation from the extracted structured information from unstructured data, there are limited privacy scoring methods concentrating on the views of the individuals and cannot satisfy the privacy scoring of shared unstructured data in social networks appropriately. Here, in this paper, we propose an automated fuzzy-based model that can extract the privacy-related features as well as the related shared structured data and measure and warn users regarding the textual data privacy risks they have shared in online social platforms. The proposed model can facilitate mitigation actions for users’ free-format texts shared in various social networks. The evaluation of the study indicates that the proposed model can measure the users’ privacy risk in a more accurate manner compared with previously proposed methods and available commercialised software in the domain.

Item Details

Item Type:Refereed Article
Keywords:privacy, social networks, unstructured data, data privacy score, sentiment analysis, machine learning
Research Division:Information and Computing Sciences
Research Group:Computer vision and multimedia computation
Research Field:Pattern recognition
Objective Division:Expanding Knowledge
Objective Group:Expanding knowledge
Objective Field:Expanding knowledge in the information and computing sciences
UTAS Author:Aghasian, E (Mr Erfan Aghasian)
UTAS Author:Garg, S (Dr Saurabh Garg)
UTAS Author:Montgomery, J (Dr James Montgomery)
ID Code:137804
Year Published:2020
Web of Science® Times Cited:8
Deposited By:Information and Communication Technology
Deposited On:2020-03-04
Last Modified:2020-04-17

Repository Staff Only: item control page