eCite Digital Repository

Chinese trending search terms popularity rank prediction

Citation

Han, SC and Liang, Y and Chung, H and Kim, H and Kang, BH, Chinese trending search terms popularity rank prediction, Information Technology and Management, 17, (2) pp. 133-139. ISSN 1385-951X (2016) [Refereed Article]

Copyright Statement

Copyright 2015 Springer Science+Business Media New York

DOI: 10.1007/s10799-015-0238-0

Abstract

Baidu, the most popular Chinese search engine, monitors what its users are currently searching for and publishes the top 50 search terms, called trending search terms, in descending order of popularity. This paper focuses on predicting the popularity ranking trends of these top trending search terms in Baidu. Data analysis identified two issues that could affect the accuracy of using the ranking data to predict the popularity of trending search terms. First, every trending term drops out of the top 50 list as its popularity declines, yet several terms reappear in the list after they have disappeared. To differentiate new distinct search terms from reappearances of old terms, we propose a term distinction model that uses the related news articles Baidu provides for each trending search term. Second, missing values must be handled for the periods when a term is outside the trending term list. To this end, we collected the top 50 trending search terms and their related news articles from Baidu hourly for 6 months (from 1st March 2013 to 31st August 2013). Based on the proposed model, we found an optimal disappearing interval of 9 h, and that using rank 51 for the missing values was the most successful. We conducted evaluations using 3 months of data (from 1st September 2013 to 30th November 2013), comparing four machine learning techniques to determine which most accurately predicts the popularity rank of trending search terms. The feed-forward neural network achieved the highest prediction accuracy, 78.81 %, and reached 85.55 % accuracy within an error range of 3.
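
The two preprocessing ideas and the error-range accuracy mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`split_episodes`, `fill_missing`, `accuracy_within`) and the hour-indexed data layout are hypothetical, and term distinction here relies only on the 9 h gap, whereas the paper's model also uses related news articles. Treating a gap strictly longer than 9 h as a new term, and reading the error range as an absolute rank difference, are both assumptions.

```python
from typing import Dict, List

MISSING_RANK = 51         # rank used for hours outside the top 50 (value from the abstract)
DISAPPEAR_INTERVAL_H = 9  # disappearing interval in hours (value from the abstract)


def split_episodes(observations: Dict[int, int],
                   max_gap: int = DISAPPEAR_INTERVAL_H) -> List[Dict[int, int]]:
    """Split {hour: rank} observations of one term into separate episodes.

    Assumption: an absence longer than max_gap hours means the reappearance
    is a new, distinct term; a shorter absence is the same term.
    """
    episodes: List[Dict[int, int]] = []
    current: Dict[int, int] = {}
    last_hour = None
    for hour in sorted(observations):
        if last_hour is not None and hour - last_hour > max_gap:
            episodes.append(current)
            current = {}
        current[hour] = observations[hour]
        last_hour = hour
    if current:
        episodes.append(current)
    return episodes


def fill_missing(episode: Dict[int, int], fill: int = MISSING_RANK) -> List[int]:
    """Return an hourly rank series, using `fill` for hours with no observation."""
    start, end = min(episode), max(episode)
    return [episode.get(hour, fill) for hour in range(start, end + 1)]


def accuracy_within(predicted: List[int], actual: List[int], tolerance: int = 0) -> float:
    """Fraction of predictions whose absolute rank error is at most `tolerance`."""
    hits = sum(abs(p - a) <= tolerance for p, a in zip(predicted, actual))
    return hits / len(actual)


# Hypothetical term seen at hours 0, 1 and 3, then again at hours 20 and 21.
observed = {0: 5, 1: 7, 3: 10, 20: 2, 21: 3}
episodes = split_episodes(observed)   # the 17 h gap starts a second episode
series = fill_missing(episodes[0])    # hour 2 is missing and receives rank 51
```

With this toy data, `series` is `[5, 7, 51, 10]`; a prediction of `[5, 8, 51, 14]` scores 0.5 for exact matches and 0.75 with `tolerance=3`.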

Item Details

Item Type: Refereed Article
Keywords: trending topics, Chinese search trends, Baidu, trending search terms
Research Division: Information and Computing Sciences
Research Group: Information Systems
Research Field: Information Systems not elsewhere classified
Objective Division: Information and Communication Services
Objective Group: Information Services
Objective Field: Information Services not elsewhere classified
Author: Han, SC (Ms Caren Han)
Author: Liang, Y (Mr Yulu Liang)
Author: Chung, H (Mr David Chung)
Author: Kang, BH (Professor Byeong Kang)
ID Code: 106775
Year Published: 2016
Deposited By: Computing and Information Systems
Deposited On: 2016-02-19
Last Modified: 2017-11-13
Downloads: 0