Understanding the Correlations between Social Attention and Topic Trends of Scientific Publications

Source: Journal of Data and Information Science
Purpose: We propose and apply a simplified nowcasting model to understand the correlations between social attention and topic trends of scientific publications.

Design/methodology/approach: First, topics are generated from the obesity corpus using the latent Dirichlet allocation (LDA) algorithm, and time series of keyword search trends are obtained from Google Trends. We then establish the structural time series model using data from January 2004 to December 2012, and evaluate the model using data from January 2013. We employ a state-space model to separate the non-regression components of an observational time series (i.e., the tendency and the seasonality), and apply the “spike and slab prior” and stepwise regression to analyze the correlations between the regression component and social media attention. The two parts are combined using Markov-chain Monte Carlo sampling techniques to obtain our results.

Findings: The results of our study show that (1) the number of publications on child obesity increases at a lower rate than that of diabetes publications; (2) the number of publications on a given topic may exhibit a relationship with the season or time of year; and (3) there exists a correlation between the number of publications on a given topic and its social media attention, i.e., the search frequency related to that topic as identified by Google Trends. We found that our model is also able to predict the number of publications related to a given topic.

Research limitations: First, we study correlation rather than causality between topics’ trends and social media. As a result, the relationships might not be robust, so we cannot predict the future in the long run. Second, we cannot identify the reasons or conditions that drive obesity topics to present such tendencies and seasonal patterns, so we might need to do a “field” study in the future. Third, we need to improve the efficiency of our model by finding more efficient variable selection models, because the stepwise regression method is time consuming, especially for a large number of variables.

Practical implications: This paper analyzes publication topic trends from three perspectives: tendency, seasonality, and correlation with social media attention, providing a new perspective for identifying and understanding topical themes in academic publications.

Originality/value: To the best of our knowledge, we are the first to apply the state-space model to examine the relationships between healthcare-related publications and social media, in order to investigate the relationships between a topic’s evolution and people’s search behavior in social media. This paper thus provides a new viewpoint in the correlation analysis area, and demonstrates the value of considering social media attention in the analysis of publication topic trends.
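
The decomposition described under Design/methodology/approach (tendency, seasonality, and a regression component tied to search attention) can be illustrated with a much simpler stand-in. The sketch below is not the paper's Bayesian structural time series model: it swaps the state-space model, the spike and slab prior, and the MCMC sampling for an ordinary least-squares fit with a deterministic linear trend and month-of-year dummies. All data are synthetic, and the names `design_matrix`, `search`, `counts`, and `true_beta`, as well as the 12-month hold-out length, are assumptions made only for illustration.

```python
import numpy as np

def design_matrix(n_months, search_index):
    """Columns: intercept, linear trend, 11 month-of-year dummies, search index."""
    t = np.arange(n_months)
    month = t % 12
    dummies = np.zeros((n_months, 11))
    for m in range(1, 12):              # month 0 (January) is the baseline
        dummies[month == m, m - 1] = 1.0
    return np.column_stack([np.ones(n_months), t, dummies, search_index])

# Synthetic data only (assumption): 108 training months, matching the span
# January 2004 to December 2012, plus a hypothetical 12-month hold-out.
rng = np.random.default_rng(0)
n_train, n_test = 108, 12
n = n_train + n_test
t = np.arange(n)

search = 50 + 10 * rng.standard_normal(n)   # stand-in for a Google Trends index
true_beta = 0.8                             # assumed effect of search attention
counts = (
    20 + 0.5 * t                            # tendency (trend)
    + 5 * np.sin(2 * np.pi * t / 12)        # seasonality with a 12-month period
    + true_beta * search                    # regression component
    + rng.standard_normal(n)                # observation noise
)

X = design_matrix(n, search)

# Fit on the training window only, then predict the held-out months.
coef, *_ = np.linalg.lstsq(X[:n_train], counts[:n_train], rcond=None)
pred = X[n_train:] @ coef

beta_hat = coef[-1]                         # estimated weight on the search index
mae = np.mean(np.abs(pred - counts[n_train:]))

print(f"estimated search coefficient: {beta_hat:.3f}")
print(f"hold-out mean absolute error: {mae:.3f}")
```

The last coefficient plays the role of the regression component's weight on social attention, while the trend and dummy coefficients stand in for the tendency and seasonality. A state-space formulation would instead let the trend and seasonal terms evolve over time, and the spike and slab prior would select among many candidate search terms rather than a single predictor.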