教育资源为主的文档平台

当前位置: 查字典文档网> 所有文档分类> 研究报告> 信息产业> The Pulse of News in Social Media Forecasting Popularity

The Pulse of News in Social Media Forecasting Popularity

上传者:何存富
|
上传时间:2015-05-05
|
次下载

The Pulse of News in Social Media Forecasting Popularity



The Pulse of News in Social Media: Forecasting Popularity
Roja Bandari
?
Sitaram Asur
?
Bernardo Huberman
?
Abstract
News articles are extremely time sensitive by nature. There
is also intense competition among news items to propagate
as widely as possible. Hence, the task of predicting the pop-
ularity of news items on the social web is both interesting
and challenging. Prior research has dealt with predicting
eventual online popularity based on early popularity. It is
most desirable, however, to predict the popularity of items
prior to their release, fostering the possibility of appropriate
decision making to modify an article and the manner of its
publication. In this paper, we construct a multi-dimensional
feature space derived from properties of an article and eval-
uate the e?cacy of these features to serve as predictors of
online popularity. We examine both regression and classi?-
cation algorithms and demonstrate that despite randomness
in human behavior, it is possible to predict ranges of pop-
ularity on twitter with an overall 84% accuracy. Our study
also serves to illustrate the di?erences between traditionally
prominent sources and those immensely popular on the so-
cial web.
1 Introduction
News articles are very dynamic due to their relation to
continuously developing events that typically have short
lifespans. For a news article to be popular, it is essential
for it to propagate to a large number of readers within
a short time. Hence there exists a competition among
di?erent sources to generate content which is relevant
to a large subset of the population and becomes virally
popular.
Traditionally, news reporting and broadcasting has
been costly, which meant that large news agencies dom-
inated the competition. But the ease and low cost of on-
line content creation and sharing has recently changed
the traditional rules of competition for public attention.
News sources now concentrate a large portion of their
attention on online mediums where they can dissemi-
nate their news e?ectively and to a large population. It
is therefore common for almost all major news sources to
have active accounts in social media services like Twitter
to take advantage of the enormous reach these services
?
UCLA.
?
HP Labs.
?
HP Labs.
provide.
Due to the time-sensitive aspect and the intense
competition for attention, accurately estimating the
extent to which a news article will spread on the web
is extremely valuable to journalists, content providers,
advertisers, and news recommendation systems. This
is also important for activists and politicians who are
using the web increasingly more to in?uence public
opinion.
However, predicting online popularity of news arti-
cles is a challenging task. First, context outside the web
is often not readily accessible and elements such as local
and geographical conditions and various circumstances
that a?ect the population make this prediction di?cult.
Furthermore, network properties such as the structure
of social networks that are propagating the news, in?u-
ence variations among members, and interplay between
di?erent sections of the web add other layers of com-
plexity to this problem. Most signi?cantly, intuition
suggests that the content of an article must play a cru-
cial role in its popularity. Content that resonates with
a majority of the readers such as a major world-wide
event can be expected to garner wide attention while
speci?c content relevant only to a few may not be as
successful.
Given the complexity of the problem due to the
above mentioned factors, a growing number of recent
studies [1], [2], [3], [4], [5] make use of early measure-
ments of an item’s popularity to predict its future suc-
cess. In the present work we investigate a more di?cult
problem, which is prediction of social popularity with-
out using early popularity measurements, by instead
solely considering features of a news article prior to its
publication. We focus this work on observable features
in the content of an article as well as its source of publi-
cation. Our goal is to discover if any predictors relevant
only to the content exist and if it is possible to make a
reasonable forecast of the spread of an article based on
content featu res.
The news data for our study was collected from
Feedzilla
1
–a news feed aggregator– and measurements
of the spread are performed on Twitter
2
, an immensely
1
http://wendang.chazidian.com
2
http://wendang.chazidian.com

版权声明:此文档由查字典文档网用户提供,如用于商业用途请与作者联系,查字典文档网保持最终解释权!

下载文档

热门试卷

2016年四川省内江市中考化学试卷
广西钦州市高新区2017届高三11月月考政治试卷
浙江省湖州市2016-2017学年高一上学期期中考试政治试卷
浙江省湖州市2016-2017学年高二上学期期中考试政治试卷
辽宁省铁岭市协作体2017届高三上学期第三次联考政治试卷
广西钦州市钦州港区2016-2017学年高二11月月考政治试卷
广西钦州市钦州港区2017届高三11月月考政治试卷
广西钦州市钦州港区2016-2017学年高一11月月考政治试卷
广西钦州市高新区2016-2017学年高二11月月考政治试卷
广西钦州市高新区2016-2017学年高一11月月考政治试卷
山东省滨州市三校2017届第一学期阶段测试初三英语试题
四川省成都七中2017届高三一诊模拟考试文科综合试卷
2017届普通高等学校招生全国统一考试模拟试题(附答案)
重庆市永川中学高2017级上期12月月考语文试题
江西宜春三中2017届高三第一学期第二次月考文科综合试题
内蒙古赤峰二中2017届高三上学期第三次月考英语试题
2017年六年级(上)数学期末考试卷
2017人教版小学英语三年级上期末笔试题
江苏省常州西藏民族中学2016-2017学年九年级思想品德第一学期第二次阶段测试试卷
重庆市九龙坡区七校2016-2017学年上期八年级素质测查(二)语文学科试题卷
江苏省无锡市钱桥中学2016年12月八年级语文阶段性测试卷
江苏省无锡市钱桥中学2016-2017学年七年级英语12月阶段检测试卷
山东省邹城市第八中学2016-2017学年八年级12月物理第4章试题(无答案)
【人教版】河北省2015-2016学年度九年级上期末语文试题卷(附答案)
四川省简阳市阳安中学2016年12月高二月考英语试卷
四川省成都龙泉中学高三上学期2016年12月月考试题文科综合能力测试
安徽省滁州中学2016—2017学年度第一学期12月月考​高三英语试卷
山东省武城县第二中学2016.12高一年级上学期第二次月考历史试题(必修一第四、五单元)
福建省四地六校联考2016-2017学年上学期第三次月考高三化学试卷
甘肃省武威第二十三中学2016—2017学年度八年级第一学期12月月考生物试卷

网友关注

国家公务员考试行测题库:行测判断推理模拟题13
2019国考暑期行测题库:行测每日一练常识判断练习题07.27
2019国考暑期行测题库:行测每日一练判断推理练习题07.12
国家公务员考试行测题库:行测判断推理模拟题0719
2019国考暑期行测题库:行测每日一练判断推理练习题07.05
2019国考暑期行测题库:行测每日一练资料分析练习题08.02
2019国考暑期行测题库:行测每日一练数量关系练习题08.13
2019国家公务员行测模拟题:假言命题
国家公务员考试行测题库:行测判断推理模拟题10
国家公务员考试行测类比推理模拟题0717
2019国家公务员考试申论模拟题:中国建筑
2019国考暑期行测题库:行测每日一练言语理解练习题08.06
2019国家公务员考试行测题库:行测判断推理模拟题0705
2019国考暑期行测题库:行测每日一练常识判断练习题07.09
国家公务员考试行测题库:行测判断推理模拟题0712
2019国考暑期行测题库:行测每日一练判断推理练习题07.25
2019国考暑期申论每周一练模拟练习题:创新探索经验交流
国家公务员考试申论模拟题:大雪封堵作为交警,你如何处理?
2019国考暑期行测题库:行测每日一练判断推理练习题08.01
2019国考暑期行测题库:行测每日一练资料分析练习题07.06
国家公务员考试申论模拟题:单独两孩
2019国考暑期申论每周一练模拟练习题:数字经济
2019国考暑期行测题库:行测每日一练判断推理练习题08.07
国家公务员考试申论模拟题:应该如何提高网速
国家公务员考试行测题库:行测判断推理模拟题0723
2019国考暑期行测题库:行测每日一练言语理解练习题07.17
2019国家公务员申论模拟题:“老漂族”的烦恼
国家公务员考试行测题库:行测资料分析模拟题02
2018国家公务员考试申论真题答案解析
国家公务员考试行测题库:行测判断推理模拟题0731

网友关注视频

【部编】人教版语文七年级下册《逢入京使》优质课教学视频+PPT课件+教案,安徽省
冀教版小学数学二年级下册1
河南省名校课堂七年级下册英语第一课(2020年2月10日)
沪教版牛津小学英语(深圳用) 六年级下册 Unit 7
外研版英语七年级下册module1unit3名词性物主代词讲解
二次函数求实际问题中的最值_第一课时(特等奖)(冀教版九年级下册)_T144339
沪教版八年级下册数学练习册20.4(2)一次函数的应用2P8
外研版英语三起6年级下册(14版)Module3 Unit2
8.练习八_第一课时(特等奖)(苏教版三年级上册)_T142692
沪教版牛津小学英语(深圳用) 五年级下册 Unit 7
【获奖】科粤版初三九年级化学下册第七章7.3浓稀的表示
苏科版数学七年级下册7.2《探索平行线的性质》
8.对剪花样_第一课时(二等奖)(冀美版二年级上册)_T515402
外研版八年级英语下学期 Module3
沪教版牛津小学英语(深圳用) 四年级下册 Unit 7
冀教版英语四年级下册第二课
沪教版牛津小学英语(深圳用) 四年级下册 Unit 12
冀教版小学英语四年级下册Lesson2授课视频
沪教版牛津小学英语(深圳用) 四年级下册 Unit 2
【部编】人教版语文七年级下册《逢入京使》优质课教学视频+PPT课件+教案,辽宁省
七年级下册外研版英语M8U2reading
沪教版牛津小学英语(深圳用) 五年级下册 Unit 10
苏科版八年级数学下册7.2《统计图的选用》
人教版二年级下册数学
8 随形想象_第一课时(二等奖)(沪教版二年级上册)_T3786594
青岛版教材五年级下册第四单元(走进军营——方向与位置)用数对确定位置(一等奖)
冀教版小学数学二年级下册第二周第2课时《我们的测量》宝丰街小学庞志荣
【部编】人教版语文七年级下册《老山界》优质课教学视频+PPT课件+教案,安徽省
化学九年级下册全册同步 人教版 第22集 酸和碱的中和反应(一)
外研版英语三起5年级下册(14版)Module3 Unit2