The Pulse of News in Social Media Forecasting Popularity
上传者:何存富|上传时间:2015-05-05|密次下载
The Pulse of News in Social Media Forecasting Popularity
The Pulse of News in Social Media: Forecasting Popularity
Roja Bandari
?
Sitaram Asur
?
Bernardo Huberman
?
Abstract
News articles are extremely time sensitive by nature. There
is also intense competition among news items to propagate
as widely as possible. Hence, the task of predicting the pop-
ularity of news items on the social web is both interesting
and challenging. Prior research has dealt with predicting
eventual online popularity based on early popularity. It is
most desirable, however, to predict the popularity of items
prior to their release, fostering the possibility of appropriate
decision making to modify an article and the manner of its
publication. In this paper, we construct a multi-dimensional
feature space derived from properties of an article and eval-
uate the e?cacy of these features to serve as predictors of
online popularity. We examine both regression and classi?-
cation algorithms and demonstrate that despite randomness
in human behavior, it is possible to predict ranges of pop-
ularity on twitter with an overall 84% accuracy. Our study
also serves to illustrate the di?erences between traditionally
prominent sources and those immensely popular on the so-
cial web.
1 Introduction
News articles are very dynamic due to their relation to
continuously developing events that typically have short
lifespans. For a news article to be popular, it is essential
for it to propagate to a large number of readers within
a short time. Hence there exists a competition among
di?erent sources to generate content which is relevant
to a large subset of the population and becomes virally
popular.
Traditionally, news reporting and broadcasting has
been costly, which meant that large news agencies dom-
inated the competition. But the ease and low cost of on-
line content creation and sharing has recently changed
the traditional rules of competition for public attention.
News sources now concentrate a large portion of their
attention on online mediums where they can dissemi-
nate their news e?ectively and to a large population. It
is therefore common for almost all major news sources to
have active accounts in social media services like Twitter
to take advantage of the enormous reach these services
?
UCLA.
?
HP Labs.
?
HP Labs.
provide.
Due to the time-sensitive aspect and the intense
competition for attention, accurately estimating the
extent to which a news article will spread on the web
is extremely valuable to journalists, content providers,
advertisers, and news recommendation systems. This
is also important for activists and politicians who are
using the web increasingly more to in?uence public
opinion.
However, predicting online popularity of news arti-
cles is a challenging task. First, context outside the web
is often not readily accessible and elements such as local
and geographical conditions and various circumstances
that a?ect the population make this prediction di?cult.
Furthermore, network properties such as the structure
of social networks that are propagating the news, in?u-
ence variations among members, and interplay between
di?erent sections of the web add other layers of com-
plexity to this problem. Most signi?cantly, intuition
suggests that the content of an article must play a cru-
cial role in its popularity. Content that resonates with
a majority of the readers such as a major world-wide
event can be expected to garner wide attention while
speci?c content relevant only to a few may not be as
successful.
Given the complexity of the problem due to the
above mentioned factors, a growing number of recent
studies [1], [2], [3], [4], [5] make use of early measure-
ments of an item’s popularity to predict its future suc-
cess. In the present work we investigate a more di?cult
problem, which is prediction of social popularity with-
out using early popularity measurements, by instead
solely considering features of a news article prior to its
publication. We focus this work on observable features
in the content of an article as well as its source of publi-
cation. Our goal is to discover if any predictors relevant
only to the content exist and if it is possible to make a
reasonable forecast of the spread of an article based on
content featu
res.
The news data for our study was collected from
Feedzilla
1
–a news feed aggregator– and measurements
of the spread are performed on Twitter
2
, an immensely
1
http://wendang.chazidian.com
2
http://wendang.chazidian.com
下载文档
热门试卷
- 2016年四川省内江市中考化学试卷
- 广西钦州市高新区2017届高三11月月考政治试卷
- 浙江省湖州市2016-2017学年高一上学期期中考试政治试卷
- 浙江省湖州市2016-2017学年高二上学期期中考试政治试卷
- 辽宁省铁岭市协作体2017届高三上学期第三次联考政治试卷
- 广西钦州市钦州港区2016-2017学年高二11月月考政治试卷
- 广西钦州市钦州港区2017届高三11月月考政治试卷
- 广西钦州市钦州港区2016-2017学年高一11月月考政治试卷
- 广西钦州市高新区2016-2017学年高二11月月考政治试卷
- 广西钦州市高新区2016-2017学年高一11月月考政治试卷
- 山东省滨州市三校2017届第一学期阶段测试初三英语试题
- 四川省成都七中2017届高三一诊模拟考试文科综合试卷
- 2017届普通高等学校招生全国统一考试模拟试题(附答案)
- 重庆市永川中学高2017级上期12月月考语文试题
- 江西宜春三中2017届高三第一学期第二次月考文科综合试题
- 内蒙古赤峰二中2017届高三上学期第三次月考英语试题
- 2017年六年级(上)数学期末考试卷
- 2017人教版小学英语三年级上期末笔试题
- 江苏省常州西藏民族中学2016-2017学年九年级思想品德第一学期第二次阶段测试试卷
- 重庆市九龙坡区七校2016-2017学年上期八年级素质测查(二)语文学科试题卷
- 江苏省无锡市钱桥中学2016年12月八年级语文阶段性测试卷
- 江苏省无锡市钱桥中学2016-2017学年七年级英语12月阶段检测试卷
- 山东省邹城市第八中学2016-2017学年八年级12月物理第4章试题(无答案)
- 【人教版】河北省2015-2016学年度九年级上期末语文试题卷(附答案)
- 四川省简阳市阳安中学2016年12月高二月考英语试卷
- 四川省成都龙泉中学高三上学期2016年12月月考试题文科综合能力测试
- 安徽省滁州中学2016—2017学年度第一学期12月月考高三英语试卷
- 山东省武城县第二中学2016.12高一年级上学期第二次月考历史试题(必修一第四、五单元)
- 福建省四地六校联考2016-2017学年上学期第三次月考高三化学试卷
- 甘肃省武威第二十三中学2016—2017学年度八年级第一学期12月月考生物试卷
网友关注
- 国家公务员考试行测题库:行测判断推理模拟题13
- 2019国考暑期行测题库:行测每日一练常识判断练习题07.27
- 2019国考暑期行测题库:行测每日一练判断推理练习题07.12
- 国家公务员考试行测题库:行测判断推理模拟题0719
- 2019国考暑期行测题库:行测每日一练判断推理练习题07.05
- 2019国考暑期行测题库:行测每日一练资料分析练习题08.02
- 2019国考暑期行测题库:行测每日一练数量关系练习题08.13
- 2019国家公务员行测模拟题:假言命题
- 国家公务员考试行测题库:行测判断推理模拟题10
- 国家公务员考试行测类比推理模拟题0717
- 2019国家公务员考试申论模拟题:中国建筑
- 2019国考暑期行测题库:行测每日一练言语理解练习题08.06
- 2019国家公务员考试行测题库:行测判断推理模拟题0705
- 2019国考暑期行测题库:行测每日一练常识判断练习题07.09
- 国家公务员考试行测题库:行测判断推理模拟题0712
- 2019国考暑期行测题库:行测每日一练判断推理练习题07.25
- 2019国考暑期申论每周一练模拟练习题:创新探索经验交流
- 国家公务员考试申论模拟题:大雪封堵作为交警,你如何处理?
- 2019国考暑期行测题库:行测每日一练判断推理练习题08.01
- 2019国考暑期行测题库:行测每日一练资料分析练习题07.06
- 国家公务员考试申论模拟题:单独两孩
- 2019国考暑期申论每周一练模拟练习题:数字经济
- 2019国考暑期行测题库:行测每日一练判断推理练习题08.07
- 国家公务员考试申论模拟题:应该如何提高网速
- 国家公务员考试行测题库:行测判断推理模拟题0723
- 2019国考暑期行测题库:行测每日一练言语理解练习题07.17
- 2019国家公务员申论模拟题:“老漂族”的烦恼
- 国家公务员考试行测题库:行测资料分析模拟题02
- 2018国家公务员考试申论真题答案解析
- 国家公务员考试行测题库:行测判断推理模拟题0731
网友关注视频
- 【部编】人教版语文七年级下册《逢入京使》优质课教学视频+PPT课件+教案,安徽省
- 冀教版小学数学二年级下册1
- 河南省名校课堂七年级下册英语第一课(2020年2月10日)
- 沪教版牛津小学英语(深圳用) 六年级下册 Unit 7
- 外研版英语七年级下册module1unit3名词性物主代词讲解
- 二次函数求实际问题中的最值_第一课时(特等奖)(冀教版九年级下册)_T144339
- 沪教版八年级下册数学练习册20.4(2)一次函数的应用2P8
- 外研版英语三起6年级下册(14版)Module3 Unit2
- 8.练习八_第一课时(特等奖)(苏教版三年级上册)_T142692
- 沪教版牛津小学英语(深圳用) 五年级下册 Unit 7
- 【获奖】科粤版初三九年级化学下册第七章7.3浓稀的表示
- 苏科版数学七年级下册7.2《探索平行线的性质》
- 8.对剪花样_第一课时(二等奖)(冀美版二年级上册)_T515402
- 外研版八年级英语下学期 Module3
- 沪教版牛津小学英语(深圳用) 四年级下册 Unit 7
- 冀教版英语四年级下册第二课
- 沪教版牛津小学英语(深圳用) 四年级下册 Unit 12
- 冀教版小学英语四年级下册Lesson2授课视频
- 沪教版牛津小学英语(深圳用) 四年级下册 Unit 2
- 【部编】人教版语文七年级下册《逢入京使》优质课教学视频+PPT课件+教案,辽宁省
- 七年级下册外研版英语M8U2reading
- 沪教版牛津小学英语(深圳用) 五年级下册 Unit 10
- 苏科版八年级数学下册7.2《统计图的选用》
- 人教版二年级下册数学
- 8 随形想象_第一课时(二等奖)(沪教版二年级上册)_T3786594
- 青岛版教材五年级下册第四单元(走进军营——方向与位置)用数对确定位置(一等奖)
- 冀教版小学数学二年级下册第二周第2课时《我们的测量》宝丰街小学庞志荣
- 【部编】人教版语文七年级下册《老山界》优质课教学视频+PPT课件+教案,安徽省
- 化学九年级下册全册同步 人教版 第22集 酸和碱的中和反应(一)
- 外研版英语三起5年级下册(14版)Module3 Unit2
精品推荐
- 2016-2017学年高一语文人教版必修一+模块学业水平检测试题(含答案)
- 广西钦州市高新区2017届高三11月月考政治试卷
- 浙江省湖州市2016-2017学年高一上学期期中考试政治试卷
- 浙江省湖州市2016-2017学年高二上学期期中考试政治试卷
- 辽宁省铁岭市协作体2017届高三上学期第三次联考政治试卷
- 广西钦州市钦州港区2016-2017学年高二11月月考政治试卷
- 广西钦州市钦州港区2017届高三11月月考政治试卷
- 广西钦州市钦州港区2016-2017学年高一11月月考政治试卷
- 广西钦州市高新区2016-2017学年高二11月月考政治试卷
- 广西钦州市高新区2016-2017学年高一11月月考政治试卷
分类导航
- 互联网
- 电脑基础知识
- 计算机软件及应用
- 计算机硬件及网络
- 计算机应用/办公自动化
- .NET
- 数据结构与算法
- Java
- SEO
- C/C++资料
- linux/Unix相关
- 手机开发
- UML理论/建模
- 并行计算/云计算
- 嵌入式开发
- windows相关
- 软件工程
- 管理信息系统
- 开发文档
- 图形图像
- 网络与通信
- 网络信息安全
- 电子支付
- Labview
- matlab
- 网络资源
- Python
- Delphi/Perl
- 评测
- Flash/Flex
- CSS/Script
- 计算机原理
- PHP资料
- 数据挖掘与模式识别
- Web服务
- 数据库
- Visual Basic
- 电子商务
- 服务器
- 搜索引擎优化
- 存储
- 架构
- 行业软件
- 人工智能
- 计算机辅助设计
- 多媒体
- 软件测试
- 计算机硬件与维护
- 网站策划/UE
- 网页设计/UI
- 网吧管理