Top Banner
Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Computer Network Information Center, Chinese Academ Chinese Academ y of Sciences y of Sciences
12

Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Dec 14, 2015

Download

Documents

Moses Byrd
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Discovering the Fake Followers in the Micro-blogging via Machine Learning

Yi Shen Jianjun Yu

October 16, 2013

Computer Network Information Center, Chinese Academy of SciencesChinese Academy of Sciences

Page 2: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

2

Micro-blogging

Page 3: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

3

The Celebrities in Twitter

Page 4: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Purchasing Fake Followers

Page 5: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Fake followers Markets

Market Link Price For 1K Followers

intertwitter.com $9

solarank.com $6.95

purchase-twitter-followers.net $7.5

fakefollowerstwitter.com $20

Page 6: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

http://www.socialsellingu.com/fake-twitter-profiles-infographic/

39% of @facebook followers are fake 34% of @ladygaga followers are fake 31% of @justinbieber followers are fake 32% of @katyperry followers are fake 32% of @espn followers are fake33% of @britneyspears followers are fake 27% of @youtube followers are fake 

A Data Report

Page 7: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Troubles Caused by Fake Followers

Noise for Social Network analysis Privacy and Security Problem Spam Problem

Page 8: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Method of Detection

Binary Classification Problem Extract discriminative features Voting-SVM as the classifier

Page 9: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

How to get ground-truth data?

Purchase from different merchants Keep tracking them for a long period

Page 10: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

The Features for Classification

The Ratio of Followee Count and Follower Count (RFF)The Percentage of Bidirectional Friends (PBF)Average Repost Frequency of the Posts (ARF)Ratio of the Original Posts (ROP)Proportion of Nighttime Posts (PNP)Topic DiversityThe standard deviation of post-count(σpost).The general slope of post-count(gpost).The standard deviation of followee-count(σ followee)The decrease frequency of followee-count(DFfollowee).The standard deviation of follower-count (σ follower).

Page 11: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Result

Accuracy Precision Recall F1

98.1% 97.7% 96.6% 0.964

Page 12: Discovering the Fake Followers in the Micro-blogging via Machine Learning Yi Shen Jianjun Yu October 16, 2013 Chinese Academy of Sciences Computer Network.

Thank you!