User-centric Cross-network Social Multimedia Computing

Size: px
Start display at page:

Download "User-centric Cross-network Social Multimedia Computing"

Transcription

1 Part III User-centric Cross-network Social Multimedia Computing Jitao Sang Multimedia Computing Group National Lab of Pattern Recognition, Institute of Automation Chinese Academy of Sciences

2 Big Data & Multimedia Big Data:any collection of data sets so large and complex that is difficult to process using traditional techniques. --- Wikipedia Enterprise & government data According to IDC, in 5 years, the data storage will reach 18EB (10 18 ), in fields of telecommunication, financial services, health care, public safety, transportation, education, etc. Internet data BAT(Baidu Alibaba Tencent)possess data in the scale of 10EB(10 18 ),and increase at a speed of PB per day. Personal data EMC2 estimated that an individual contributes to average 45 GB personal data (public service, credit record, video surveilliance, social media data, etc.) 2

3 Big Data & Multimedia Big Data:any collection of data sets so large and complex that is difficult to process using traditional techniques. --- Wikipedia 45% else 60% else else 70% Enterprise & government data Internet data Personal data Multimedia 3

4 Big Data & Social Multimedia Social Multimedia has significant big data 4V characteristics: YouTube: #[videos] > 2 billion; Facebook: #[pics] > 300 billion huge volume Volume Variety sources formats source:desktop/mobile, official/individual; format:traditional photo/video/audio, new media-pic tweet/audio pic/geo-tagged media YouTube: uploading 72 hour video per min. Skype: up to 1.4 million mins chat per min exponential growth Velocity Veracity low value density format: 1 hour video with few semantics; generation:open environment -> low quality, duplicate data; demands:personalized 4

5 Big Data & Social Multimedia Social Multimedia has significant big data characteristics: capacity in data storage YouTube: #[videos] > 2 billion; Facebook: #[pics] > 300 billion huge volume Volume Variety sources formats source:desktop/mobile, official/individual; format:traditional photo/video/audio, analysis new media-pic tweet/audio pic/geo-tagged media complexity in data efficiency in data capture & computing YouTube: uploading 72 hour video per min. Skype: up to 1.4 million mins chat per min exponential growth Velocity Veracity low value density format: 1 hour video with few semantics; data accuracy and quality generation:open environment -> low quality, duplicate data; demands:personalized 5

6 Variety in Social Multimedia Multiple Modalities received extensive attentions in the small data era Multiple Sources

7 Variety in Social Multimedia beyond multiple modalities: the heterogeneous data created same modality, different information. and consumed in various social media networks upload favorite playlist rate share comment +1 Content + Context. Multiple Sources

8 Variety in Social Multimedia beyond multiple modalities the heterogeneous data created and consumed in various social media networks Multiple Sources

9 Multisource in Social Multimedia Macro-level analysis: Characteristics of different social media networks. - degree distribution, clustering coefficiency [Ahn et al. 2007], - degree centrality, shortest path [Magnani and Rossi, 2011]; User activity patterns in macro-level. - user tagging patterns [Guo et al. 2009]; - user participation motivations [Choudhury and Sundaram, 2011]. Diffusion dynamics between social media networks. - cite and influence correlation [Leskovec et al. 2007]; - diffusion and evolution patterns [Rodriguez et al. 2013]; - jointly analyze network characteristics, user activity patterns, and diffusion dynamics [Kim et al. 2014]

10 Multisource in Social Multimedia Micro-level analysis and applications: Concept: different perspectives for the same concept/event, e.g., the distribution and evolution of social events among Twitter, Facebook, etc. Jasmine Revolution User: different domains involved by the same individual, e.g., unique user registers and participates into several social media websites.

11 User-centric Solution Heterogeneous data among different social media networks share the unique user space:

12 Cross-network User Account Collection Identical user account among different social media services. Google account Tencent account Users are voluntary to discover their accounts in multiple networks. User account linkage mining is a separated research topic.

13 User-centric Cross-network Dataset Identify cross-network user accounts for unique individual API API API API API API Use corresponding APIs to collect user data Cross-network Dataset

14 User-centric Cross-network Dataset ,000 registered users in About.me. # User Over 50% users share at least 4 accounts.

15 User-centric Cross-network Dataset

16 User-centric Cross-network Social Multimedia Computing User-centric Cross-network Social Multimedia Computing From Users: On Users: For Users: Cross-network Knowledge Association Mining Mining the correlation based on overlapped users perceptions. Cross-network User Modeling Integrating heterogeneous user data for comprehensive user understanding. Cross-network Collaborated Multimedia Applications Exploring user-centric crossnetwork characteristics to design collaborated solutions. 16

17 User-centric Cross-network Social Multimedia Computing User-centric Cross-network Social Multimedia Computing From Users: On Users: For Users: Cross-network Knowledge v Association Mining Mining the correlation based on overlapped users perceptions. Cross-network User Modeling Integrating heterogeneous user data for comprehensive user understanding. Cross-network Collaborated Multimedia Applications Exploring user-centric crossnetwork characteristics to design collaborated solutions. Ming Yan, Jitao Sang, and Changsheng Xu. Mining Cross-network Association for YouTube Video Promotion. ACM Multimedia,

18 Background: Heterogeneous Knowledge Association Heterogeneous Knowledge Association Cross-network Application UGC behavior, e.g., tweeting history Consuming pattern Targeted advertising Video browsing behavior Following social network Cross-network video promotion 18

19 Challenge: Cross-network Knowledge Gap No explicit association exists between different social media networks. The association is not necessarily semantic-based. Traditional semantic-based solution cannot address all scenarios. A data-driven cross-network association mining solution is needed. 19

20 Motivation: Overlapping User Collaboration Assumption: If abundant users heavily involve with pattern in social media network and pattern in network, it is very likely that pattern and pattern are closely associated. pattern pattern pattern pattern We refer to this associated pattern pairs as crowd-perceptive correlated.

21 Cross-network Knowledge Association Mining YouTube video Twitter followee Social Media Network A Topic Discovery Topics A Overlapping User-based Topic Association Cross-network Application Social Media Network B Topic Discovery Topics B 21

22 Twitter user friend network ( ) Following ACM Multimedia Bill Britney LDA Association Mining ( ) YouTube videos ( ) icorr-lda Aggregation ( ) Twitter user tweet stream Tweets Nic@nicing-Aug 28 #VoteObama, presidential debate Nic@nicing-Aug 28 I like LDA ( ) Association Mining Heterogeneous Topic Modeling Cross-network Topic Association 22

23 Cross-network Topic Association Mining

24 Cross-network Topic Association Mining 1 Transition Probabilitybased Association 2 Regression-based Association 3 Latent Attribute-based Association

25 Cross-network Topic Association Mining 1 Transition Probabilitybased Association = over all the overlapped users = ( ) ( Twitter Distribution Transfer Matrix YouTube Distribution the same overlapped user

26 Cross-network Topic Association Mining 1 Noisy user topic Transition Probabilitybased distributions will deteriorate the derived Association association. 2 Regression-based Association 3 Latent Attribute-based Association

27 Cross-network Topic Association Mining 2 Regression-based Association 1 norm: Lasso problem 2 norm: ridge regression problem min Overlapped user Twitter distribution 2 Overlapped user YouTube distribution 1 or 2

28 Cross-network Topic Association Mining 1 Noisy user topic Transition Probabilitybased distributions will deteriorate the derived Association association. 2 (1) Non-linear association Regression-based is not allowed. (2) Non-overlapped Associationusers are not exploited. 3 Latent Attribute-based Association

29 Cross-network Topic Association Mining 3 Latent Attribute-based Association shared representation coefficients : YouTube distribution Twitter distribution Age gender education location occupation Latent user attributes

30 Cross-network Knowledge Association Mining 3 Latent Attribute-based Association Not only coupled to unique user attributes over the overlapped users, but minimizing the reconstruction error over all the non-overlapped users. min,,,.. 1, 1,, : base vector in latent attribute space; : shared latent user attribute representation. + =, =,, ; =,, =,. =min = 30

31 Experiments: Cross-network Topic Association Quantitatively calculate Mean Absolute Error (MAE) over half of the overlapped users. prediction error over all topics in Twitter topic space = 31

32 Experiments: Association Mining between Twitter Tweet & YouTube Video digital devices US presidential election 32

33 Experiments: Association Mining between Twitter Tweet & YouTube Video horse riding social media marketing beer US presidential debate 33

34 Experiments: Association Mining between Twitter Network & YouTube Video game video semantic game- Visualization of discovered Twitter topics related correlated Berlin popular followees geographical correlated German TV show 34

35 Experiments: Association Mining between Twitter Network & YouTube Video famous actor war & political Australian official account cute animal 35

36 User-centric Cross-network Social Multimedia Computing User-centric Cross-network Social Multimedia Computing From Users: On Users: For Users: Cross-network Knowledge Association Mining Mining the correlation based on overlapped users perceptions. Cross-network User Modeling Integrating heterogeneous user data for comprehensive user understanding. Cross-network Collaborated Multimedia Applications Exploring user-centric crossnetwork characteristics to design collaborated solutions. Zhengyu Deng, Jitao Sang, and Changsheng Xu. Cross-network User Modeling with Local Social Regularization. Submitted for publication. 36

37 Background: User Data are Heterogeneous Heterogeneity is beyond modalities. tagged photo audio photo image tweet geo-tagged video 37

38 Background: User Data are Heterogeneous Heterogeneity is obvious within the same modality. upload favorite playlist rate share comment +1 Complex social interactions aggravate the heterogeneity. 38

39 Motivation: Integrating Heterogeneous Data How to unify different behaviors? Cross-network user behavior quantification upload favorite playlist rate share comment How to integrate social relation with behaviors? Collaborative filtering with local social regularization Latent user space 39

40 Cross-network user behavior quantification upload share playlist comment favorite rate +1 40

41 Cross-network user behavior quantification user favorite matrix upload matrix 1 Users Users 1 Users playlist matrix Users share matrix 1 1 Users Users 1 1 comment matrix Users 1 video 1 rate matrix Users 1 1 Users +1 matrix 1 Users

42 Cross-network user behavior quantification Multiple-kernel learning: Users share matrix 1 Users Users 1 1 favorite matrix Users 1 +1 matrix 1 Users

43 Cross-network user behavior quantification upload matrix 1 Users 1 share matrix User s 1 1 fused matrix favorite matrix Users 1 rate matrix playlist matrix Users Users 1 1 User s 1 +1 matrix 1 User s

44 Collaborative filtering with local social regularization A B C?? 44

45 Collaborative filtering with local social regularization?? 45

46 Collaborative filtering with local social regularization video space latent user space Social interaction on Google+ Derived user latent representation 46

47 Experiments: Evaluation on Video Recommendation +1 47

48 User-centric Cross-network Social Multimedia Computing User-centric Cross-network Social Multimedia Computing From Users: On Users: For Users: Cross-network Knowledge Association Mining Mining the correlation based on overlapped users perceptions. Cross-network User Modeling Integrating heterogeneous user data for comprehensive user understanding. Cross-network Collaborated Multimedia Applications Exploring user-centric crossnetwork characteristics to design collaborated solutions. Zhengyu Deng, Ming Yan, Jitao Sang, Changsheng Xu. Twitter is Faster: Personalized Time-aware Video Recommendation from Twitter to YouTube, TOMCCAP,

49 Challenge: Sparsity in Personalization Long-term interest Application: Real-time Personalized video search /recommendation Key problem: Dynamic Interest Modeling Challenge: Sparse user data in single network,difficult to capture the interest drifting Short-term interest Stable and on general topics, e.g., sports, politics Evolve with time, vulnerable to transient events, e.g., focuses on FIFA World Cup around July,

50 Motivation: Twitter is Faster Twitter has been recognized as an efficient platform for information sharing and spread. Virginia earthquake tweets heat map (08/23/2011) 50

51 Motivation: Twitter is Faster Twitter is faster than many social media services Twitter is faster than Wikipedia. Twitter is faster than Digg. Will this conclusion apply to micro-level? Is the time interval unique for different topics? 51

52 Data Analysis: Statistics The examined 22 trending events. The involved user number for each event. 52

53 Data Analysis: Cross-network Temporal User Behavior Analysis Twitter responses faster than YouTube in macro level Twitter YouTube #Users Wild-Card Round Division Round AFC and NFC championship Super bowl XLVII Elapsed days from Jan.1 st,

54 Data Analysis: Cross-network Temporal User Behavior Analysis Twitter responses faster than YouTube in individual level 54

55 Data Analysis: Cross-network Temporal User Behavior Analysis The cross-network temporal dynamic characteristic is topic-sensitive 55

56 Cross-network Collaborated Video Recommendation Data analysis conclusion: for specific user, his/her short-term interest change emerges first on Twitter Basic idea: exploit the Twitter behavior towards short-term interest modeling User tweets YouTube profile Short-term user modeling Long-term user modeling 56

57 Cross-network Collaborated Video Recommendation Dataset: evaluate on 10 of the 22 trending events. Ground-truth: user s favorite videos on YouTube. Baselines: only considering user interested topics on Twitter, or profiles on YouTube. 57

58 User-centric Cross-network Social Multimedia Computing Cross-network Knowledge Association Cross-network User Modeling Cross-network Collaborated Multimedia Applications Mining the correlation based on overlapped users perceptions. MM 2014 TMM under review Integrating heterogeneous user data for comprehensive user understanding. ICME 2013 TMM under review Exploring user-centric crossnetwork characteristics to design collaborated solutions. ICME 2013 ICIAP 2013, TOMCCAP TKDE under review

Functionality, Challenges and Architecture of Social Networks

Functionality, Challenges and Architecture of Social Networks Functionality, Challenges and Architecture of Social Networks INF 5370 Outline Social Network Services Functionality Business Model Current Architecture and Scalability Challenges Conclusion 1 Social Network

More information

Learning Graph-based POI Embedding for Location-based Recommendation

Learning Graph-based POI Embedding for Location-based Recommendation Learning Graph-based POI Embedding for Location-based Recommendation Min Xie, Hongzhi Yin, Hao Wang, Fanjiang Xu, Weitong Chen, Sen Wang Institute of Software, Chinese Academy of Sciences, China The University

More information

Search Computing: Business Areas, Research and Socio-Economic Challenges

Search Computing: Business Areas, Research and Socio-Economic Challenges Search Computing: Business Areas, Research and Socio-Economic Challenges Yiannis Kompatsiaris, Spiros Nikolopoulos, CERTH--ITI NEM SUMMIT Torino-Italy, 28th September 2011 Media Search Cluster Search Computing

More information

HYDRA Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling

HYDRA Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling HYDRA Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling Siyuan Liu Carnegie Mellon. University Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan. HYDRA: Large-scale

More information

Big Data - Some Words BIG DATA 8/31/2017. Introduction

Big Data - Some Words BIG DATA 8/31/2017. Introduction BIG DATA Introduction Big Data - Some Words Connectivity Social Medias Share information Interactivity People Business Data Data mining Text mining Business Intelligence 1 What is Big Data Big Data means

More information

3 Data, Data Mining. Chengkai Li

3 Data, Data Mining. Chengkai Li CSE4334/5334 Data Mining 3 Data, Data Mining Chengkai Li Department of Computer Science and Engineering University of Texas at Arlington Fall 2018 (Slides partly courtesy of Pang-Ning Tan, Michael Steinbach

More information

Extracting Information from Social Networks

Extracting Information from Social Networks Extracting Information from Social Networks Reminder: Social networks Catch-all term for social networking sites Facebook microblogging sites Twitter blog sites (for some purposes) 1 2 Ways we can use

More information

A study of Video Response Spam Detection on YouTube

A study of Video Response Spam Detection on YouTube A study of Video Response Spam Detection on YouTube Suman 1 and Vipin Arora 2 1 Research Scholar, Department of CSE, BITS, Bhiwani, Haryana (India) 2 Asst. Prof., Department of CSE, BITS, Bhiwani, Haryana

More information

Social Network Mining An Introduction

Social Network Mining An Introduction Social Network Mining An Introduction Jiawei Zhang Assistant Professor Florida State University Big Data A Questionnaire Please raise your hands, if you (1) use Facebook (2) use Instagram (3) use Snapchat

More information

TempWeb rd Temporal Web Analytics Workshop

TempWeb rd Temporal Web Analytics Workshop TempWeb 2013 3 rd Temporal Web Analytics Workshop Stuff happens continuously: exploring Web contents with temporal information Omar Alonso Microsoft 13 May 2013 Disclaimer The views, opinions, positions,

More information

Version 11

Version 11 The Big Challenges Networked and Electronic Media European Technology Platform The birth of a new sector www.nem-initiative.org Version 11 1. NEM IN THE WORLD The main objective of the Networked and Electronic

More information

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han Chapter 1. Social Media and Social Computing October 2012 Youn-Hee Han http://link.koreatech.ac.kr 1.1 Social Media A rapid development and change of the Web and the Internet Participatory web application

More information

Based on Big Data: Hype or Hallelujah? by Elena Baralis

Based on Big Data: Hype or Hallelujah? by Elena Baralis Based on Big Data: Hype or Hallelujah? by Elena Baralis http://dbdmg.polito.it/wordpress/wp-content/uploads/2010/12/bigdata_2015_2x.pdf 1 3 February 2010 Google detected flu outbreak two weeks ahead of

More information

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018 Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

Name, affiliation, contact info, photo, some general interest info.

Name, affiliation, contact info, photo, some general interest info. Name, affiliation, contact info, photo, some general interest info. Name: Thomas Plagemann Affiliation: University of Oslo Contact info: plageman@ifi.uio.no General interest: Multimedia systems and applications

More information

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype? Big data hype? Big Data: Hype or Hallelujah? Data Base and Data Mining Group of 2 Google Flu trends On the Internet February 2010 detected flu outbreak two weeks ahead of CDC data Nowcasting http://www.internetlivestats.com/

More information

Web Mining TEAM 8. Professor Anita Wasilewska CSE 634 Data Mining

Web Mining TEAM 8. Professor Anita Wasilewska CSE 634 Data Mining Web Mining TEAM 8 Paper - You Are What You Tweet : Analyzing Twitter for Public Health Authors : Paul, Michael J., and Mark Dredze. Conference : AAAI Publications, Fifth International AAAI Conference on

More information

Supervised Random Walks

Supervised Random Walks Supervised Random Walks Pawan Goyal CSE, IITKGP September 8, 2014 Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 1 / 17 Correlation Discovery by random walk Problem definition Estimate

More information

Database and Knowledge-Base Systems: Data Mining. Martin Ester

Database and Knowledge-Base Systems: Data Mining. Martin Ester Database and Knowledge-Base Systems: Data Mining Martin Ester Simon Fraser University School of Computing Science Graduate Course Spring 2006 CMPT 843, SFU, Martin Ester, 1-06 1 Introduction [Fayyad, Piatetsky-Shapiro

More information

Social media as a data source for research

Social media as a data source for research Social media as a data source for research Neli Blagus, Slavko Žitnik and Marko Bajec University of Ljubljana Faculty for computer and information science 17 April 2018 Neli Blagus et al. (FRI) Social

More information

Chapter 1. Introduction. 1.1 Content Quality - Motivation

Chapter 1. Introduction. 1.1 Content Quality - Motivation 2 Chapter 1 Introduction The meteoric rise of the Internet has led to an increase of Web 2.0 websites on the World Wide Web (WWW). Web 2.0 websites are a paradigm shift from the traditional websites and

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

REVIEW ON BIG DATA ANALYTICS AND HADOOP FRAMEWORK

REVIEW ON BIG DATA ANALYTICS AND HADOOP FRAMEWORK REVIEW ON BIG DATA ANALYTICS AND HADOOP FRAMEWORK 1 Dr.R.Kousalya, 2 T.Sindhupriya 1 Research Supervisor, Professor & Head, Department of Computer Applications, Dr.N.G.P Arts and Science College, Coimbatore

More information

Telling Experts from Spammers Expertise Ranking in Folksonomies

Telling Experts from Spammers Expertise Ranking in Folksonomies 32 nd Annual ACM SIGIR 09 Boston, USA, Jul 19-23 2009 Telling Experts from Spammers Expertise Ranking in Folksonomies Michael G. Noll (Albert) Ching-Man Au Yeung Christoph Meinel Nicholas Gibbins Nigel

More information

The Ultimate YouTube SEO Guide: Tips & Tricks on How to Increase Views and Rankings for your Online Videos

The Ultimate YouTube SEO Guide: Tips & Tricks on How to Increase Views and Rankings for your Online Videos The Ultimate YouTube SEO Guide: Tips & Tricks on How to Increase Views and Rankings for your Online Videos The Ultimate App Store Optimization Guide Summary 1. Introduction 2. Choose the right video topic

More information

Recommender Systems: Practical Aspects, Case Studies. Radek Pelánek

Recommender Systems: Practical Aspects, Case Studies. Radek Pelánek Recommender Systems: Practical Aspects, Case Studies Radek Pelánek 2017 This Lecture practical aspects : attacks, context, shared accounts,... case studies, illustrations of application illustration of

More information

Link Prediction for Social Network

Link Prediction for Social Network Link Prediction for Social Network Ning Lin Computer Science and Engineering University of California, San Diego Email: nil016@eng.ucsd.edu Abstract Friendship recommendation has become an important issue

More information

SOCIAL MEDIA. Charles Murphy

SOCIAL MEDIA. Charles Murphy SOCIAL MEDIA Charles Murphy Social Media Overview 1. Introduction 2. Social Media Areas Blogging Bookmarking Deals Location-based Music Photo sharing Video 3. The Fab Four FaceBook Google+ Linked In Twitter

More information

A data-driven framework for archiving and exploring social media data

A data-driven framework for archiving and exploring social media data A data-driven framework for archiving and exploring social media data Qunying Huang and Chen Xu Yongqi An, 20599957 Oct 18, 2016 Introduction Social media applications are widely deployed in various platforms

More information

G(B)enchmark GraphBench: Towards a Universal Graph Benchmark. Khaled Ammar M. Tamer Özsu

G(B)enchmark GraphBench: Towards a Universal Graph Benchmark. Khaled Ammar M. Tamer Özsu G(B)enchmark GraphBench: Towards a Universal Graph Benchmark Khaled Ammar M. Tamer Özsu Bioinformatics Software Engineering Social Network Gene Co-expression Protein Structure Program Flow Big Graphs o

More information

SCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR

SCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR SCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR P.SHENBAGAVALLI M.E., Research Scholar, Assistant professor/cse MPNMJ Engineering college Sspshenba2@gmail.com J.SARAVANAKUMAR B.Tech(IT)., PG

More information

Predict Topic Trend in Blogosphere

Predict Topic Trend in Blogosphere Predict Topic Trend in Blogosphere Jack Guo 05596882 jackguo@stanford.edu Abstract Graphical relationship among web pages has been used to rank their relative importance. In this paper, we introduce a

More information

THE AUSTRALIAN ONLINE LANDSCAPE REVIEW AUGUST 2015

THE AUSTRALIAN ONLINE LANDSCAPE REVIEW AUGUST 2015 THE AUSTRALIAN ONLINE LANDSCAPE REVIEW AUGUST 2015 STATE OF THE ONLINE LANDSCAPE August 2015 Welcome to the August 2015 edition of Nielsen s Online Landscape Review. The online landscape in August saw

More information

Analysis of Tweets: Donald Trump and sexual harassment allegations

Analysis of Tweets: Donald Trump and sexual harassment allegations Analysis of Tweets: Donald Trump and sexual harassment allegations MAIN CONCEPT Twitter historical database was used to search for suitable tweets. Search term of Trump and harassment was consequently

More information

Entity Information Management in Complex Networks

Entity Information Management in Complex Networks Entity Information Management in Complex Networks Yi Fang Department of Computer Science 250 N. University Street Purdue University, West Lafayette, IN 47906, USA fangy@cs.purdue.edu ABSTRACT Entity information

More information

Introduction to Data Mining and Data Analytics

Introduction to Data Mining and Data Analytics 1/28/2016 MIST.7060 Data Analytics 1 Introduction to Data Mining and Data Analytics What Are Data Mining and Data Analytics? Data mining is the process of discovering hidden patterns in data, where Patterns

More information

Extracting Information from Complex Networks

Extracting Information from Complex Networks Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform

More information

Available online at ScienceDirect. Procedia Technology 17 (2014 )

Available online at  ScienceDirect. Procedia Technology 17 (2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia Technology 17 (2014 ) 528 533 Conference on Electronics, Telecommunications and Computers CETC 2013 Social Network and Device Aware Personalized

More information

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai. UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index

More information

Crazy YouTube Stats. Seminar Topics. sales. According to Nielsen, YouTube reaches more US adults. YouTube is available on 350 million devices

Crazy YouTube Stats. Seminar Topics. sales. According to Nielsen, YouTube reaches more US adults. YouTube is available on 350 million devices Tom Gardocki Interstate Landscape Co., Inc Dirt Ninja on YouTube, Facebook and Instagram Seminar Topics 1. The power of YouTube 2. Gear to make videos 3. Video editing software 4. Time lapse videos 5.

More information

Dynamic Embeddings for User Profiling in Twitter

Dynamic Embeddings for User Profiling in Twitter Dynamic Embeddings for User Profiling in Twitter Shangsong Liang 1, Xiangliang Zhang 1, Zhaochun Ren 2, Evangelos Kanoulas 3 1 KAUST, Saudi Arabia 2 JD.com, China 3 University of Amsterdam, The Netherlands

More information

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Glossary A B C D E F G H I J K L M N O P Q R S T U V W X Y Z A App See Application Application An application (sometimes known as an app ) is a computer program which allows the user to perform a specific

More information

Sampling Large Graphs for Anticipatory Analysis

Sampling Large Graphs for Anticipatory Analysis Sampling Large Graphs for Anticipatory Analysis Lauren Edwards*, Luke Johnson, Maja Milosavljevic, Vijay Gadepally, Benjamin A. Miller IEEE High Performance Extreme Computing Conference September 16, 2015

More information

Sentiment analysis under temporal shift

Sentiment analysis under temporal shift Sentiment analysis under temporal shift Jan Lukes and Anders Søgaard Dpt. of Computer Science University of Copenhagen Copenhagen, Denmark smx262@alumni.ku.dk Abstract Sentiment analysis models often rely

More information

Network performance through the eyes of customers

Network performance through the eyes of customers ericsson.com/ mobility-report Network performance through the eyes of customers Extract from the Ericsson Mobility Report June 2018 2 Articles Network performance through the eyes of customers Understanding

More information

DATA MINING II - 1DL460

DATA MINING II - 1DL460 DATA MINING II - 1DL460 Spring 2016 A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,

More information

Sukjun Lim Strategic Planning, User interaction, Design research specialist

Sukjun Lim Strategic Planning, User interaction, Design research specialist Sukjun Lim Strategic Planning, User interaction, Design research specialist LINE MUSIC Strategic Planning and Interaction Design Convergence LINE MUSIC is an on-line music streaming service. Japan has

More information

Communications Made Easy. YouTube (Part 1)

Communications Made Easy. YouTube (Part 1) PKIDs Communications Made Easy YouTube (Part 1) *6 to mute, *7 to unmute Use Raise Hand feature to ask the presenter questions Use chat box for technical assistance For the audio portion of this call,

More information

BIG DATA SCIENTIST Certification. Big Data Scientist

BIG DATA SCIENTIST Certification. Big Data Scientist BIG DATA SCIENTIST Certification Big Data Scientist Big Data Science Professional (BDSCP) certifications are formal accreditations that prove proficiency in specific areas of Big Data. To obtain a certification,

More information

Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky

Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky The Chinese University of Hong Kong Abstract Husky is a distributed computing system, achieving outstanding

More information

#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks

#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks 1 #mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks Bang Hui Lim, Dongyuan Lu Tao Chen, Min-yen Kan Web Information Retrieval / Natural Language Processing Group 1. http://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/

More information

Prof. Dr. Christian Bizer

Prof. Dr. Christian Bizer STI Summit July 6 th, 2011, Riga, Latvia Global Data Integration and Global Data Mining Prof. Dr. Christian Bizer Freie Universität ität Berlin Germany Outline 1. Topology of the Web of Data What data

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

You are Who You Know and How You Behave: Attribute Inference Attacks via Users Social Friends and Behaviors

You are Who You Know and How You Behave: Attribute Inference Attacks via Users Social Friends and Behaviors You are Who You Know and How You Behave: Attribute Inference Attacks via Users Social Friends and Behaviors Neil Zhenqiang Gong Iowa State University Bin Liu Rutgers University 25 th USENIX Security Symposium,

More information

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4. Prof. James She

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4. Prof. James She ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4 Prof. James She james.she@ust.hk 1 Selected Works of Activity 4 2 Selected Works of Activity 4 3 Last lecture 4 Mid-term

More information

DS504/CS586: Big Data Analytics Data Pre-processing and Cleaning Prof. Yanhua Li

DS504/CS586: Big Data Analytics Data Pre-processing and Cleaning Prof. Yanhua Li Welcome to DS504/CS586: Big Data Analytics Data Pre-processing and Cleaning Prof. Yanhua Li Time: 6:00pm 8:50pm R Location: AK 232 Fall 2016 The Data Equation Oceans of Data Ocean Biodiversity Informatics,

More information

Recommendation System for Location-based Social Network CS224W Project Report

Recommendation System for Location-based Social Network CS224W Project Report Recommendation System for Location-based Social Network CS224W Project Report Group 42, Yiying Cheng, Yangru Fang, Yongqing Yuan 1 Introduction With the rapid development of mobile devices and wireless

More information

CPSC 340: Machine Learning and Data Mining. Multi-Dimensional Scaling Fall 2017

CPSC 340: Machine Learning and Data Mining. Multi-Dimensional Scaling Fall 2017 CPSC 340: Machine Learning and Data Mining Multi-Dimensional Scaling Fall 2017 Assignment 4: Admin 1 late day for tonight, 2 late days for Wednesday. Assignment 5: Due Monday of next week. Final: Details

More information

CLOUD REPORT LITTLE CHANGE IN GDPR-READINESS LEVELS WITH MAY 2018 DEADLINE LOOMING. 24.6% of cloud services rated high on GDPR-readiness

CLOUD REPORT LITTLE CHANGE IN GDPR-READINESS LEVELS WITH MAY 2018 DEADLINE LOOMING. 24.6% of cloud services rated high on GDPR-readiness SEPTEMBER 2017 CLOUD REPORT LITTLE CHANGE IN GDPR-READINESS LEVELS WITH MAY 2018 DEADLINE LOOMING 24.6% of cloud services rated high on GDPR-readiness REPORT HIGHLIGHTS 24.6 percent of cloud services are

More information

Website Privacy Policy

Website Privacy Policy Website Privacy Policy Last updated: May 12, 2016 This privacy policy (the Privacy Policy ) applies to this website and all services provided through this website, including any games or sweepstakes (collectively,

More information

Internet Applications. Q. What is Internet Explorer? Explain features of Internet Explorer.

Internet Applications. Q. What is Internet Explorer? Explain features of Internet Explorer. Internet Applications Q. What is Internet Explorer? Explain features of Internet Explorer. Internet explorer: Microsoft Internet Explorer is a computer program called a browser that helps you interact

More information

Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting

Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting Yaguang Li Joint work with Rose Yu, Cyrus Shahabi, Yan Liu Page 1 Introduction Traffic congesting is wasteful of time,

More information

Youtube Graph Network Model and Analysis Yonghyun Ro, Han Lee, Dennis Won

Youtube Graph Network Model and Analysis Yonghyun Ro, Han Lee, Dennis Won Youtube Graph Network Model and Analysis Yonghyun Ro, Han Lee, Dennis Won Introduction A countless number of contents gets posted on the YouTube everyday. YouTube keeps its competitiveness by maximizing

More information

Survey on Recommendation of Personalized Travel Sequence

Survey on Recommendation of Personalized Travel Sequence Survey on Recommendation of Personalized Travel Sequence Mayuri D. Aswale 1, Dr. S. C. Dharmadhikari 2 ME Student, Department of Information Technology, PICT, Pune, India 1 Head of Department, Department

More information

Syed Ismail Shah, PhD Chairman, PTA,

Syed Ismail Shah, PhD Chairman, PTA, Preparing for the Changing Rules of the Game Syed Ismail Shah, PhD Chairman, PTA, e-mail: ismail@pta.gov.pk Sequence Presentation Outline 2 Paradigm Shift in Telecom Sector-OTT Services Difference Between

More information

Who is behind terrorist events?

Who is behind terrorist events? Detecting and analysing terrorist-related online contents and financing activities Who is behind terrorist events? Deep analysis and a new model of knowledge representation for supporting Intelligence

More information

Demystifying movie ratings 224W Project Report. Amritha Raghunath Vignesh Ganapathi Subramanian

Demystifying movie ratings 224W Project Report. Amritha Raghunath Vignesh Ganapathi Subramanian Demystifying movie ratings 224W Project Report Amritha Raghunath (amrithar@stanford.edu) Vignesh Ganapathi Subramanian (vigansub@stanford.edu) 9 December, 2014 Introduction The past decade or so has seen

More information

Module 1: Internet Basics for Web Development (II)

Module 1: Internet Basics for Web Development (II) INTERNET & WEB APPLICATION DEVELOPMENT SWE 444 Fall Semester 2008-2009 (081) Module 1: Internet Basics for Web Development (II) Dr. El-Sayed El-Alfy Computer Science Department King Fahd University of

More information

D B M G Data Base and Data Mining Group of Politecnico di Torino

D B M G Data Base and Data Mining Group of Politecnico di Torino DataBase and Data Mining Group of Data mining fundamentals Data Base and Data Mining Group of Data analysis Most companies own huge databases containing operational data textual documents experiment results

More information

On User-centric QoE Prediction for VoIP & Video Streaming based on Machine-Learning

On User-centric QoE Prediction for VoIP & Video Streaming based on Machine-Learning UNIVERSITY OF CRETE On User-centric QoE Prediction for VoIP & Video Streaming based on Machine-Learning Michalis Katsarakis, Maria Plakia, Paul Charonyktakis & Maria Papadopouli University of Crete Foundation

More information

BOOK RECOMMENDATION INTERNET BROWSER SYSTEM (BRIB)

BOOK RECOMMENDATION INTERNET BROWSER SYSTEM (BRIB) BOOK RECOMMENDATION INTERNET BROWSER SYSTEM (BRIB) Rosnin Mustaffa (rosnin.mustaffa@mimos.my), Goon Wooi Kin (wk.goon@mimos.my), Muhammad Awis Jamaluddin Johari (awis.johari@mimos.my), Amru Yusrin Amruddin

More information

Mining Web Data. Lijun Zhang

Mining Web Data. Lijun Zhang Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems

More information

University of Saskatchewan

University of Saskatchewan Julita Vassileva University of Saskatchewan The Participative Web (Web 2.0) Technologies Applications Social issues Cultural Heritage Physical museums Virtual museums Future Museums = Encyclopaedias? Participative

More information

Link Analysis in Weibo

Link Analysis in Weibo Link Analysis in Weibo Liwen Sun AMPLab, EECS liwen@cs.berkeley.edu Di Wang Theory Group, EECS wangd@eecs.berkeley.edu Abstract With the widespread use of social network applications, online user behaviors,

More information

INTERNET OF THINGS CAPACITY BUILDING CHALLENGES OF BIG DATA AND PLANNED SOLUTIONS BY ITU. ICTP Workshop 17 March 2016

INTERNET OF THINGS CAPACITY BUILDING CHALLENGES OF BIG DATA AND PLANNED SOLUTIONS BY ITU. ICTP Workshop 17 March 2016 INTERNET OF THINGS CAPACITY BUILDING CHALLENGES OF BIG DATA AND PLANNED SOLUTIONS BY ITU ICTP Workshop 17 March 2016 Halima N Letamo Training and Development Officer International Telecommunication Union

More information

Review on Techniques of Collaborative Tagging

Review on Techniques of Collaborative Tagging Review on Techniques of Collaborative Tagging Ms. Benazeer S. Inamdar 1, Mrs. Gyankamal J. Chhajed 2 1 Student, M. E. Computer Engineering, VPCOE Baramati, Savitribai Phule Pune University, India benazeer.inamdar@gmail.com

More information

Charting the Progress of Smart City Development in Shanghai

Charting the Progress of Smart City Development in Shanghai Charting the Progress of Smart City Development in Shanghai Xueguo Wen Executive Vice President of Shanghai Academy 2017 TM Forum 1 C ONTENTS Current situation Experience and outlook Strategic cooperation

More information

COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA

COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia WWW 2010

More information

Music Recommendation with Implicit Feedback and Side Information

Music Recommendation with Implicit Feedback and Side Information Music Recommendation with Implicit Feedback and Side Information Shengbo Guo Yahoo! Labs shengbo@yahoo-inc.com Behrouz Behmardi Criteo b.behmardi@criteo.com Gary Chen Vobile gary.chen@vobileinc.com Abstract

More information

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017 Latent Space Model for Road Networks to Predict Time-Varying Traffic Presented by: Rob Fitzgerald Spring 2017 Definition of Latent https://en.oxforddictionaries.com/definition/latent Latent Space Model?

More information

Introduction to Data Science Day 2

Introduction to Data Science Day 2 Introduction to Data Science Day 2 Data Matters Summer workshop series in data science Sponsored by the Odum Institute, RENCI, and NCDS Thomas M. Carsey carsey@unc.edu Examples of Data Science Google Flu

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 3/6/2012 Jure Leskovec, Stanford CS246: Mining Massive Datasets, http://cs246.stanford.edu 2 In many data mining

More information

Mobile Payments. Moving at the Speed of Innovation. Eric Kuhn, Enterprise Digital Product Development April 2017

Mobile Payments. Moving at the Speed of Innovation. Eric Kuhn, Enterprise Digital Product Development April 2017 Mobile Payments Moving at the Speed of Innovation Eric Kuhn, Enterprise Digital Product Development April 2017 Agenda Mobile Payments Landscape Why Your Digital Products are Essential for Growth How to

More information

Mining Web Data. Lijun Zhang

Mining Web Data. Lijun Zhang Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems

More information

Accessing Web Archives

Accessing Web Archives Accessing Web Archives Web Science Course 2017 Helge Holzmann 05/16/2017 Helge Holzmann (holzmann@l3s.de) Not today s topic http://blog.archive.org/2016/09/19/the-internet-archive-turns-20/ 05/16/2017

More information

FACTS & FIGURES FEBRUARY 2014

FACTS & FIGURES FEBRUARY 2014 FEBRUARY 2014 These figures will be updated regularly. - Layar B.V. FEBRUARY 2014 WHAT THE MARKET SAYS ABOUT AUGMENTED REALITY 1. INTRODUCTION At Layar we measure everything that we do and our customers

More information

LECTURE 12. Web-Technology

LECTURE 12. Web-Technology LECTURE 12 Web-Technology Household issues Course evaluation on Caracal o https://caracal.uu.nl o Between 3-4-2018 and 29-4-2018 Assignment 3 deadline extension No lecture/practice on Friday 30/03/18 2

More information

UNCLASSIFIED. R-1 ITEM NOMENCLATURE PE D8Z: Data to Decisions Advanced Technology FY 2012 OCO

UNCLASSIFIED. R-1 ITEM NOMENCLATURE PE D8Z: Data to Decisions Advanced Technology FY 2012 OCO Exhibit R-2, RDT&E Budget Item Justification: PB 2012 Office of Secretary Of Defense DATE: February 2011 BA 3: Advanced Development (ATD) COST ($ in Millions) FY 2010 FY 2011 Base OCO Total FY 2013 FY

More information

Big Data Analytics: What is Big Data? Stony Brook University CSE545, Fall 2016 the inaugural edition

Big Data Analytics: What is Big Data? Stony Brook University CSE545, Fall 2016 the inaugural edition Big Data Analytics: What is Big Data? Stony Brook University CSE545, Fall 2016 the inaugural edition What s the BIG deal?! 2011 2011 2008 2010 2012 What s the BIG deal?! (Gartner Hype Cycle) What s the

More information

CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul

CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul 1 CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul Introduction Our problem is crawling a static social graph (snapshot). Given

More information

Highlights from the TV & video. consumer. trend report 2011

Highlights from the TV & video. consumer. trend report 2011 Highlights from the TV & video consumer trend report 2011 About this report TV has been an integral part of people s lives since the 1940s, providing consumers with news, information and entertainment.

More information

Web Science Your Business, too!

Web Science Your Business, too! Web Science & Technologies University of Koblenz Landau, Germany Web Science Your Business, too! Agenda What is Web Science? An explanation by analogy What do we do about it? Understanding collective effects

More information

WEBSITE & SOCIAL MEDIA PRIVACY POLICY

WEBSITE & SOCIAL MEDIA PRIVACY POLICY WEBSITE & SOCIAL MEDIA PRIVACY POLICY This website is the property of Girls on the Run of Northern Virginia ( Girls on the Run ), an Independent Council of Girls on the Run International, Inc. of Charlotte,

More information

IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 17, NO. 7, JULY Relational User Attribute Inference in Social Media

IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 17, NO. 7, JULY Relational User Attribute Inference in Social Media IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 17, NO. 7, JULY 2015 1031 Relational User Attribute Inference in Social Media Quan Fang, Jitao Sang, Changsheng Xu, Fellow, IEEE, and M. Shamim Hossain, Senior Member,

More information

A Study of the Correlation between the Spatial Attributes on Twitter

A Study of the Correlation between the Spatial Attributes on Twitter A Study of the Correlation between the Spatial Attributes on Twitter Bumsuk Lee, Byung-Yeon Hwang Dept. of Computer Science and Engineering, The Catholic University of Korea 3 Jibong-ro, Wonmi-gu, Bucheon-si,

More information

Web Service Recommendation Using Hybrid Approach

Web Service Recommendation Using Hybrid Approach e-issn 2455 1392 Volume 2 Issue 5, May 2016 pp. 648 653 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com Web Service Using Hybrid Approach Priyanshi Barod 1, M.S.Bhamare 2, Ruhi Patankar

More information

ALEXANDRIA. Temporal Retrieval, Exploration and Analytics in Web Archives. Wolfgang Nejdl. L3S Research Center Hannover, Germany

ALEXANDRIA. Temporal Retrieval, Exploration and Analytics in Web Archives. Wolfgang Nejdl. L3S Research Center Hannover, Germany ALEXANDRIA Temporal Retrieval, Exploration and Analytics in Web Archives Wolfgang Nejdl L3S Research Center Hannover, Germany Looking back: The Austrian Socialist Party and Europe What is missing? ALEXANDRIA

More information

Introduction to the Mathematics of Big Data. Philippe B. Laval

Introduction to the Mathematics of Big Data. Philippe B. Laval Introduction to the Mathematics of Big Data Philippe B. Laval Fall 2017 Introduction In recent years, Big Data has become more than just a buzz word. Every major field of science, engineering, business,

More information

Analysing Search Trends

Analysing Search Trends Data Mining in Business Intelligence 7 March 2013, Ben-Gurion University Analysing Search Trends Yair Shimshoni, Google R&D center, Tel-Aviv. shimsh@google.com Outline What are search trends? The Google

More information