Collecting social media data based on open APIs
|
|
- Griffin Miller
- 5 years ago
- Views:
Transcription
1 Collecting social media data based on open APIs Ye Li With Qunyan Zhang, Haixin Ma, Weining Qian, and Aoying Zhou
2 Outline Social Media Data Set Data Feature Data Model Data Collecting and Limitations Motivation Methods Applications on Social Media Data Set
3 Sina Weibo: Largest microblog service in China A Twitter-like service with rapid growth of active users
4 A short review of our work on Social media data collecting and analytics : Birth of Sina Weibo (Chinese Twitter) : Research focused on Sina Weibo : Continuous data crawling with a distributed crawler : Various work on social media data analytics : Microblog Cube: An online collective behavior analytics portal : Work focused on real time data crawling and On-line analysis : RCBA: A system on real-time social collective behavior analytics
5 The features of social media data: Large-scale, Rapid growth, Real-time Event Monitoring Data Analysis Social CRM Emotion Sensing
6 Why collecting social media data? Sensing the world Lots of hot events happen every day Who are talking about? What are they talking about? Why are they talking about them?
7 Data Model: Social Stream Global Stream: 1. High density 2. Rapid growth 3. Large scale User Stream: 1. High quantity 2. Quite large scale
8 Two ways to collect Weibo data Crawl the web page (too hard) Using open APIs
9 How to collect the Weibo data? Using open APIs Weibo provides a lot of APIs to developer: Status (the content of tweets) Comments (the comments of tweets) Users (the profile of Weibo users) Friendships (the relationships between Weibo users) Links: 微博 API
10 How to collect the Weibo data? Using open APIs The process of collect data: 1. Become a developer (or a partner with Sina Weibo) 2. Create your application 3. Get users authorization 4. Collect the data Links: 新手指南
11 The limitation of open APIs Frequency limitation Request n times per hour for an application Request m times per day for an application Request x times per hour for an IP address n, m, x are determined by the quality of application Proportion limit For a specific API, we can t collect the whole data For example, we can only collect the recent 2000 retweets with the repost API
12 What we have done on data collecting? Off-line collecting and analyitcs Distributed crawler Data include: social network, tweets, user profiles Pros and Cons Entire data set Out-of-date Off-line data set
13 Application of the data set One of the largest social media corpus in universities All tweets of 2 million users (before Dec. 2013) Continuously updating Billions of tweets Their profiles and social networks 1 Billion followship relationships 10+TB raw data
14 Application of the data set An online collective behavior analytics portal (193 events from to )
15 Next step we want to: collect the data for real-time analysis We have to deal with the following problems: What type of data we should collect? How to collect the data as much as possible? (API limitations) How to detect the hot event?
16 Framework Real-time Sampler: Collect data with specific strategies Event Monitor Monitor the daily hotspot on internet Data Analyzer Analyze the updated data in real-time Real-time Sampler Event Monitor Database Data Analyzer Online System
17 Real-time data sampler Multi-threads crawler Collect threads: Global Timeline Data Opinion Leader Data Hotspot Data Manage threads: Resource Dispatcher Data Filter Data Monitor Resource Dispatcher Global Timeline Sampler Opinion Leader Sampler Hotspot Sampler Database Data Filter Data Monitor
18 Sample Strategy Adaptive dispatch Time Hotspot Repeat filter Real-time monitoring Monitor new retweets Monitor potential hotspot Resource Dispatcher Global Timeline Sampler Opinion Leader Sampler Hotspot Sampler Database Data Filter Data Monitor
19 According the data feature in real world, we can: Dispatch the resource of data sampler Compare the data feature which we sample with the real data set
20 Data scale and effectiveness Data scale(per day): Tweets (700M+) and Retweets 3+ million tweets and 10+ million retweets 1+ million users Compare with the off-line data set The tweets of the Malaysian airline event
21 Event Monitor and Data Analyzer Event Monitor: Get the hot events information through the Internet Data Analyzer: Analyze the hot events with the event related tweets Event Monitor Input: Web resource Output: Event Information (Title, Introduction, News, Images, Videos )
22 Application of real-time data set RCBA: A system on real-time social collective behavior monitor and analytics
23 Event Time Series
24 User Analytics
25 Location Discussion
26 Word cloud and popular mood
27 Data Report
28 Summary Social media data Data crawling strategies Applications on the data set Off-line analysis On-line analysis
29 Thanks!
A Distributed Multi-facet Search Engine of Microblogs Based on SolrCloud
American Journal of Software Engineering, 2017, Vol. 5, No. 1, 20-26 Available online at http://pubs.sciepub.com/ajse/5/1/3 Science and Education Publishing DOI:10.12691/ajse-5-1-3 A Distributed Multi-facet
More informationOn Statistical Characteristics of Real-life Knowledge Graphs
On Statistical Characteristics of Real-life Knowledge Graphs Wenliang Cheng, Chengyu Wang, Bing Xiao, Weining Qian, Aoying Zhou Institute for Data Science and Engineering East China Normal University Shanghai,
More informationSOCIAL MEDIA. Charles Murphy
SOCIAL MEDIA Charles Murphy Social Media Overview 1. Introduction 2. Social Media Areas Blogging Bookmarking Deals Location-based Music Photo sharing Video 3. The Fab Four FaceBook Google+ Linked In Twitter
More informationAcolyte: An In-Memory Social Network Query System
Acolyte: An In-Memory Social Network Query System Ze Tang, Heng Lin, Kaiwei Li, Wentao Han, and Wenguang Chen Department of Computer Science and Technology, Tsinghua University Beijing 100084, China {tangz10,linheng11,lkw10,hwt04}@mails.tsinghua.edu.cn
More informationLink Analysis in Weibo
Link Analysis in Weibo Liwen Sun AMPLab, EECS liwen@cs.berkeley.edu Di Wang Theory Group, EECS wangd@eecs.berkeley.edu Abstract With the widespread use of social network applications, online user behaviors,
More informationAnalysis and Identification of Spamming Behaviors in Sina Weibo Microblog
Analysis and Identification of Spamming Behaviors in Sina Weibo Microblog Chengfeng Lin alex_lin@sjtu.edu.cn Yi Zhou zy_21th@sjtu.edu.cn Kai Chen kchen@sjtu.edu.cn Jianhua He Aston University j.he7@aston.ac.uk
More informationMicrosoft Perform Data Engineering on Microsoft Azure HDInsight.
Microsoft 70-775 Perform Data Engineering on Microsoft Azure HDInsight http://killexams.com/pass4sure/exam-detail/70-775 QUESTION: 30 You are building a security tracking solution in Apache Kafka to parse
More informationPrototype Report. Soccer Data Web Crawler. Team No. 02
Prototype Report Soccer Data Web Crawler Team No. 02 First Name Last Name Role Trupti Sardesai Project Manager Wenchen Tu Prototyper Subessware Selvameena Karunamoorthy System/Software Architect Pranshu
More informationMBB Robot Crawler Data Report in 2014H1
MBB Robot Crawler Data Report in 2014H1 Contents Contents 1 Introduction... 1 2 Characteristics and Trends of Web Services... 3 2.1 Increasing Size of Web Pages... 3 2.2 Increasing Average Number of Access
More informationzum.com Service Introduction
zum.com Service Introduction 2016 Index 01. Introduction of zum.com Convenient Main Page Convenient Search system Convenient News Convenient Shopping Convenient Mobile 02. The difference of zum.com Philosophy
More informationGraphCEP Real-Time Data Analytics Using Parallel Complex Event and Graph Processing
Institute of Parallel and Distributed Systems () Universitätsstraße 38 D-70569 Stuttgart GraphCEP Real-Time Data Analytics Using Parallel Complex Event and Graph Processing Ruben Mayer, Christian Mayer,
More informationAdvisor/Committee Members Dr. Chris Pollett Dr. Mark Stamp Dr. Soon Tee Teoh. By Vijeth Patil
Advisor/Committee Members Dr. Chris Pollett Dr. Mark Stamp Dr. Soon Tee Teoh By Vijeth Patil Motivation Project goal Background Yioop! Twitter RSS Modifications to Yioop! Test and Results Demo Conclusion
More informationThe Design of a Live Social Observatory System
The Design of a Live Social Observatory System Huanbo Luan 1,2, Juanzi Li 2, Maosong Sun 2, Tat-Seng Chua 1 1 School of Computing, National University of Singapore 2 Department of Computer Science and
More informationOUR TOP DATA SOURCES AND WHY THEY MATTER
OUR TOP DATA SOURCES AND WHY THEY MATTER TABLE OF CONTENTS INTRODUCTION 2 MAINSTREAM WEB 3 MAJOR SOCIAL NETWORKS 4 AUDIENCE DATA 5 VIDEO 6 FOREIGN SOCIAL NETWORKS 7 SYNTHESIO DATA COVERAGE 8 1 INTRODUCTION
More informationW W E K E Y P E R F O R M A N C E I N D I C AT O R S J U LY 2 7,
W W E K E Y P E R F O R M A N C E I N D I C AT O R S J U LY 2 7, 2 0 1 7 Average US Primetime Cable TV Ratings Raw, SmackDown and Primetime Cable TV Ratings Social Media Followers 2 (average, in millions)
More informationA global technology leader approaching $42B in sales with 57,000 people, and customers in 160+ countries LENOVO. ALL RIGHTS RESERVED
A global technology leader approaching $42B in sales with 57,000 people, and customers in 160+ countries. 2 Lenovo s Performance Lenovo WW PC Market Share 19.7% 2014 13.1% 2013 2012 9.6% 8.2% 2011 6.5%
More informationISSN: Page 74
Extraction and Analytics from Twitter Social Media with Pragmatic Evaluation of MySQL Database Abhijit Bandyopadhyay Teacher-in-Charge Computer Application Department Raniganj Institute of Computer and
More informationONE SOCIAL. A Writing Project. Presented to. The Faculty of the Department of Computer Science. San José State University
ONE SOCIAL A Writing Project Presented to The Faculty of the Department of Computer Science San José State University In Partial Fulfillment of the Requirements for the Degree Master of Computer Science
More informationJanuary, European Animation, VFX & Games Industry Strategies, Trends & Opportunities. digital.vector. Animation, VFX & Games Market Research
January, 2018 European Animation, VFX & Games Industry Strategies, Trends & Opportunities digital.vector Animation, VFX & Games Market Research Contents European Animation, VFX & Games Industry European
More informationSampling Large Graphs: Algorithms and Applications
Sampling Large Graphs: Algorithms and Applications Don Towsley Umass - Amherst Joint work with P.H. Wang, J.Z. Zhou, J.C.S. Lui, X. Guan Measuring, Analyzing Large Networks - large networks can be represented
More informationIntroduction to Twitter
Introduction to Twitter Objectives After completing this class you will be able to: Identify what Twitter is Create a Twitter Account Customize your Twitter profile and settings Follow other users on Twitter
More informationMessenger Wars 2. How Facebook climbed back to #1
Messenger Wars 2 How Facebook climbed back to #1 Source: Max Morse for TechCrunch, 2013 https://www.flickr.com/photos/techcrunch/9728625374/in/photolist- Since our hugely popular Messenger Wars: How Facebook
More informationA Fast and High Throughput SQL Query System for Big Data
A Fast and High Throughput SQL Query System for Big Data Feng Zhu, Jie Liu, and Lijie Xu Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, China 100190
More informationHYDRA Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling
HYDRA Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling Siyuan Liu Carnegie Mellon. University Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan. HYDRA: Large-scale
More informationJanuary, Asian Animation, VFX & Games Industry Strategies, Trends & Opportunities. digital.vector. Animation, VFX & Games Market Research
January, 2018 Asian Animation, VFX & Games Industry Strategies, Trends & Opportunities digital.vector Animation, VFX & Games Market Research Contents Asian Animation, VFX & Games Industry Asian Animation
More informationPrateek Mittal Princeton University
On Your Social Network De-anonymizablity: Quantification and Large Scale Evaluation with Seed Knowledge Shouling Ji, Weiqing Li, and Raheem Beyah Georgia Institute of Technology Neil Zhenqiang Gong University
More informationBoosting the Performance of FPGA-based Graph Processor using Hybrid Memory Cube: A Case for Breadth First Search
Boosting the Performance of FPGA-based Graph Processor using Hybrid Memory Cube: A Case for Breadth First Search Jialiang Zhang, Soroosh Khoram and Jing Li 1 Outline Background Big graph analytics Hybrid
More informationSocial Network Mining An Introduction
Social Network Mining An Introduction Jiawei Zhang Assistant Professor Florida State University Big Data A Questionnaire Please raise your hands, if you (1) use Facebook (2) use Instagram (3) use Snapchat
More informationDetect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning
Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning Jing Ma 1, Wei Gao 2*, Kam-Fai Wong 1,3 1 The Chinese University of Hong Kong 2 Victoria University of Wellington, New Zealand
More informationexam. Microsoft Perform Data Engineering on Microsoft Azure HDInsight. Version 1.0
70-775.exam Number: 70-775 Passing Score: 800 Time Limit: 120 min File Version: 1.0 Microsoft 70-775 Perform Data Engineering on Microsoft Azure HDInsight Version 1.0 Exam A QUESTION 1 You use YARN to
More informationImplementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky
Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky The Chinese University of Hong Kong Abstract Husky is a distributed computing system, achieving outstanding
More informationOracle Marketing Cloud Service Descriptions and Metrics January 25, 2018
Oracle Marketing Cloud Service Descriptions and Metrics January 25, 2018 Including: Infinity, Maximiser & Social Oracle Marketing Cloud Service Descriptions v012518 Page 1 of 12 Contents GLOSSARY AND METRICS...
More informationCharacterizing Smartphone Usage Patterns from Millions of Android Users
Characterizing Smartphone Usage Patterns from Millions of Android Users Huoran Li, Xuan Lu, Xuanzhe Liu Peking University Tao Xie UIUC Kaigui Bian Peking University Felix Xiaozhu Lin Purdue University
More informationAppendix A Additional Information
Appendix A Additional Information In this appendix, we provide more information on building practical applications using the techniques discussed in the chapters of this book. In Sect. A.1, we discuss
More informationExtracting Information from Social Networks
Extracting Information from Social Networks Reminder: Social networks Catch-all term for social networking sites Facebook microblogging sites Twitter blog sites (for some purposes) 1 2 Ways we can use
More informationA Novel Parallel Hierarchical Community Detection Method for Large Networks
A Novel Parallel Hierarchical Community Detection Method for Large Networks Ping Lu Shengmei Luo Lei Hu Yunlong Lin Junyang Zou Qiwei Zhong Kuangyan Zhu Jian Lu Qiao Wang Southeast University, School of
More informationConstant Contact. Responsyssy. VerticalResponse. Bronto. Monitor. Satisfaction
Contenders Leaders Marketing Cloud sy Scale Campaign aign Monitor Niche High Performers Satisfaction Email Marketing Products Products shown on the Grid for Email Marketing have received a minimum of 10
More informationScalable Streaming Analytics
Scalable Streaming Analytics KARTHIK RAMASAMY @karthikz TALK OUTLINE BEGIN I! II ( III b Overview Storm Overview Storm Internals IV Z V K Heron Operational Experiences END WHAT IS ANALYTICS? according
More informationEffective Detecting Microblog Spammers Using Big Data Fusion Algorithm
Int'l Conf. on Advances in Big Data Analytics ABDA'16 59 Effective Detecting Microblog Spammers Using Big Data Fusion Algorithm Yang Qiao 1, Huaping Zhang 1, Yanping Zhao 2, Yu Zhang 1, Yu Min 1 1 School
More informationA Novel deep learning models for Cold Start Product Recommendation using Micro blogging Information
A Novel deep learning models for Cold Start Product Recommendation using Micro blogging Information Chunchu.Harika, PG Scholar, Department of CSE, QIS College of Engineering and Technology, Ongole, Andhra
More informationExploring World s Interest in Paralympics through Twitter
Exploring World s Interest in Paralympics through Twitter Venkata Sravya Kalla, Thanaa Ghanem Information and Computer Science Department Metropolitan State University St. Paul, MN, 55106 cu9426bs@metrostate.edu,
More informationCreate an Account... 2 Setting up your account... 2 Send a Tweet... 4 Add Link... 4 Add Photo... 5 Delete a Tweet...
Twitter is a social networking site allowing users to post thoughts and ideas in 140 characters or less. http://www.twitter.com Create an Account... 2 Setting up your account... 2 Send a Tweet... 4 Add
More informationCentrality in networks
Centrality in networks Why study networks? Why study networks? Why study networks? Everything becomes connected Increasing need for people able to understand networks as people are getting more connected
More informationUSER GUIDE DASHBOARD OVERVIEW A STEP BY STEP GUIDE
USER GUIDE DASHBOARD OVERVIEW A STEP BY STEP GUIDE DASHBOARD LAYOUT Understanding the layout of your dashboard. This user guide discusses the layout and navigation of the dashboard after the setup process
More informationIntelligent Automation Incorporated
. 15400 Calhoun Drive, Suite 400 Rockville, Maryland, 20855 (301) 294-5200 http://www.i-a-i.com Enhancements for a Dynamic Data Warehousing and Mining System for Large-Scale Human Social Cultural Behavioral
More informationScraping and Preprocessing of Social Media Data
Preconference on Computational tools for text mining, processing and analysis. May 25th 2017, 9:00-17:00 (ICA San Diego) Scraping and Preprocessing of Social Media Data H A I LIANG, A SSISTANT PROFESSOR
More informationObserving the Evolution of Social Network on Weibo by Sampled Data
Observing the Evolution of Social Network on Weibo by Sampled Data Lu Ma, Gang Lu, Junxia Guo College of Information Science and Technology Beijing University of Chemical Technology Beijing, China sizheng@126.com
More informationLink Farming in Twitter
Link Farming in Twitter Pawan Goyal CSE, IITKGP Nov 11, 2016 Pawan Goyal (IIT Kharagpur) Link Farming in Twitter Nov 11, 2016 1 / 1 Reference Saptarshi Ghosh, Bimal Viswanath, Farshad Kooti, Naveen Kumar
More informationHow To Guide. ADENION GmbH Merkatorstraße Grevenbroich Germany Fon: Fax:
How To Guide ADENION GmbH Merkatorstraße 2 41515 Grevenbroich Germany Fon: +49 2181 7569-140 Fax: +49 2181 7569-199 The! Complete Guide to Social Media Sharing The following social media sharing guide
More informationA Review Paper on Big data & Hadoop
A Review Paper on Big data & Hadoop Rupali Jagadale MCA Department, Modern College of Engg. Modern College of Engginering Pune,India rupalijagadale02@gmail.com Pratibha Adkar MCA Department, Modern College
More informationspecial thanks
special thanks Who s more Lovable? members of congress CAR SALESPEOPLE lobbyists STOCKBROKERS LABOR UNION LEADERS lawyers marketers LESS LOVABLE MORE LOVABLE create marketing people love Content
More informationSoftware User's Manual
Software User's Manual Soccer Data Web Crawler First Name Last Name Role Trupti Sardesai Project Manager Wenchen Tu Prototyper Subessware Selvameena Karunamoorthy System/Software Architect Pranshu Kumar
More informationDS504/CS586: Big Data Analytics Data acquisition and measurement Prof. Yanhua Li
Welcome to DS504/CS586: Big Data Analytics Data acquisition and measurement Prof. Yanhua Li Time: 6:00pm 8:50pm THURSDAY Location: AK 232 Fall 2016 Data acquisition and measurement ia Sampling and Estimation
More informationNUSIS at TREC 2011 Microblog Track: Refining Query Results with Hashtags
NUSIS at TREC 2011 Microblog Track: Refining Query Results with Hashtags Hadi Amiri 1,, Yang Bao 2,, Anqi Cui 3,,*, Anindya Datta 2,, Fang Fang 2,, Xiaoying Xu 2, 1 Department of Computer Science, School
More informationSAND: A Fault-Tolerant Streaming Architecture for Network Traffic Analytics
1 SAND: A Fault-Tolerant Streaming Architecture for Network Traffic Analytics Qin Liu, John C.S. Lui 1 Cheng He, Lujia Pan, Wei Fan, Yunlong Shi 2 1 The Chinese University of Hong Kong 2 Huawei Noah s
More informationImproved Recommendation System Using Friend Relationship in SNS
Improved Recommendation System Using Friend Relationship in SNS Qing Liao 1,3( ), Bin Wang 1,3, Yanxiang Ling 1,3, Jingling Zhao 2, and Xinyue Qiu 2 1 School of Information and Communication Engineering,
More informationby SUBSPLASH ENGAGE YOUR AUDIENCE
by SUBSPLASH ENGAGE YOUR AUDIENCE POPULAR PACKAGES + PRICING Core Mobile Phone Plus Mobile Phone + Tablet Prime Mobile Phone + Tablet Mobile Apps Made available in these stores Made available in these
More informationRECOMMENDATIONS HOW TO ATTRACT CLIENTS TO ROBOFOREX
RECOMMENDATIONS HOW TO ATTRACT CLIENTS TO ROBOFOREX Your success as a partner directly depends on the number of attracted clients and their trading activity. You can hardly influence clients trading activity,
More informationMining the Boundaries of Social Networks: Crawling Facebook and Twitter for BlogIntelligence
Mining the Boundaries of Social Networks: Crawling Facebook and Twitter for BlogIntelligence Philipp Berger 1, Patrick Hennig 1, Thomas Klingbeil 2, Matthias Kohnen 2, Steffen Pade 2, and Christoph Meinel
More informationGoogle A-Z: What schools need to know about Google Apps for Education. Jill Judd WhippleHill
Google A-Z: What schools need to know about Google Apps for Education Jill Judd WhippleHill WhippleHill Communications company based in Bedford, NH. Specialize in private schools k-12 Software as a service
More informationA COMPREHENSIVE STUDY ON DATA EXTRACTION IN SINA WEIBO
A COMPREHENSIVE STUDY ON DATA EXTRACTION IN SINA WEIBO Xiao Cui 1 and Hao Shi 2 College of Engineering and Science, Victoria University, Melbourne, Australia ABSTRACT With the rapid growth of users in
More informationConsequences of Compromise: Characterizing Account Hijacking on Twitter
Consequences of Compromise: Characterizing Account Hijacking on Twitter Frank Li UC Berkeley With: Kurt Thomas (UCB Google), Chris Grier (UCB/ICSI Databricks), Vern Paxson (UCB/ICSI) Accounts on Social
More informationImplementation of Parallel CASINO Algorithm Based on MapReduce. Li Zhang a, Yijie Shi b
International Conference on Artificial Intelligence and Engineering Applications (AIEA 2016) Implementation of Parallel CASINO Algorithm Based on MapReduce Li Zhang a, Yijie Shi b State key laboratory
More informationA distributed framework for early trending topics detection on big social networks data threads
A distributed framework for early trending topics detection on big social networks data threads Athena Vakali, Kitmeridis Nikolaos, Panourgia Maria Informatics Department, Aristotle University, Thessaloniki,
More informationBest of SharePoint Sites and Communities
Best of SharePoint 2010 Sites and Communities Agenda Overview and SharePoint 2010 Basics SharePoint Foundation Sites Communities Business Needs IT Needs Microsoft SharePoint 2010 The business collaboration
More informationKony and TIBCO enable fast reliable Websockets Communication. Overview of the integration of WebSockets with TIBCO eftl and the Kony Platform
Kony and TIBCO enable fast reliable Websockets Communication Overview of the integration of WebSockets with TIBCO eftl and the Kony Platform Leading the way in enterprise mobility Founded in 2007 1400
More informationBig Data - Some Words BIG DATA 8/31/2017. Introduction
BIG DATA Introduction Big Data - Some Words Connectivity Social Medias Share information Interactivity People Business Data Data mining Text mining Business Intelligence 1 What is Big Data Big Data means
More informationPart 1. Learn how to collect streaming data from Twitter web API.
Tonight Part 1. Learn how to collect streaming data from Twitter web API. Part 2. Learn how to store the streaming data to files or a database so that you can use it later for analyze or representation
More informationAdvanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions
Advanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions XIAOCHEN HUANG Computer Science Dept. Worcester Polytechnic
More informationDATA MINING INTRO LECTURE. Introduction
DATA MINING INTRO LECTURE Introduction Instructors Aris (Aris Anagnostopoulos) Yiannis (Ioannis Chatzigiannakis) Evimaria (Evimaria Terzi) What is data mining? After years of data mining there is still
More informationDIGITAL MARKETING AND SOCIAL MEDIA COMMUNICATION FOR THE NSS EPALE CY TEAM AND STAKEHOLDERS
DIGITAL MARKETING AND SOCIAL MEDIA COMMUNICATION FOR THE NSS EPALE CY TEAM AND STAKEHOLDERS 6 και 7 Ιουνίου, 8:00 π.μ.-15:00 μ.μ. Παιδαγωγικό Ινστιτούτο Κύπρου, Αίθουσα Π206 THE SEMINAR WAS HELD DURING
More informationRule Based Classification on a Multi Node Scalable Hadoop Cluster
BITS Pilani K K Birla Goa Campus Rule Based Classification on a Multi Node Scalable Hadoop Cluster Shashank Gugnani Devavrat Khanolkar Tushar Bihany Nikhil Khadilkar Data Hypergrowth Reuters-21578: about
More informationYouTube & Vimeo. Differences, similarities which one is for you or should you be on both?
YouTube & Vimeo Differences, similarities which one is for you or should you be on both? Videos go online, but where? There are two BIG players, YouTube & Vimeo, and a lot of other smaller guys. Vimeo
More informationSystem and Software Architecture Description (SSAD)
System and Software Architecture Description (SSAD) LiveRiot Video Editing System and social networking enhancement Team 04 Yang Li Haoyu Huang Ye Tian Zichuan Wang Haishan Ye Kaiqi Zhang Mitra, Alok Project
More informationEmbracing social big data in wireless system design
Journal of Communications and Information Networks, Vol.2, No.1, Mar. 2017 DOI: 10.1007/s41650-017-0007-9 c Posts & Telecom Press and Springer Singapore 2017 Research paper Special Issue on Wireless Big
More informationBigDataBench: a Big Data Benchmark Suite from Web Search Engines
BigDataBench: a Big Data Benchmark Suite from Web Search Engines Wanling Gao, Yuqing Zhu, Zhen Jia, Chunjie Luo, Lei Wang, Jianfeng Zhan, Yongqiang He, Shiming Gong, Xiaona Li, Shujie Zhang, and Bizhu
More informationLeading in the compute era
Leading in the compute era Delivering the right compute, for the right workload, at the right economics every time. Ray Christian HP Server Product Manager Updated August 25, 2014 The most exciting shifts
More information2018 Trends in Hosting & Cloud Managed Services
PREVIEW 2018 Trends in Hosting & Cloud Managed Services DEC 2017 Rory Duncan, Research Director, Managed Services & Hosting Penny Jones, Principal Analyst - MTDC & Managed Services Aaron Sherrill, Senior
More informationConsumer Opinions and Habits A XIRRUS STUDY
Consumer Opinions and Habits A XIRRUS STUDY Executive Summary With more devices on the planet than people, it goes without saying that wireless is no longer a bonus - it s a necessity. By the end of 2015,
More informationnetworks data threads
Informatics Department Aristotle University of Thessaloniki A distributed framework for early trending topics detection on big social networks data threads AT HE NA VA KA L I, N I KOL AOS KI T MER IDIS,
More information2 Ontology evolution algorithm based on web-pages and users behavior logs
ISSN 1749-3889 (print), 1749-3897 (online) International Journal of Nonlinear Science Vol.18(2014) No.1,pp.86-91 Ontology Evolution Algorithm for Topic Information Collection Jing Ma 1, Mengyong Sun 1,
More informationIdentifying Web Spam With User Behavior Analysis
Identifying Web Spam With User Behavior Analysis Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Tech. & Sys. Tsinghua University 2008/04/23 Introduction simple math
More informationKnowledge Discovery of Small Business Domain Using Web Crawling and Data Mining
Knowledge Discovery of Small Business Domain Using Web Crawling and Data Mining Latha M 1, Shivanand R D 2 1M.Tech. Department of Computer Science and Engineering, Bapuji Institute of Technology, Davanagere,
More informationMICROSOFT ONLINE (ONEDRIVE) VS G SUITE (GOOGLE DRIVE)
MICROSOFT ONLINE (ONEDRIVE) VS G SUITE (GOOGLE DRIVE) COST ONEDRIVE (MICROSOFT ONLINE) OneDrive offers three different business plans: First option: OneDrive for Business Plan 1 - $5.00/month per user
More informationCloud-based Twitter sentiment analysis for Ranking of hotels in the Cities of Australia
Cloud-based Twitter sentiment analysis for Ranking of hotels in the Cities of Australia Distributed Computing Project Final Report Elisa Mena (633144) Supervisor: Richard Sinnott Cloud-based Twitter Sentiment
More informationFlash, Any Time Everywhere
Flash, Any Time Everywhere WatchStor CUI HAO August 2016 1 About WatchStor Watchstor.com is a China leading IT Media, founded in 2008. Today, Watchstor.com has more than 4.1 million daily average page
More informationUsing Social Media to Extend Your Marketing Campaign Effectiveness
Using Social Media to Extend Your Email Marketing Campaign Effectiveness Constant Contact, Inc. 1601 Trapelo Road, Suite 329 Waltham, MA 02451 Phone: 1-866-876-8464 Using Social Media to Extend Your Email
More informationA data-driven framework for archiving and exploring social media data
A data-driven framework for archiving and exploring social media data Qunying Huang and Chen Xu Yongqi An, 20599957 Oct 18, 2016 Introduction Social media applications are widely deployed in various platforms
More informationSocial Network Analytics on Cray Urika-XA
Social Network Analytics on Cray Urika-XA Mike Hinchey, mhinchey@cray.com Technical Solutions Architect Cray Inc, Analytics Products Group April, 2015 Agenda 1. Introduce platform Urika-XA 2. Technology
More informationExtracting Information from Complex Networks
Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform
More informationCS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul
1 CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul Introduction Our problem is crawling a static social graph (snapshot). Given
More informationA Letting agency s shop window is no longer a place on the high street, it is now online
A Letting agency s shop window is no longer a place on the high street, it is now online 1 Let s start by breaking down the two ways in which search engines will send you more traffic: 1. Search Engine
More informationAn Overview of Web Accessibility Evaluation of Government Websites in China Liang-cheng LI, Jia-jun BU*, Zhi YU, Wei WANG and Can WANG
2016 2 nd International Conference on Social Science and Development (ICSSD 2016) ISBN: 978-1-60595-356-4 An Overview of Web Accessibility Evaluation of Government Websites in China Liang-cheng LI, Jia-jun
More informationCollecting Tweets. User Timelines, User Update
Collecting Tweets User Timelines, User Update Outline HCDE user module UserTimeline.py Instantiation Parameters HCDE user module Update.py Using UserTimeline.py command line Part of the HCDE User Module
More informationDesign Inaccuracy Cross Link Authoring Flaw - ipaper Platform
COSEINC Design Inaccuracy Cross Link Authoring Flaw - ipaper Platform Aditya K Sood, - Sr. Security Researcher, Vulnerability Research Labs, COSEINC Email: Aditya [at] research.coseinc.com Website: http://www.coseinc.com
More informationCMSC5733 Social Computing
CMSC5733 Social Computing Tutorial 1: Python and Web Crawling Yuanyuan, Man The Chinese University of Hong Kong sophiaqhsw@gmail.com Tutorial Overview Python basics and useful packages Web Crawling Why
More informationAn Overview of PQC Workshops/Projects and Standardization Concerns in China
An Overview of PQC Workshops/Projects and Standardization Concerns in China Hong Xiang, Tao Xiang Chongqing University Zheng-feng Zhang Institute of Software Chinese Academy of Sciences Zheng-fu Han University
More informationRAW, SMACKDOWN AND PRIMETIME CABLE TV RATINGS
KEY PERFORMANCE INDICATORS MAY 3, 2018 AVERAGE US PRIMETIME CABLE TV RATINGS RAW, SMACKDOWN AND PRIMETIME CABLE TV RATINGS 2.47 +2% 2.53-2% 2.08 2.04 1.91 1.94 1.61 1.56 1.27-12% 1.11-8% 0.75 0.69 0.98
More informationCPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University
CPSC 426/526 Cloud Computing Ennan Zhai Computer Science Department Yale University Recall: Lec-7 In the lec-7, I talked about: - P2P vs Enterprise control - Firewall - NATs - Software defined network
More informationSTUDYING OF CLASSIFYING CHINESE SMS MESSAGES
STUDYING OF CLASSIFYING CHINESE SMS MESSAGES BASED ON BAYESIAN CLASSIFICATION 1 LI FENG, 2 LI JIGANG 1,2 Computer Science Department, DongHua University, Shanghai, China E-mail: 1 Lifeng@dhu.edu.cn, 2
More information