Digital Analytics Meetup #6. Thessaloniki, 21 March 2018

Transcription

1 Digital Analytics Meetup #6, Thessaloniki, 21 March 2018

2 Hello! Τάσος Βεντούρης (Tasos Ventouris), Data Scientist and Game Hattrick Ltd. You can find me Tasos Ventouris

3 More About Me! (2012) BSc Math Auth; (2013) MSc Web Math Auth; (2013) Open Knowledge Greece; (2014) Open Knowledge Inter.; (2014) Data Hattrick; (2016) Found stackprime

4 The Data Science Handbook

5 The history: How did it all begin?

6 Timeline: "Computer Science = Data Science", by Peter Naur; first use in a publication, by Peter Naur; a conference titled "Data Science, classification, and related methods"; a talk by Jeff Wu titled "Statistics = Data Science?"; William S. Cleveland used Data Science as an independent term in an article in the International Statistical Review; Committee on Data for Science & Technology: a new journal titled Data Science Journal; The Journal of Data Science, from Columbia University; DJ Patil & Jeff Hammerbacher used the title Data Scientist; a Harvard Business Review article titled "Data Scientist: The Sexiest Job of the 21st Century"

7 Data Science Bubble or not?

10 The creation of data products. *Data product = a tool that was created using data and helps with decision-making.

12 Data Scientist: What are their characteristics?

14 A Data Scientist is a statistician who lives in San Francisco

15 A Data Scientist is a person who is able to: run a regression; write a SQL query; scrape a web site; design an experiment; factor matrices; use a data frame; pretend to understand deep learning; steal from the d3 gallery; argue R versus Python; use MapReduce; build a dashboard; clean up messy data; test a hypothesis; script a shell; hack a p-value; machine-learn a model; talk to a business person

35 The path of a Data Scientist: RPG character?

36 The path of the Data Scientist

37 Use your brain to make decisions. Don't use it to store info.

38 Data Science Process: What are the steps?

41 Reality???

42 Machine Learning: or, in other words, learning machines

43 Machine Learning family

44 Azure ML

45 What is Azure Machine Learning? One of the best tools to learn ML; drag and drop; R, Python, SQL, Jupyter integration; no code (if you don't want it); Web Service (RESTful API); part of Azure Cloud
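
To make the "Web Service (RESTful API)" point concrete, here is a minimal sketch of how a client might prepare a scoring request for a published model. The endpoint URL, the API key, the column names and the payload layout are all placeholders assumed for illustration; none of them come from the talk. The request is only built, never sent.

```python
import json
import urllib.request

# Hypothetical placeholders: replace with the values of a real published service.
ENDPOINT = "https://example.invalid/score"
API_KEY = "YOUR_API_KEY"


def build_scoring_request(columns, rows):
    """Build (but do not send) a JSON POST request for a scoring web service."""
    # Assumed payload layout: column names plus a list of value rows.
    payload = {"Inputs": {"input1": {"ColumnNames": columns, "Values": rows}}}
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        ENDPOINT,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + API_KEY,
        },
        method="POST",
    )


request = build_scoring_request(["age", "income"], [[34, 52000]])
print(request.get_method(), request.full_url)
```

Sending the request would be a matter of passing it to `urllib.request.urlopen`; the exact payload a given service expects should be taken from that service's own documentation.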

46 Azure ML Modules

47 User interaction

48 Dataset Preview

54 Data Transformation: Filters (like median and moving average); Manipulation (add/remove/edit/join data); Sample/Partition/Split; Scale and Reduce (Normalize, PCA)
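
As a rough illustration of the filters and scaling named on this slide, the sketch below implements a moving average, a median filter and min-max normalization in plain Python. The function names and the sample series are made up for the example; they are not Azure ML modules.

```python
from statistics import mean, median


def moving_average(values, window):
    """Mean of each full sliding window of length `window`."""
    return [mean(values[i:i + window]) for i in range(len(values) - window + 1)]


def median_filter(values, window):
    """Median of each full sliding window; robust to isolated spikes."""
    return [median(values[i:i + window]) for i in range(len(values) - window + 1)]


def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map everything to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


series = [1, 2, 100, 4, 5]  # illustrative data with one spike
print(moving_average(series, 3))
print(median_filter(series, 3))
print(min_max_normalize([10, 15, 20]))
```

On the sample series the median filter suppresses the spike at 100, while the moving average is pulled up by it, which is the usual reason to prefer a median filter for noisy data with outliers.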

56 Missing Data

59 Who said anything about PCA?

60 Split Data
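
The slide only names the step, so the following is a minimal sketch of what splitting a dataset typically means: shuffle the rows, then hold out a fraction for testing. The 30% test fraction and the fixed seed are assumptions chosen for the example, not settings from the talk.

```python
import random


def train_test_split(rows, test_fraction=0.3, seed=42):
    """Shuffle a copy of `rows` and split it into (train, test) lists."""
    shuffled = list(rows)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


data = list(range(10))  # stand-in for dataset rows
train, test = train_test_split(data, test_fraction=0.3)
print(len(train), len(test))
```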

61 Train Model

62 Visualise the results

63 Evaluate Model
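
The slide does not say which metrics were shown. As a sketch under that caveat, common choices for a binary classifier are accuracy, precision and recall, computed here from hypothetical true and predicted labels.

```python
def evaluate_binary(y_true, y_pred):
    """Return accuracy, precision and recall for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Hypothetical labels, for illustration only.
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 1, 0]
metrics = evaluate_binary(y_true, y_pred)
print(metrics)
```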

64 Feature Importance

65 Overfitting? Min: -3s + x = 0.73; Max: 3s + x =
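
The expressions on this slide read like the bounds of a three-sigma interval. Assuming x is a sample mean and s a sample standard deviation (the slide does not define them), the bounds could be computed as below. The scores are invented for the example and are not the figures from the talk.

```python
from statistics import mean, stdev


def three_sigma_bounds(values):
    """Return (mean - 3*s, mean + 3*s) using the sample standard deviation."""
    m = mean(values)
    s = stdev(values)
    return m - 3 * s, m + 3 * s


# Hypothetical scores, e.g. one per evaluation run.
scores = [0.80, 0.82, 0.78, 0.81, 0.79]
low, high = three_sigma_bounds(scores)
print(low, high)
```

A score that falls well outside such an interval on new data is one possible hint that the model does not generalize as well as its training results suggest.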

69 I like to get my hands dirty

70 Play time!

71 Thanks! Any questions?

MySQL. Βάσεις Δεδομένων Ευαγγελία Πιτουρά 1

MySQL. Βάσεις Δεδομένων Ευαγγελία Πιτουρά 1 Βάσεις Δεδομένων 2018-2019 Ευαγγελία Πιτουρά 1 https://dev.mysql.com/downloads/ Free software, (community server) Custom installation Connectors Java ή python (MariaDB is a community-developed fork of

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Dr. SubraMANI Paramasivam. Think & Work like a Data Scientist with SQL 2016 & R

Dr. SubraMANI Paramasivam. Think & Work like a Data Scientist with SQL 2016 & R Dr. SubraMANI Paramasivam Think & Work like a Data Scientist with SQL 2016 & R About the Speaker Group Leader Dr. SubraMANI Paramasivam PhD., MVP, MCT, MCSE (x2), MCITP (x2), MCP, MCTS (x3), MCSA CEO,

More information

As a reference, please find a version of the Machine Learning Process described in the diagram below.

As a reference, please find a version of the Machine Learning Process described in the diagram below. PREDICTION OVERVIEW In this experiment, two of the Project PEACH datasets will be used to predict the reaction of a user to atmospheric factors. This experiment represents the first iteration of the Machine

More information

Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI)

Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI) Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI) About the Speaker Dr. SubraMANI Paramasivam PhD., MCT, MCSE, MCITP, MCP, MCTS, MCSA CEO, Principal Consultant & Trainer

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Applied Machine Learning

Applied Machine Learning Applied Machine Learning Lab 3 Working with Text Data Overview In this lab, you will use R or Python to work with text data. Specifically, you will use code to clean text, remove stop words, and apply

More information

Data Science and Machine Learning Essentials

Data Science and Machine Learning Essentials Data Science and Machine Learning Essentials Lab 5B Publishing Models in Azure ML By Stephen Elston and Graeme Malcolm Overview In this lab you will publish one of the models you created in a previous

More information

Σύντομο Tutorial του προγράμματος MPPTEST στα Linux. 1. Γενικά περί μέτρησης και αξιολόγησης της απόδοσης του ΜΡΙ με τη χρήση του MPPTEST

Σύντομο Tutorial του προγράμματος MPPTEST στα Linux. 1. Γενικά περί μέτρησης και αξιολόγησης της απόδοσης του ΜΡΙ με τη χρήση του MPPTEST Σύντομο Tutorial του προγράμματος MPPTEST στα Linux 1. Γενικά περί μέτρησης και αξιολόγησης της απόδοσης του ΜΡΙ με τη χρήση του MPPTEST Το mpptest είναι ένα πρόγραμμα MPI, που μπορεί να εξετάσει πολλά

More information

Activator Library. Focus on maximizing the value of your data, gain business insights, increase your team s productivity, and achieve success.

Activator Library. Focus on maximizing the value of your data, gain business insights, increase your team s productivity, and achieve success. Focus on maximizing the value of your data, gain business insights, increase your team s productivity, and achieve success. ACTIVATORS Designed to give your team assistance when you need it most without

More information

Υλοποιήσεις του MPI To MPI είναι ένα standard και όχι μια βιβλιοθήκη. Δημοφιλείς υλοποιήσεις του MPI standard είναι οι ακόλουθες:

Υλοποιήσεις του MPI To MPI είναι ένα standard και όχι μια βιβλιοθήκη. Δημοφιλείς υλοποιήσεις του MPI standard είναι οι ακόλουθες: Τμήμα Μηχανικών Πληροφορικής Τ.Ε. Σχολή Τεχνολογικών Εφαρμογών Ακαδημαϊκό έτος 2016-2017 ΤΕΙ Ηπείρου - Άρτα Κατανεμημένα και Παράλληλα Συστήματα (εργαστήριο) Παραδείγματα με MPI Γκόγκος Χρήστος Υλοποιήσεις

More information

DATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course:

DATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course: DATA SCIENCE About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst/Analytics Manager/Actuarial Scientist/Business

More information

SQL Server Machine Learning Marek Chmel & Vladimir Muzny

SQL Server Machine Learning Marek Chmel & Vladimir Muzny SQL Server Machine Learning Marek Chmel & Vladimir Muzny @VladimirMuzny & @MarekChmel MCTs, MVPs, MCSEs Data Enthusiasts! vladimir@datascienceteam.cz marek@datascienceteam.cz Session Agenda Machine learning

More information

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core)

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core) Introduction to Data Science What is Analytics and Data Science? Overview of Data Science and Analytics Why Analytics is is becoming popular now? Application of Analytics in business Analytics Vs Data

More information

On the basis inverse of the exterior point simplex algorithm

On the basis inverse of the exterior point simplex algorithm Πρακτικά 7ου Συνεδρίου Ε.Ε.Ε.Ε. «ιαχείριση Κινδύνων» Πανεπιστήµιο Πατρών, 6-8 Ιουνίου 2005, Σελ. 677-687 On the basis inverse of the exterior point simplex algorithm El-Said Badr, K. Paparrizos, N. Samaras

More information

Data Engineering for Data Science

Data Engineering for Data Science Engineering for Science Arup Nanda VP, Services Priceline booking.com priceline.com kayak.com agoda.com rentalcars.com opentable.com 2 Science and Machine Learning Customer Segmentation Prediction of Behavior

More information

Data Analytics Training Program

Data Analytics Training Program Data Analytics Training Program In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers Willing

More information

Proc. No. 98/2017. There is no need to change the expiry date of the Bank Guarantee.

Proc. No. 98/2017. There is no need to change the expiry date of the Bank Guarantee. Proc. No. 98/2017 No. Questions Answers 1 The deadline for submission has changed and now is 28th of September. Should we change the Bank Guarantee ending date (Λήξη) from November 30 to December 7th (1

More information

About Intellipaat. About the Course. Why Take This Course?

About Intellipaat. About the Course. Why Take This Course? About Intellipaat Intellipaat is a fast growing professional training provider that is offering training in over 150 most sought-after tools and technologies. We have a learner base of 700,000 in over

More information

Towards Reliable Interactive Data Cleaning: A User Survey and Recommendations

Towards Reliable Interactive Data Cleaning: A User Survey and Recommendations Towards Reliable Interactive Data Cleaning: A User Survey and Recommendations coax treasure out of messy, unstructured data Sanjay Krishnan, Daniel Haas, Eugene Wu, Michael Franklin HILDA 2016 1 2 204

More information

Online Display Advertising Report ADEX Q Greece. Greece 2017

Online Display Advertising Report ADEX Q Greece. Greece 2017 Online Display Advertising Report ADEX Q3 2017 Greece Greece 2017 12/5/2017 4:24 PM D:\IAB Google Drive\IAB HELLAS\!RESEARCH\!ADEX\!ADEX REPORT\!2017\ADEX 2017\!ADEX DISPLAY Q3 2017 IAB HELLAS 22.11.2017.ppt

More information

IT Tools MICROSOFT OFFICE SPECIALIST A: Enabling and Managing Office 365. Κωδικός Σεμιναρίου / Code. Που Απευθύνεται / Audience IT Professionals

IT Tools MICROSOFT OFFICE SPECIALIST A: Enabling and Managing Office 365. Κωδικός Σεμιναρίου / Code. Που Απευθύνεται / Audience IT Professionals 20347A: Enabling and Managing Office 365 Κωδικός Σεμιναρίου / Code MS-20347 Που Απευθύνεται / Audience IT Professionals ΠροϋποθέσειςΣυμμετοχής / Prerequisites Τουλάχιστον δύο χρόνια εμπειρίας στη διαχείριση

More information

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14 Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases Lecture 14 Big Data Management IV: Big-data Infrastructures (Background, IO, From NFS to HFDS) Chapter 14-15: Abideboul

More information

Ένα πλαίσιο για συσταδοποίηση και κατηγοριοποίηση μεγαλών δεδομένων με χρήση του Spark

Ένα πλαίσιο για συσταδοποίηση και κατηγοριοποίηση μεγαλών δεδομένων με χρήση του Spark Ένα πλαίσιο για συσταδοποίηση και κατηγοριοποίηση μεγαλών δεδομένων με χρήση του Spark Χρήστος Μάλλιος, Βασίλης Βασσάλος, Τάσος Βενέτης και Ακριβή Βλάχου Στην σύγχρονη εποχή, ένας τεράστιος όγκος δεδομένων

More information

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context 1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes

More information

Specialist ICT Learning

Specialist ICT Learning Specialist ICT Learning APPLIED DATA SCIENCE AND BIG DATA ANALYTICS GTBD7 Course Description This intensive training course provides theoretical and technical aspects of Data Science and Business Analytics.

More information

Distributed Machine Learning" on Spark

Distributed Machine Learning on Spark Distributed Machine Learning" on Spark Reza Zadeh @Reza_Zadeh http://reza-zadeh.com Outline Data flow vs. traditional network programming Spark computing engine Optimization Example Matrix Computations

More information

Introduction to Data Science

Introduction to Data Science Introduction to Data Science Lab 4 Introduction to Machine Learning Overview In the previous labs, you explored a dataset containing details of lemonade sales. In this lab, you will use machine learning

More information

Developing Intelligent Apps

Developing Intelligent Apps Developing Intelligent Apps Lab 1 Creating a Simple Client Application By Gerry O'Brien Overview In this lab you will construct a simple client application that will call an Azure ML web service that you

More information

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect BEOP.CTO.TP4 Owner: OCTO Revision: 0001 Approved by: JAT Effective: 08/30/2018 Buchanan & Edwards Proprietary: Printed copies of

More information

MATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by

MATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by 1 MATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by MathWorks In 2004, MATLAB had around one million users

More information

Data Science Essentials

Data Science Essentials Data Science Essentials Lab 6 Introduction to Machine Learning Overview In this lab, you will use Azure Machine Learning to train, evaluate, and publish a classification model, a regression model, and

More information

1 από 6 2/11/ :19 μμ

1 από 6 2/11/ :19 μμ Applicant Login E-Learning Intranet Contact Us First Name: Last Name: E-mail: Your Country: Phone No: Status: Preferred Start: Interested In: Choose Password: Repeat Password: Select Country Select Status

More information

2/26/2017. Originally developed at the University of California - Berkeley's AMPLab

2/26/2017. Originally developed at the University of California - Berkeley's AMPLab Apache is a fast and general engine for large-scale data processing aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes Low latency: sub-second

More information

OctopusFS: Ένα Κατανεμημένο Σύστημα Διαχείρισης Αρχείων με Βαθμίδες Αποθήκευσης

OctopusFS: Ένα Κατανεμημένο Σύστημα Διαχείρισης Αρχείων με Βαθμίδες Αποθήκευσης OctopusFS: Ένα Κατανεμημένο Σύστημα Διαχείρισης Αρχείων με Βαθμίδες Αποθήκευσης Έλενα Κακουλλή Τεχνολογικό Πανεπιστήμιο Κύπρου Οδός Αρχιεπισκόπου Κυπριανού 30 3036 Λεμεσός, Κύπρος elena.kakoulli@cut.ac.cy

More information

Business Analytics Nanodegree Syllabus

Business Analytics Nanodegree Syllabus Business Analytics Nanodegree Syllabus Master data fundamentals applicable to any industry Before You Start There are no prerequisites for this program, aside from basic computer skills. You should be

More information

UNIVERSITY OF PIRAEUS DEPT. OF DIGITAL SYSTEMS MSc IN INFORMATION SYSTEMS & SERVICES BIG Data & ANALYTICS

UNIVERSITY OF PIRAEUS DEPT. OF DIGITAL SYSTEMS MSc IN INFORMATION SYSTEMS & SERVICES BIG Data & ANALYTICS UNIVERSITY OF PIRAEUS DEPT. OF DIGITAL SYSTEMS MSc IN INFORMATION SYSTEMS & SERVICES BIG Data & ANALYTICS CLUSTERING STREAMING DATA IN DISTRIBUTED ENVIRONMENTS BASED ON BELIEF PROPAGATION TECHNIQUES ZISIS

More information

ΠΑΝΕΠΙΣΤΗΜΙΟ ΘΕΣΣΑΛΙΑΣ ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΜΗΧΑΝΙΚΩΝ ΥΠΟΛΟΓΙΣΤΩΝ. Διπλωματική Εργασία

ΠΑΝΕΠΙΣΤΗΜΙΟ ΘΕΣΣΑΛΙΑΣ ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΜΗΧΑΝΙΚΩΝ ΥΠΟΛΟΓΙΣΤΩΝ. Διπλωματική Εργασία ΠΑΝΕΠΙΣΤΗΜΙΟ ΘΕΣΣΑΛΙΑΣ ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΜΗΧΑΝΙΚΩΝ ΥΠΟΛΟΓΙΣΤΩΝ Διπλωματική Εργασία Θέμα: Υλοποίηση και αξιολόγηση πρωτοκόλλων δικτύωσης με χρήση της OpenFlow τεχνολογίας Ραδουνισλής Αναστάσιος

More information

Data Analytics Training Program using

Data Analytics Training Program using Data Analytics Training Program using In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers

More information

MicroStrategy Analytics Desktop

MicroStrategy Analytics Desktop MicroStrategy Analytics Desktop Quick Start Guide MicroStrategy Analytics Desktop is designed to enable business professionals like you to explore data, simply and without needing direct support from IT.

More information

The Evolution of Big Data Platforms and Data Science

The Evolution of Big Data Platforms and Data Science IBM Analytics The Evolution of Big Data Platforms and Data Science ECC Conference 2016 Brandon MacKenzie June 13, 2016 2016 IBM Corporation Hello, I m Brandon MacKenzie. I work at IBM. Data Science - Offering

More information

Arduino: LCD Thermometer with optional Alarm

Arduino: LCD Thermometer with optional Alarm Arduino: LCD Thermometer with optional Alarm http://wellsb.com/post/1328196426/arduino-lcd-thermometer-with-optional-alarm Το πείραμα αυτό το έκανα και δούλεψε άψογα και ως θερμόμετρο (C:\Documents and

More information

Tackling Big Data Using MATLAB

Tackling Big Data Using MATLAB Tackling Big Data Using MATLAB Alka Nair Application Engineer 2015 The MathWorks, Inc. 1 Building Machine Learning Models with Big Data Access Preprocess, Exploration & Model Development Scale up & Integrate

More information

Data Science Bootcamp Curriculum. NYC Data Science Academy

Data Science Bootcamp Curriculum. NYC Data Science Academy Data Science Bootcamp Curriculum NYC Data Science Academy 100+ hours free, self-paced online course. Access to part-time in-person courses hosted at NYC campus Machine Learning with R and Python Foundations

More information

Αλγοριθμική επίλυση προβλημάτων, προγραμματιστικοί διαγωνισμοί, σωστές και καλές λύσεις

Αλγοριθμική επίλυση προβλημάτων, προγραμματιστικοί διαγωνισμοί, σωστές και καλές λύσεις Αλγοριθμική επίλυση προβλημάτων, προγραμματιστικοί διαγωνισμοί, σωστές και καλές λύσεις Νίκος Παπασπύρου Σχολή Ηλεκτρολόγων Μηχ. και Μηχ. Υπολογιστών Εθνικό Μετσόβιο Πολυτεχνείο Image from Phil Wadler

More information

Tomaž Kaštrun. Data Science for Beginners

Tomaž Kaštrun. Data Science for Beginners Tomaž Kaštrun Data Science for Beginners To all sponsors, thank you! Thanks to all organizers! GetLatestVersion. it About (2.0.1) BI Developer and data analyst SQL Server, SAS, R, Python, C#, SAP, SPSS

More information

Data Mining. Jeff M. Phillips. January 7, 2019 CS 5140 / CS 6140

Data Mining. Jeff M. Phillips. January 7, 2019 CS 5140 / CS 6140 Data Mining CS 5140 / CS 6140 Jeff M. Phillips January 7, 2019 What is Data Mining? What is Data Mining? Finding structure in data? Machine learning on large data? Unsupervised learning? Large scale computational

More information

Traffic Load Prediction in SDN/OpenFlow Networks

Traffic Load Prediction in SDN/OpenFlow Networks UNIVERSITY OF PIRAEUS DEPARTMENT OF DIGITAL SYSTEMS POSTGRADUATE PROGRAM DIGITAL SYSTEMS AND SERVICES Traffic Load Prediction in SDN/OpenFlow Networks Master Thesis Candidate Theoni Petropoulou Advisor

More information

20532D - Version: 1. Developing Microsoft Azure Solutions

20532D - Version: 1. Developing Microsoft Azure Solutions 20532D - Version: 1 Developing Microsoft Azure Solutions Developing Microsoft Azure Solutions 20532D - Version: 1 5 days Course Description: This course offers students the opportunity to take an existing

More information

WP Cookie Control 8. User. Manual. Version 1. civicuk.com. Cookie Control 8 for Wordpress

WP Cookie Control 8. User. Manual. Version 1. civicuk.com. Cookie Control 8 for Wordpress WP Cookie Control 8 User Manual Version 1 CIVIC 12 South Charlotte Street Edinburgh EH2 4AX 0131 624 9830 info@ www. Registered in Scotland SC221925 VAT Registration No 774906100 Civic Computing Ltd 1

More information

Big Data Infrastructures & Technologies

Big Data Infrastructures & Technologies Big Data Infrastructures & Technologies Spark and MLLIB OVERVIEW OF SPARK What is Spark? Fast and expressive cluster computing system interoperable with Apache Hadoop Improves efficiency through: In-memory

More information

Learning Objectives for Data Concept and Visualization

Learning Objectives for Data Concept and Visualization Learning Objectives for Data Concept and Visualization Assignment 1: Data Quality Concept and Impact of Data Quality Summarize concepts of data quality. Understand and describe the impact of data on actuarial

More information

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Blended Learning Outline: Cloudera Data Analyst Training (171219a) Blended Learning Outline: Cloudera Data Analyst Training (171219a) Cloudera Univeristy s data analyst training course will teach you to apply traditional data analytics and business intelligence skills

More information

Introduction to Data Analytics. David Walling

Introduction to Data Analytics. David Walling Introduction to Data Analytics David Walling walling@tacc.utexas.edu Source: http://research.microsoft.com/en-us/collaboration/fourthparadigm/default.aspx Computational Simulation Model first, given initial

More information

Python With Data Science

Python With Data Science Course Overview This course covers theoretical and technical aspects of using Python in Applied Data Science projects and Data Logistics use cases. Who Should Attend Data Scientists, Software Developers,

More information

MAPPING HYDROTHERMAL ALTERATION ZONES FROM SPECTRAL BAND RATIOS: A GEOSTATISTICAL APPROACH BASED ON THE STABLE SEMIVARIOGRAM MODEL

MAPPING HYDROTHERMAL ALTERATION ZONES FROM SPECTRAL BAND RATIOS: A GEOSTATISTICAL APPROACH BASED ON THE STABLE SEMIVARIOGRAM MODEL Δελτίο της Ελληνικής Γεωλογικής Εταιρίας, τομ. XLVII, 013 Πρακτικά 13 ου Διεθνούς Συνεδρίου, Χανιά, Σεπτ. 013 Bulletin of the Geological Society of Greece, vol. XLVII 013 Proceedings of the 13 th International

More information

Συστήματα Διαχείρισης Βάσεων Δεδομένων

Συστήματα Διαχείρισης Βάσεων Δεδομένων ΕΛΛΗΝΙΚΗ ΔΗΜΟΚΡΑΤΙΑ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΡΗΤΗΣ Συστήματα Διαχείρισης Βάσεων Δεδομένων Διάλεξη 3η: Access methods: Hash Indexes Δημήτρης Πλεξουσάκης Τμήμα Επιστήμης Υπολογιστών ACCESS METHODS: HASH INDEXES 1 Hash

More information

ΝΤUA. Τεχνολογία Πολυμέσων

ΝΤUA. Τεχνολογία Πολυμέσων ΝΤUA Τεχνολογία Πολυμέσων 3. Διάλεξη 3: Transform Coding Rate Distortion Theory D may be the Mean Square Error or some human perceived measure of distortion Types of Lossy Compression VBR Variable Bit

More information

Developing Microsoft Azure Solutions

Developing Microsoft Azure Solutions Course 20532C: Developing Microsoft Azure Solutions Course details Course Outline Module 1: OVERVIEW OF THE MICROSOFT AZURE PLATFORM This module reviews the services available in the Azure platform and

More information

IBM Data Science Experience White paper. SparkR. Transforming R into a tool for big data analytics

IBM Data Science Experience White paper. SparkR. Transforming R into a tool for big data analytics IBM Data Science Experience White paper R Transforming R into a tool for big data analytics 2 R Executive summary This white paper introduces R, a package for the R statistical programming language that

More information

Machine Learning with Python

Machine Learning with Python DEVNET-2163 Machine Learning with Python Dmitry Figol, SE WW Enterprise Sales @dmfigol Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session

More information

Τα Διεθνή Πρότυπα που έχουν εκδοθεί από τον Διεθνή Οργανισμό Τυποποίησης μπορείτε να τα δείτε εδώ

Τα Διεθνή Πρότυπα που έχουν εκδοθεί από τον Διεθνή Οργανισμό Τυποποίησης μπορείτε να τα δείτε εδώ Λευκωσία, 17 Ιουνίου 2016 ΠΡΟΣ: ΘΕΜΑ: Ολα τα Μέλη Δημοσίευση Ευρωπαϊκών Προτύπων-CEN & CENELEC Κύριοι, Σας αποστέλλονται συνημμένα τα κείμενα που περιλαμβάνουν: Τα Ευρωπαϊκά Πρότυπα CEN και CENELEC που

More information

CSC 261/461 Database Systems Lecture 24. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101

CSC 261/461 Database Systems Lecture 24. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101 CSC 261/461 Database Systems Lecture 24 Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101 Announcements Term Paper due on April 20 April 23 Project 1 Milestone 4 is out Due on 05/03 But I would

More information
