Analysing the Panama Papers with Oracle Big Data Spatial and Graph
|
|
- Alvin Daniels
- 5 years ago
- Views:
Transcription
1 speakerdeck.com/rmoff/ Analysing the Panama Papers with Oracle Big Data Spatial and Graph BIWA Summit 2017 Robin Moffatt, Rittman Mead 1
2 Robin Moffatt! Head of R&D, Rittman Mead Previously OBIEE/DW developer at large UK retailer Previously SQL Server DBA, Business Objects, DB2, COBOL.! Oracle ACE! Frequent blogger : and IRC: rmoff / #obihackers / freenode info@rittmanmead.com 2
3 Rittman Mead! Oracle Gold Partner with offices in the UK and USA! 70+ staff delivering Oracle BI, DW, Big Data and Advanced Analytics projects! Significant web presence with the Rittman Mead Blog ( Hadoop R&D lab for dogfooding solutions developed for customers 3
4 What are the Panama Papers?! Dataset of 11.5 million documents regarding offshore entities, released in 2015! International Consortium of Investigative Journalists (ICIJ) analysed the raw data and made available a curated set of the data! The New York Times and The Guardian among newspapers that investigated the data in depth There are legitimate uses for o"shore companies and trusts. We do not intend to suggest or imply that any persons, companies or other entities included in the ICIJ O!shore Leaks Database have broken the law or otherwise acted improperly. Many people and entities have the same or similar names. 4
5 What is a Property Graph and Why is it So Useful? Graph enables us to answer question that relational would struggle with - Show me all people with a connection to an entity that is connected to at least five other people You could write recursive or procedural SQL but it would be nasty. It would also be impossible to maintain and repeat at scale. Graph-based algorithms (e.g. PageRank) enrich an existing dataset and give us additional insights into it 5
6 Property Graph Terminology Node/Vertex - The What Edge - The How / Relationship (Un)Directed - Does the relationship go in a specific direction - for example, a person is an officer of a company, not the other way around 6
7 Graph Analysis Uses 7
8 Oracle Big Data Spatial and Graph! Store the Property Graph definition in HBase or Oracle NoSQL! API to load/modify data! In-memory analytic engine (PGX) loads graph for analysis, and provides built-in algorithm implementations! Also provides RDF and Spatial capabilities 8
9 New in Oracle Property Graph support! Store Property Graph data in Oracle RDBMS! Query through PGX same way as existing Big Data tool version! 122/SPGDG/using-propertygraphs-oracledatabase.htm#BDSPA191 9
10 Interacting with Property Graphs! First-class Java APIs for programatic access - Gremlin support for interactive work! Nascent Python support (pyopg)! Spark library available! Use Notebooks!! Interactive visualisation and exploration of data with tools like Cytoscape (open source), Tom Sawyer (paid), etc 10
11 Notebooks! Interactive code development & execution environment! Notebooks can be shared for others to run and reproduce findings! Apache Zeppelin and Jupyter two popular options! Working with Spatial and Graph: - pyopg/jupyter - PGX/Zeppelin 11
12 Apache Zeppelin 12
13 Loading the Data Data can be loaded from various formats: - GraphML Data Format - GraphSON Data Format - GML Data Format - Oracle Flat File Format Source data was CSV, which needed wrangling to fit a supported input format - Oracle Flat File Format was chosen - Supports highly-parallised loading in BDSG - BDSG now provides CSV->OPE/V Java API 13
14 Data Wrangling with R panama_opv_ope.r 14
15 importflatfiles() 15
16 Inspecting the Property Graph 16
17 Inspecting the Property Graph 17
18 Analysing the Property Graph 18
19 Analysing the Property Graph - Zeppelin! Native rendering support for resultset objects 19
20 Analysing the Data - Property Graph Query Language (PGQL)! SQL-like language for querying property graph! Same SELECT.. WHERE clause pattern but with syntax for expressing graph relationships! 20
21 Simple PGQL 21
22 PGQL with Predicate 22
23 Built-in Functions 23
24 Native Charting in Zeppelin 24
25 Powerful Predicate support 25
26 PGX Built-In Algorithms 26
27 Exploring the Property Graph with Cytoscape 27
28 Exploring the Property Graph with Cytoscape 28
29 Exploring the Property Graph with Cytoscape 29
30 Layout Algorithms Prefuse Force Directed Layout 30
31 Community Detection 31
32 Community Detection 32
33 Let s walk through an example 33
34 34
35 35
36 36
37 37
38 38
39 39
40 40
41 EOF #EOF ! web! speakerdeck.com/rmoff/ irc! #obihackers 41
Analyzing a social network using Big Data Spatial and Graph Property Graph
Analyzing a social network using Big Data Spatial and Graph Property Graph Oskar van Rest Principal Member of Technical Staff Gabriela Montiel-Moreno Principal Member of Technical Staff Safe Harbor Statement
More informationApply Graph and Deep Learning to Recommendation and Network Intrusion Detection
Apply Graph and Deep Learning to Recommendation and Network Intrusion Detection Zhe Wu, Ph.D. Architect alan.wu@oracle.com Oracle Spatial and Graph June 22, 2017 Outline Introduction and overview of graph
More informationGraph Databases nur ein Hype oder das Ende der relationalen Welt? DOAG 2016
Graph Databases nur ein Hype oder das Ende der relationalen Welt? DOAG 2016 Hans Viehmann Product Manager EMEA 15. November 2016 Safe Harbor Statement The following is intended to outline our general product
More informationDeep Learning und Graphenanalyse im Einsatz gegen Hacker
Deep Learning und Graphenanalyse im Einsatz gegen Hacker Hans Viehmann Product Manager EMEA ORACLE Corporation DOAG Konferenz 2017 @SpatialHannes Safe Harbor Statement The following is intended to outline
More informationOverview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::
Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized
More informationOracle BI 12c Upgrade at LGI Francesco Tisiot, Principal Consultant, Rittman Mead.
Oracle BI 12c Upgrade at LGI Francesco Tisiot, Principal Consultant, Rittman Mead 1 About the Speaker Francesco Tisiot Principal Consultant at Rittman Mead Based in Verona, Italy Writer for Rittman Mead
More informationData Architectures in Azure for Analytics & Big Data
Data Architectures in for Analytics & Big Data October 20, 2018 Melissa Coates Solution Architect, BlueGranite Microsoft Data Platform MVP Blog: www.sqlchick.com Twitter: @sqlchick Data Architecture A
More informationOracle Big Data Science
Oracle Big Data Science Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com @VlamisSoftware Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri
More informationOverview of Oracle Big Data Spatial and Graph Property Graph
Overview of Oracle Big Data Spatial and Graph Property Graph Zhe Wu, Ph.D. Architect Oracle Spatial and Graph Jan, 2016 Copyright 2014 Oracle and/or its affiliates. All rights reserved. The following is
More informationDelving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture
Delving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture Hadoop 1.0 Architecture Introduction to Hadoop & Big Data Hadoop Evolution Hadoop Architecture Networking Concepts Use cases
More informationBuild Recommender Systems, Detect Network Intrusion, and Integrate Deep Learning with Graph Technologies
Build Recommender Systems, Detect Network Intrusion, and Integrate Deep Learning with Graph Technologies Zhe Wu Chris Nicholson Charlie Berger Architect Oracle CEO Skymind Senior Director Oracle BIWA 2017
More information20777A: Implementing Microsoft Azure Cosmos DB Solutions
20777A: Implementing Microsoft Azure Solutions Course Details Course Code: Duration: Notes: 20777A 3 days This course syllabus should be used to determine whether the course is appropriate for the students,
More informationUnderstanding the latent value in all content
Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence
More informationAnalyzing Blockchain and Bitcoin Transaction Data as Graph
Analyzing Blockchain and Bitcoin Transaction Data as Graph Xavier Lopez Senior Director Zhe Wu Architect Oracle Code Boston April 17th, 2018 Copyright 2015 Oracle and/or its affiliates. All rights reserved.
More informationBIG DATA COURSE CONTENT
BIG DATA COURSE CONTENT [I] Get Started with Big Data Microsoft Professional Orientation: Big Data Duration: 12 hrs Course Content: Introduction Course Introduction Data Fundamentals Introduction to Data
More informationOracle Big Data Science IOUG Collaborate 16
Oracle Big Data Science IOUG Collaborate 16 Session 4762 Tim and Dan Vlamis Tuesday, April 12, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed 200+ Oracle
More informationOracle Big Data Spatial and Graph Property Graph: Features and Performance ORACLE TECHNICAL WHITEPAPER DECEMBER 2017
Oracle Big Data Spatial and Graph Property Graph: Features and Performance ORACLE TECHNICAL WHITEPAPER DECEMBER 2017 Table of Contents INTRODUCTION... 2 ORACLE BIG DATA SPATIAL AND GRAPH PROPERTY GRAPH
More informationOracle Big Data Connectors
Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process
More informationDistributed Graph Storage. Veronika Molnár, UZH
Distributed Graph Storage Veronika Molnár, UZH Overview Graphs and Social Networks Criteria for Graph Processing Systems Current Systems Storage Computation Large scale systems Comparison / Best systems
More informationGraph Analytics and Machine Learning A Great Combination Mark Hornick
Graph Analytics and Machine Learning A Great Combination Mark Hornick Oracle Advanced Analytics and Machine Learning November 3, 2017 Safe Harbor Statement The following is intended to outline our research
More informationAnalyzing Blockchain and Bitcoin Transaction Data as Graph
Analyzing Blockchain and Bitcoin Transaction Data as Graph Zhe Wu alan.wu@oracle.com, Ph.D. Architect Oracle Spatial and Graph Feb 2018 Safe Harbor Statement The following is intended to outline our general
More informationAn InterSystems Guide to the Data Galaxy. Benjamin De Boe Product Manager
An InterSystems Guide to the Data Galaxy Benjamin De Boe Product Manager Analytics 3 InterSystems Corporation. All rights reserved. 4 InterSystems Corporation. All rights reserved. 5 InterSystems Corporation.
More informationApache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap
Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation Chris Herrera Hashmap Topics Who - Key Hashmap Team Members The Use Case - Our Need for a Memory
More informationOracle Big Data. A NA LYT ICS A ND MA NAG E MENT.
Oracle Big Data. A NALYTICS A ND MANAG E MENT. Oracle Big Data: Redundância. Compatível com ecossistema Hadoop, HIVE, HBASE, SPARK. Integração com Cloudera Manager. Possibilidade de Utilização da Linguagem
More informationBig Data and Hadoop. Course Curriculum: Your 10 Module Learning Plan. About Edureka
Course Curriculum: Your 10 Module Learning Plan Big Data and Hadoop About Edureka Edureka is a leading e-learning platform providing live instructor-led interactive online training. We cater to professionals
More informationDATA FORMATS FOR DATA SCIENCE Remastered
Budapest BI FORUM 2016 DATA FORMATS FOR DATA SCIENCE Remastered Valerio Maggio @leriomaggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy WhoAmI Post Doc Researcher @ FBK Interested
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationR Language for the SQL Server DBA
R Language for the SQL Server DBA Beginning with R Ing. Eduardo Castro, PhD, Principal Data Analyst Architect, LP Consulting Moderated By: Jose Rolando Guay Paz Thank You microsoft.com idera.com attunity.com
More informationIntroduction to Graph Analytics and Oracle Cloud Service
Introduction to Graph Analytics and Oracle Cloud Service Hans Viehmann Jean Ihm Korbi Schmid Product Manager EMEA Product Manager US Engineering Manager Oracle Oracle Oracle @SpatialHannes @JeanIhm October
More informationData 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.
Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020
More informationDeploying Spatial Applications in Oracle Public Cloud
Deploying Spatial Applications in Oracle Public Cloud David Lapp, Product Manager Oracle Spatial and Graph Oracle Spatial Summit at BIWA 2017 Safe Harbor Statement The following is intended to outline
More informationCloudExpo November 2017 Tomer Levi
CloudExpo November 2017 Tomer Levi About me Full Stack Engineer @ Intel s Advanced Analytics group. Artificial Intelligence unit at Intel. Responsible for (1) Radical improvement of critical processes
More informationIncrease Value from Big Data with Real-Time Data Integration and Streaming Analytics
Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time
More informationRittman Mead Consulting Automatic promotion and versioning with Oracle Data Integrator 12c
Rittman Mead Consulting Automatic promotion and versioning with Oracle Data Integrator 12c Jérôme Françoisse DOAG 2014 Konferenz + Ausstellung - 20th September 2014 1 Jérôme Françoisse Consultant for Rittman
More informationTopics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples
Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?
More informationBig Data with Hadoop Ecosystem
Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process
More informationKevin Madden Chief Software Engineer
Tom Sawyer Software Build Big Data and Graph Visualization Web Applications with Spring Data and Core Java Kevin Madden Chief Software Engineer Agenda Introduction Products and Technology Tom Sawyer Perspectives
More informationAsanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks
Asanka Padmakumara ETL 2.0: Data Engineering with Azure Databricks Who am I? Asanka Padmakumara Business Intelligence Consultant, More than 8 years in BI and Data Warehousing A regular speaker in data
More informationUsing Graphs to Analyze Big Linked Data
Using Graphs to Analyze Big Linked Data Hassan Chafi, Director, Research and Advanced Development Oracle Labs Copyright 2014 Oracle and/or its affiliates. All rights reserved. Safe Harbor Statement The
More informationIntro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect
Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect Igor Roiter Big Data Cloud Solution Architect Working as a Data Specialist for the last 11 years 9 of them as a Consultant specializing
More informationBig Data Analytics using Apache Hadoop and Spark with Scala
Big Data Analytics using Apache Hadoop and Spark with Scala Training Highlights : 80% of the training is with Practical Demo (On Custom Cloudera and Ubuntu Machines) 20% Theory Portion will be important
More informationThe age of Big Data Big Data for Oracle Database Professionals
The age of Big Data Big Data for Oracle Database Professionals Oracle OpenWorld 2017 #OOW17 SessionID: SUN5698 Tom S. Reddy tom.reddy@datareddy.com About the Speaker COLLABORATE & OpenWorld Speaker IOUG
More informationOracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data
Oracle Big Data SQL Release 3.2 The unprecedented explosion in data that can be made useful to enterprises from the Internet of Things, to the social streams of global customer bases has created a tremendous
More informationWhat Are They Talking About These Days? Analyzing Topics with Graphs
What Are They Talking About These Days? Analyzing Topics with Graphs Davide Basilio Bartolini Oracle Labs Zürich, Switzerland Damien Hilloulin
More informationApache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context
1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes
More informationBig Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours
Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals
More informationKibana, Grafana and Zeppelin on Monitoring data
Kibana, Grafana and Zeppelin on Monitoring data Internal group presentaion Ildar Nurgaliev OpenLab Summer student Presentation structure About IT-CM-MM Section and myself Visualisation with Kibana 4 and
More informationCERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI)
CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI) The Certificate in Software Development Life Cycle in BIGDATA, Business Intelligence and Tableau program
More informationReadme file for Oracle Spatial and Graph and OBIEE Sample Application (V305) VirtualBox
I Sections in this Readme Sections in this Readme... 1 Introduction... 1 References... 1 Included Software Releases... 2 Software to Download... 2 Installing the Image... 2 Quick Start for RDF Semantic
More informationStages of Data Processing
Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,
More informationSpectrum Version Release Notes
Spectrum Spatial for Big Data Version 2.6.1 Release Notes This document contains the new and updated features for Spectrum Spatial for Big Data. Contents: What's New? 2 Fixed Issues 3 Known Issues 3 System
More informationPNDA.io: when BGP meets Big-Data
PNDA.io: when BGP meets Big-Data Let s go back in time 26 th April 2017 The Internet is very much alive Millions of BGP events occurring every day 15 Routers Monitored 410 active peers (both IPv4 and IPv6)
More informationMicrosoft Big Data and Hadoop
Microsoft Big Data and Hadoop Lara Rubbelke @sqlgal Cindy Gross @sqlcindy 2 The world of data is changing The 4Vs of Big Data http://nosql.mypopescu.com/post/9621746531/a-definition-of-big-data 3 Common
More informationMigrating Express Applications To Oracle 9i A Practical Guide
Migrating Express Applications To Oracle 9i A Practical Guide Mark Rittman, Mick Bull Plus Consultancy http://www.plusconsultancy.co.uk Agenda Introduction A brief history of Oracle Express Oracle 9i OLAP
More informationBlurring the Line Between Developer and Data Scientist
Blurring the Line Between Developer and Data Scientist Notebooks with PixieDust va barbosa va@us.ibm.com Developer Advocacy IBM Watson Data Platform WHY ARE YOU HERE? More companies making bet-the-business
More informationAutonomous Data Warehouse in the Cloud
AUTONOMOUS DATA WAREHOUSE CLOUD` Connecting Your To Autonomous in the Cloud DWCS What is It? Oracle Autonomous Database Warehouse Cloud is fully-managed, highperformance, and elastic. You will have all
More informationCONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM
CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications
More informationmicrosoft
70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series
More informationSession 7: Oracle R Enterprise OAAgraph Package
Session 7: Oracle R Enterprise 1.5.1 OAAgraph Package Oracle Spatial and Graph PGX Graph Algorithms Oracle R Technologies Mark Hornick Director, Oracle Advanced Analytics and Machine Learning July 2017
More informationOracle Big Data Fundamentals Ed 2
Oracle University Contact Us: 1.800.529.0165 Oracle Big Data Fundamentals Ed 2 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, you learn about big data, the technologies
More informationHadoop Online Training
Hadoop Online Training IQ training facility offers Hadoop Online Training. Our Hadoop trainers come with vast work experience and teaching skills. Our Hadoop training online is regarded as the one of the
More informationKNIME for the life sciences Cambridge Meetup
KNIME for the life sciences Cambridge Meetup Greg Landrum, Ph.D. KNIME.com AG 12 July 2016 What is KNIME? A bit of motivation: tool blending, data blending, documentation, automation, reproducibility More
More informationMySQL for Developers Ed 3
Oracle University Contact Us: 1.800.529.0165 MySQL for Developers Ed 3 Duration: 5 Days What you will learn This MySQL for Developers training teaches developers how to plan, design and implement applications
More informationSQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism
Big Data and Hadoop with Azure HDInsight Andrew Brust Senior Director, Technical Product Marketing and Evangelism Datameer Level: Intermediate Meet Andrew Senior Director, Technical Product Marketing and
More informationThis is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem.
About the Tutorial Sqoop is a tool designed to transfer data between Hadoop and relational database servers. It is used to import data from relational databases such as MySQL, Oracle to Hadoop HDFS, and
More informationHadoop. Introduction / Overview
Hadoop Introduction / Overview Preface We will use these PowerPoint slides to guide us through our topic. Expect 15 minute segments of lecture Expect 1-4 hour lab segments Expect minimal pretty pictures
More informationhttps://linkurio.us/blog/panama-papers-how-linkurious-enables-icij-to-investigate-the-massive-mossack-fonseca-leaks/
https://linkurio.us/blog/panama-papers-how-linkurious-enables-icij-to-investigate-the-massive-mossack-fonseca-leaks/ GEOB 370 November 23, 2017 Database Management Systems Guest lecture by Dr. Luke Bergmann
More informationEnable Spark SQL on NoSQL Hbase tables with HSpark IBM Code Tech Talk. February 13, 2018
Enable Spark SQL on NoSQL Hbase tables with HSpark IBM Code Tech Talk February 13, 2018 https://developer.ibm.com/code/techtalks/enable-spark-sql-onnosql-hbase-tables-with-hspark-2/ >> MARC-ARTHUR PIERRE
More informationSpatial Analytics Built for Big Data Platforms
Spatial Analytics Built for Big Platforms Roberto Infante Software Development Manager, Spatial and Graph 1 Copyright 2011, Oracle and/or its affiliates. All rights Global Digital Growth The Internet of
More informationOracle Machine Learning Notebook
Oracle Machine Learning Notebook Included in Autonomous Data Warehouse Cloud Charlie Berger, MS Engineering, MBA Sr. Director Product Management, Machine Learning, AI and Cognitive Analytics charlie.berger@oracle.com
More informationHow to choose the right approach to analytics and reporting
SOLUTION OVERVIEW How to choose the right approach to analytics and reporting A comprehensive comparison of the open source and commercial versions of the OpenText Analytics Suite In today s digital world,
More informationA Tutorial on Apache Spark
A Tutorial on Apache Spark A Practical Perspective By Harold Mitchell The Goal Learning Outcomes The Goal Learning Outcomes NOTE: The setup, installation, and examples assume Windows user Learn the following:
More informationOracle Big Data Discovery
Oracle Big Data Discovery Turning Data into Business Value Harald Erb Oracle Business Analytics & Big Data 1 Safe Harbor Statement The following is intended to outline our general product direction. It
More informationData Analyst Nanodegree Syllabus
Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working
More informationGPU Accelerated Data Processing Speed of Thought Analytics at Scale
GPU Accelerated Data Processing Speed of Thought Analytics at Scale The benefits of Brytlyt s GPU Accelerated Database Brytlyt is an ultra-high performance database that combines patent pending intellectual
More informationBig Data Architect.
Big Data Architect www.austech.edu.au WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional
More informationData Formats. for Data Science. Valerio Maggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy.
Data Formats for Data Science Valerio Maggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy @leriomaggio About me kidding, that s me!-) Post Doc Researcher @ FBK Complex Data
More informationCloud Computing & Visualization
Cloud Computing & Visualization Workflows Distributed Computation with Spark Data Warehousing with Redshift Visualization with Tableau #FIUSCIS School of Computing & Information Sciences, Florida International
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationIBM Data Virtualization Manager for z/os Leverage data virtualization synergy with API economy to evolve the information architecture on IBM Z
IBM for z/os Leverage data virtualization synergy with API economy to evolve the information architecture on IBM Z IBM z Analytics Agenda Big Data vs. Dark Data Traditional Data Integration Mainframe Data
More informationBusiness Analytics in the Oracle 12.2 Database: Analytic Views. Event: BIWA 2017 Presenter: Dan Vlamis and Cathye Pendley Date: January 31, 2017
Business Analytics in the Oracle 12.2 Database: Analytic Views Event: BIWA 2017 Presenter: Dan Vlamis and Cathye Pendley Date: January 31, 2017 Vlamis Software Solutions Vlamis Software founded in 1992
More informationData Analyst Nanodegree Syllabus
Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working
More informationMaking Sense of Schema-on-Read
YOUR DATA, NO LIMITS Making Sense of Schema-on-Read KENT GRAZIANO Chief Technical Evangelist Snowflake Computing @KentGraziano 1 My Bio Chief Technical Evangelist, Snowflake Computing Oracle ACE Director
More informationLesson 3 Ways of Organising the Data. Chapter-5 L03: "Internet of Things ", Raj Kamal, Publs.: McGraw-Hill Education
Lesson 3 Ways of Organising the Data 1 Data Organising Flat file Spreadsheet Database Relational database (only one scheme) 2 Data Organising Multiple schemas Data SQL Database Not Only SQL or NOSQL Relational
More informationPrototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.org Pete Skomoroch Research Scientist at LinkedIn Consultant at Data Wrangling @peteskomoroch 09/29/09 1 Talk Outline TrendingTopics Overview Wikipedia Page
More informationHadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved
Hadoop 2.x Core: YARN, Tez, and Spark YARN Hadoop Machine Types top-of-rack switches core switch client machines have client-side software used to access a cluster to process data master nodes run Hadoop
More informationEasier than Excel: Social Network Analysis of DocGraph with Gephi
Easier than Excel: Social Network Analysis of DocGraph with Gephi Janos G. Hajagos Stony Brook School of Medicine Fred Trotter fredtrotter.com DocGraph Based on FOIA request to CMS by Fred Trotter Pre-released
More informationBlended Learning Outline: Cloudera Data Analyst Training (171219a)
Blended Learning Outline: Cloudera Data Analyst Training (171219a) Cloudera Univeristy s data analyst training course will teach you to apply traditional data analytics and business intelligence skills
More informationThings Every Oracle DBA Needs to Know about the Hadoop Ecosystem. Zohar Elkayam
Things Every Oracle DBA Needs to Know about the Hadoop Ecosystem Zohar Elkayam www.realdbamagic.com Twitter: @realmgic Who am I? Zohar Elkayam, CTO at Brillix Programmer, DBA, team leader, database trainer,
More informationJupyter and Spark on Mesos: Best Practices. June 21 st, 2017
Jupyter and Spark on Mesos: Best Practices June 2 st, 207 Agenda About me What is Spark & Jupyter Demo How Spark+Mesos+Jupyter work together Experience Q & A About me Graduated from EE @ Tsinghua Univ.
More informationMySQL for Developers Ed 3
Oracle University Contact Us: 0845 777 7711 MySQL for Developers Ed 3 Duration: 5 Days What you will learn This MySQL for Developers training teaches developers how to plan, design and implement applications
More informationGraph and Timeseries Databases
Graph and Timeseries Databases Roman Kern ISDS, TU Graz 2017-10-23 Roman Kern (ISDS, TU Graz) Dbase2 2017-10-23 1 / 31 Graph Databases Graph Databases Motivation and Basics of Graph Databases? Roman Kern
More informationSpecialist ICT Learning
Specialist ICT Learning APPLIED DATA SCIENCE AND BIG DATA ANALYTICS GTBD7 Course Description This intensive training course provides theoretical and technical aspects of Data Science and Business Analytics.
More informationGranting Read-only Access To An Existing Oracle Schema
Granting Read-only Access To An Existing Oracle Schema Oracle recommends that you only grant the ANY privileges to trusted users. Use the IDENTIFIED BY clause to specify a new password for an existing
More informationOracle GoldenGate for Big Data
Oracle GoldenGate for Big Data The Oracle GoldenGate for Big Data 12c product streams transactional data into big data systems in real time, without impacting the performance of source systems. It streamlines
More informationCloud Computing 3. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 3 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationJAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc.
JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric insights, Inc. A REVOLUTION IS HAPPENING... 3! Challenges... 3! Borrowing From The Enterprise BI Stack... 4! Visualization
More informationAn Introduction to Apache Spark
An Introduction to Apache Spark 1 History Developed in 2009 at UC Berkeley AMPLab. Open sourced in 2010. Spark becomes one of the largest big-data projects with more 400 contributors in 50+ organizations
More informationSpotfire Advanced Data Services. Lunch & Learn Tuesday, 21 November 2017
Spotfire Advanced Data Services Lunch & Learn Tuesday, 21 November 2017 CONFIDENTIALITY The following information is confidential information of TIBCO Software Inc. Use, duplication, transmission, or republication
More informationCIB Session 12th NoSQL Databases Structures
CIB Session 12th NoSQL Databases Structures By: Shahab Safaee & Morteza Zahedi Software Engineering PhD Email: safaee.shx@gmail.com, morteza.zahedi.a@gmail.com cibtrc.ir cibtrc cibtrc 2 Agenda What is
More information