Visualisation of Abstract Information

Similar documents
Visualisation of Abstract Information

Information Visualisation

Visualisation : Lecture 1. So what is visualisation? Visualisation

International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani

A Survey And Comparative Analysis Of Data

D B M G Data Base and Data Mining Group of Politecnico di Torino

HOW TO ANALYZE AND UNDERSTAND YOUR NETWORK

Data Representation in Visualisation

Knowledge Engineering and Data Mining. Knowledge engineering has 6 basic phases:

CPSC 340: Machine Learning and Data Mining. Outlier Detection Fall 2018

COMP90049 Knowledge Technologies

Learning Objectives for Data Concept and Visualization

New Trends in Database Systems

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Class #2. Data Models: maps as models of reality, geographical and attribute measurement & vector and raster (and other) data structures

Introduction to Geographic Information Science. Some Updates. Last Lecture 4/6/2017. Geography 4103 / Raster Data and Tesselations.

SECURING THE UK S DIGITAL PROSPERITY. Enabling the joint delivery of the National Cyber Security Strategy's objectives

Scalar Algorithms: Contouring

Volume Illumination, Contouring

LSGI 521: Principles of GIS. Lecture 5: Spatial Data Management in GIS. Dr. Bo Wu

Mining Web Data. Lijun Zhang

Multidimensional Indexing The R Tree

Mining data streams. Irene Finocchi. finocchi/ Intro Puzzles

Lecture 8. Vector Data Analyses. Tomislav Sapic GIS Technologist Faculty of Natural Resources Management Lakehead University

Texture Image Segmentation using FCM

Data mining fundamentals

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Mining Web Data. Lijun Zhang

Previously... contour or image rendering in 2D

INFORMATION-THEORETIC OUTLIER DETECTION FOR LARGE-SCALE CATEGORICAL DATA

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

DATA MINING TRANSACTION

International Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications

(Information) Visualization

Bayesian Learning Networks Approach to Cybercrime Detection

Table of Contents 1 Introduction A Declarative Approach to Entity Resolution... 17

Efficient Orienteering-Route Search over Uncertain Spatial Datasets

11. Image Data Analytics. Jacobs University Visualization and Computer Graphics Lab

CPSC 340: Machine Learning and Data Mining. Outlier Detection Fall 2016

CPSC 340: Machine Learning and Data Mining. Finding Similar Items Fall 2017

automatic digitization. In the context of ever increasing population worldwide and thereby

Visualisation of Temporal Interval Association Rules

Chapter 3: Data Mining:

Final Exam Review Sheet INFO/CSE 100 Spring 2006

Analysing crime data in Maps for Office and ArcGIS Online

NOWPAP. Northwest Pacific Action Plan. United Nations Environment Programme

Contents. Part I Setting the Scene

Computer Information Systems (CIS) CIS 105 Current Operating Systems/Security CIS 101 Introduction to Computers

Lecture #11: The Perceptron

With turing you can: Identify, locate and mitigate the effects of botnets or other malware abusing your infrastructure

Data Mining & Machine Learning F2.4DN1/F2.9DM1

CyberSecurity Situational Awareness Monitoring & Reporting Platform Pharos. Cyber Security Showcase Wednesday, 29 February 2012 Brussels, Belgium

Security in AI. Alex Healing Senior Research Manager BT Applied Research. British Telecommunications plc 2019

TNM093 Tillämpad visualisering och virtuell verklighet. Jimmy Johansson C-Research, Linköping University

Scalar Visualization

Chapter 1: Introduction to Spatial Databases

2. REVIEW OF RELATED RESEARCH WORK. 2.1 Methods of Data Aggregation

Lies, Damned Lies and Statistics Using Data Mining Techniques to Find the True Facts.

Advances in geographic information systems and remote sensing for fisheries and aquaculture

(Refer Slide Time 00:17) Welcome to the course on Digital Image Processing. (Refer Slide Time 00:22)

An open source relational database schema and system for the analysis of large scale spatially-interdependent infrastructure networks

Geodatabase over Taita Hills, Kenya

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

MUTUAL INFORMATION TO INTERPRET THE SEMANTICS OF ANOMALIES IN LINK MINING

Types of Computer Networks. ICS 614: Computer Networks Concepts and Principles 11

MACHINE LEARNING & INTRUSION DETECTION: HYPE OR REALITY?

Who We Are! Natalie Timpone

CYBER THREATS: REAL ESTATE FRAUD ADVISORY COUNCIL

EAT 233/3 GEOGRAPHIC INFORMATION SYSTEM (GIS)

CS246: Mining Massive Datasets Jure Leskovec, Stanford University.

Grid Exemplars: Web mapping in 3D. - Mark Morrison

Defining a Data Mining Task. CSE3212 Data Mining. What to be mined? Or the Approaches. Task-relevant Data. Estimation.

Processing 3D Surface Data

Design and Realization of WCDMA Project Simulation System

Knowledge Discovery and Data Mining

Computer Assisted Image Analysis TF 3p and MN1 5p Lecture 1, (GW 1, )

International Journal of Scientific & Engineering Research, Volume 6, Issue 10, October ISSN

Preservation Network Models: Creating Stable Networks of Information to Ensure the Long Term use of Scientific Data

Lecture 10: An Overview of Spatial Database Development: the project development approach

DATA MINING II - 1DL460

HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery

A Comparative Study of Teaching Forensics at a University Degree Level

Chapter 5: Outlier Detection

Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD

Accessible and Usable How personas could support principle 5 of the Statistical Geospatial Framework. Ekkehard Petri, Maja Islam Eurostat

CYBERBIT P r o t e c t i n g a n e w D i m e n s i o n

NM Resource Geographic Information System Program RGIS

Clustering Analysis based on Data Mining Applications Xuedong Fan

BIG Data How to handle it. Mark Holton, College of Engineering, Swansea University,

Today. Lecture 17: Reality Mining. Last time

Data Models and Data processing in GIS

GeoTemporal Reasoning for the Social Semantic Web

Geometric Techniques. Part 1. Example: Scatter Plot. Basic Idea: Scatterplots. Basic Idea. House data: Price and Number of bedrooms

COMP 465 Special Topics: Data Mining

Research Article Does an Arithmetic Coding Followed by Run-length Coding Enhance the Compression Ratio?

IEEE 2013 JAVA PROJECTS Contact No: KNOWLEDGE AND DATA ENGINEERING

LESSON 13 OVERVIEW OF PROGRAM DEVELOPMENT PHASES

Big Data analytics and Visualization

Legal Informatics, Privacy and Cyber Crime

Textual Integration -- Some Business Examples

Transcription:

Visualisation of Abstract Information Visualisation Lecture 17 Institute for Perception, Action & Behaviour School of Informatics Abstract Information 1

Information Visualisation Previously data with inherent topology 2D / 3D datasets multi-dimensional information Today : Abstract Information constructing topology in otherwise disjoint data measurements for visualisation information visualisation Abstract Information 2

What is Information Visualisation? visualising discrete data with no topology Information with no obvious visual representation Interest in data routinely collected and archived: events in an O/S, financial records, documents, medical records Visualising the non-visual Visual data mining Consumer store loyalty cards (who buys what, when?) Mobile communication (who call who, when?) Internet crime (firewall logs who attacks who, when?) Abstract Information 3

Visualisation of Connectivity Essentially identifying for patterns in connections between multiple objects Example : system failures Example : Fraud Usually multiple causes, or a combination of events Identify for patterns Identify for anomalous or unusual behaviour Connections between fraudsters humans poor at numerical analysis, use visual analysis Abstract Information 4

Example : fraud detection! Raw information on transaction visualised in a graphical format anomalies easily apparent Abstract Information 5

Connectivity : netmap Popular tool for information visualisation Concept: display data items around a 2D circle with n attributes displayed next to each other draw dissecting lines to indicate connections between attributes lines form graph indicating strength and correlations of connections represent relationships in the attributes of the information Abstract Information 6

Example : insurance data Detecting insurance fraud 12 months insurance claim data http://www.netmapanalytics.co m/technical/fraud_crime_de mo.pdf. Same example, larger scale. Abstract Information 7

Example : insurance data The results of a clustering algorithm. A normal cluster links 1 person, address and claim. Abstract Information 8

Example : insurance data Visualising a cluster in linear form (top). 6 steps of connectivity, 10 people linked (no third parties like lawyers). Links between clusters at bottom. Abstract Information 9

Example : insurance data All the information relating to Mr Verman is displayed anticlockwise and in clusters shows one person linked to multiple claims! The real Mr Verman is in jail Abstract Information 10

Connectivity : Daisy Variant on netmap concept Displays quantities as well in a bargraph form. www.daisy2000.com Visualisation of calls made by a fax machine. Triangle indicates large number of calls which timed out. As a result a fault was identified. Abstract Information 11

Connectivity : Daisy What does the visualisation tell us? the green shows the Number of Incidents the cyan the Total Delay in Minutes Many delays on the Jubilee line on Mondays this was due to a station being upgraded. Most delays are essentially random Northern line is particularly bad Abstract Information 12

Visualising Connectivity??? Abstract Information 13

Visualising Connectivity??? Abstract Information 14

Visualising Connectivity 1933 London Underground Map N.B. Beck used colour mapping in 1933! Here the connectivity is important to the user design : H. Beck 1933 (electrical draftsman) Abstract Information 15

Visualising Connectivity 73 years later! Beck's design : topology of the data preserved but geographically inaccurate topology represents the connectivity (thus it is maintained in the design) Abstract Information 16

Document Visualisation Motivation: Action Units of Information transfer Typing at 10 bytes per second 1 Mouse Operations. 2 Reading 3-40 Hearing 60 Visualisation and Pattern Recognition 12,500 visualisation is considerably faster than hearing / reading Source : Silicon Graphics Inc. Abstract Information 17

Visualisation of Documents Motivation : large bandwidth of human visual system 100s millions of documents available on-line information only in textual form Visualising the non-visual searching for scientific papers analysing witness statements awareness of events in news bulletins Concerned with searching and visualising occurrences of query words Abstract Information 18

Document Visualisation - Stages Query keywords from user specification comparison to sample reference document Representation of results form high-dimensional vector (one for each word, ~10000+) cluster documents based on vector similarity (e.g. Nearest-Neighbour) Visualisation of clustered results projection to lower dimensional space 2D galaxy / theme-scape / 1D theme-river Abstract Information 19

2D and 3D projections of documents 3D Visualisation of 567,000 cancer literature abstracts. Articles in a collection of news items (2D). Pacific Northwest National Laboratory. Abstract Information 20

1D visualisation of news articles A Theme River shows the relative importance of themes over the course of a year from press articles. Pacific Northwest National Laboratory. Abstract Information 21

Document Querying We are interested in distribution of keywords in the document related articles to the keyword entered Title bar scheme (Hearst 1995) display a list of documents with a title bar title bar shows the occurrence of keywords in document Abstract Information 22

Title Bar Method User query Cancer Prevention Research Columns represent paragraphs or pages in a document. Shade indicates relevance shown by word occurrence. Visualisation - Use of document topology / colour-mapping Abstract Information 23

Example : Title Bar Query / Result Query terms: DBMS (Database Systems) Reliability What roles do they play in retrieved documents? Mainly about both DBMS & reliability Mainly about DBMS, discusses reliability Mainly about, say, banking, with a subtopic discussion on DBMS/Reliability Mainly about something different Marti Hearst SIMS 247 Abstract Information 24

GIS : Geo-Spatial Visualisation visualisation of information clustered or placed relative to geographic location Increasing Area increasing availability of data & cheap compute power using fundamentals of computer based visualisation colour-mapping Abstract Information 25

Summary Visualisation of Connectivity Document Visualisation keyword query, visualisation of document relevance Geospatial Visualisation GIS identify complex relationships visual data mining representation of data with geographical interpretation Abstract Information 26