XML Miner, XML Rule and Metarule.

Size: px
Start display at page:

Download "XML Miner, XML Rule and Metarule."

Transcription

1 XML Miner, XML Rule and Metarule. Andrew N Edmonds Scientio inc. Wednesday, 30 May 2001 Version 1 Introduction... 1 Knowledge representation... 1 Data representation... 3 Data Location... 4 Fuzzy Logic... 4 Fuzzy sets... 5 Tools that use Metarule... 5 XML Miner... 5 XML Rule... 6 Utilities... 7 Strucfind... 7 Metarule.xsl... 7 Introduction XML is rapidly becoming the Lingua Franca of the Internet age. Using it you can communicate data between disparate computers running disparate software in a format that is recognisable by a wide range of tools. The kinds of data that can be communicated are not just boring old records and fields, but structured hierarchical data capable of representing not just the individual data items, but the relationships between data items. XML encodes data in tree form, and trees match the internal topology of a wide variety of data types that could hitherto only be encoded and transmitted via a proprietary format, or by bludgeoning them into field/record format. As new technologies develop they need tools. The first generation of these have been XML editors, schema editors, and various objects that work on XSL transforms. The next generation of tools will permit the analysis of data in XML, and we proudly present XML Miner as the first of these. XML Miner is Artificial Intelligence and Data-mining for XML. With XML miner and the associated runtime, XML Rule you can analyse, filter, categorise and predict XML data. Wherever data streams between two systems (or inside one system) in XML, XML Miner can extract information and make decisions and predictions that cope with incomplete data, vagueness, and the inherent tree structured nature of XML. Knowledge representation A fundamental part of Artificial Intelligence and Data mining is the representation of knowledge. AI is concerned, amongst other things, with encapsulating human knowledge in a form that computers can use; Data mining is concerned with extracting knowledge from data in a way that humans, and sometimes computers, can read. Scientio inc. has developed a knowledge representation framework called

2 Metarule that services both these needs. Unsurprisingly perhaps, Metarule is a dialect of XML. Because this is the case, Metarule can be transformed with XSL, edited with XML editors and so on. Like XML it is not processor or operating system specific. The first and central premise of Metarule is that the majority of useful knowledge can be represented with rules of 3 simple forms. The degree to which the previous statement is true is extremely hard to quantify. It is fundamentally a philosophical question, but our experience so far is that the application of these forms is very wide indeed. The second premise is that in order to represent, quantify, and handle uncertainty, the logic used should not be Boolean logic, but the superset of Boolean logic that also encompasses uncertainty, Fuzzy Logic. Anyone as yet unconverted to the marvels of fuzzy logic should not worry: if you restrict reasoning to just truth and falsehood, fuzzy logic behaves identically to Boolean logic. Here s a fragment of Metarule and the English language equivalent: <rule> </rule> <if/> <and> <and> </and> <and> </and> <input>petal_width</input> <input>sepal_width</input> <input>petal_length</input> <input>sepal_length</input> </and> <then/> <output>class</output> <willbe/> <category>iris-versicolor</category> <confidence>0.875</confidence> If petal width is small and sepal width is small and petal length is small and sepal length is small then class will be Iris-versicolor. Converting from Metarule to English in HTML is simply performed using an XSL transform which we supply with our products. The three rule types are: 1) If <conditions> then output will be <category>. Used for classification. 2) If <conditions> then output will be <fuzzy set>. Used for prediction.

3 3) If <conditions> then output will be <arithmetic expression>. Used to encapsulate mathematical knowledge. Rules are seldom used singly; a collection is termed a rule set. Data representation Individual data items occur in many primitive types, such as integers strings floats, etc. Metarule is concerned with two different fundamental data types, categorical or numeric. In general we can think of the things in the world as having quantities and / or qualities. Categorical variables hold qualities, numeric ones quantities. For instance male and female are categorical, 0.37metres is numeric. One might assume that any data item can easily be classified into one or the other, but this is not always true. Numbers are often used to represent categories for instance, as in Close encounters of the third kind.! In Metarule you can specifically differentiate between these two data types. Each Metarule document contains a section detailing the inputs and the outputs of the contained rule set. For instance: <outputlist> <outputspec> <output>class</output> <categorical> <category>iris-setosa</category> <category>iris-versicolor</category> <category>iris-virginica</category> </categorical> <execorder>1</execorder> </outputspec> </outputlist> Describes an output with three categories, while the following describes a numeric input defined by three fuzzy sets.

4 <inputspec> <input>sepal_length</input> <numeric> <setdefinition> <set settype = "extremely small"/> <lower>4.3</lower> <middle>5</middle> <upper>5.8</upper> </setdefinition> <setdefinition> <set settype = "very small"/> <lower>5</lower> <middle>5.8</middle> <upper>6.7</upper> </setdefinition> <setdefinition> <lower>5.8</lower> <middle>6.7</middle> <upper>7.9</upper> </setdefinition> </numeric> </inputspec> Data Location In relational databases it s fairly simple to locate a data item using indexes, ids, row numbers and field names. In XML, since the data is tree structured, it is more difficult. The inventors of XML thought of this however and invented XPath as the method to navigate round XML and identify nodes in the true, singly or in groups. All input and output variable names in Metarule can be XPath strings. Furthermore in XML information can be stored either as text between XML elements, for instance <height>1.84</height>, or as attributes: <wardrobe height= 1.84 ></wardrobe>. Metarule uses XPath to permit data in either form to be described and analyzed. Fuzzy Logic Since its invention in 1965 by Lotfi Zadeh, this extension to conventional logic has grown to be a very large field of study all on its own. The key concept is that the often-uncertain state of our knowledge of the universe is best described by a logic that permits degrees of truth. This compares to the traditional statistical view of describing the validity of assertions in terms of probability. Fuzzy logic generates tractable descriptions of uncertainty; probabilistic descriptions are notoriously intractable. In conventional logic it is standard to represent truth with the value 1 and falsehood with 0. Fuzzy logic permits truth to be represented by any real number between and including 0 and 1. Each element of conventional logic has an analogue in fuzzy logic, so there are fuzzy logic operators like and, or, not, that each behave like the conventional version if only 1s and 0s are used; there are fuzzy implication operators and fuzzy sets.

5 Fuzzy sets Fuzzy sets are really important for dealing with numeric variables. In speech we use a variety of quantifiers like big, small, large etc. and when asked how to perform some task we tend to say things like: When I m going too fast I take my foot of the gas or When I m too close to the car in front I put my foot on the brake. Where too fast and too close are vague quantifiers. An important insight in the early days of fuzzy logic was that these linguistic quantifiers corresponded to sets of data values. These sets do not have fixed boundaries, in the normal sense of a set, they have gradual boundaries, and it is possible for the current state of a variable to be in more than one of them. These sets were termed fuzzy sets. The next insight was that these sets could be modelled fairly easily, and once they were modelled it was not too difficult to take a set of instructions such as those above and turn them into a working control system. The above is an example of some fuzzy sets defined for the variable male height. As can be seen, someone who is 5 9 would be deemed by these sets to be 0.5 tall, and 0.5 medium. Tools that use Metarule Metarule would be only of academic interest if tools did not exist that both read and write Metarule data. Scientio have several tools to generate Metarule from a data source, and to allow the evaluation of knowledge stored in Metarule format. XML Miner This COM object takes as input an XML string or URL pointing to an XML data source, and uses fuzzy rule induction to create a Metarule description of the data set. The user must provide an XPath expression selecting the parent nodes of the data items to mine/analyse, and XPath expressions relative to that node locating the input values (those that are to be used in the analysis) and the output value (the leaf node that is to be predicted or classified). The user can optionally supply the type of the inputs and output (categorical/numeric) and the set granularity. XML Miner reads the input data, identifies the inputs and outputs, performs consistency checks, performs a statistical analysis of the data in each input or output and then creates a Fuzzy decision tree before converting the result to a set of rules in Metarule format.

6 The rules that XML Miner generates will predict or classify the output value with a margin of error. There is a trade off between that error and the complexity of the rule set produced, and yet another trade off between the complexity of the rule set and the performance of the prediction on new data that is not part of the training set. XML miner can be told to set part of the training set aside as a test set and supplies the performance for both test and training set as a percentage correct, for categorical outputs, or root-mean-square error for numeric. For batch or automated processing the definitions of the inputs or outputs can be presented in another XML string or file. XML Rule Given a Metarule encoded rule set we can make use of the knowledge contained using XML Rule. This is a COM object that acts as the Metarule runtime package.

7 XML Rule exposes two collections, inputs and outputs, the number, type and name of which are determined by the contents of the Metarule rule set. To use XML Rule you locate each input, set the inputs value, and call the only method, evaluate. The outputs collection will contain the predicted/classified values. Utilities Two utilities are provided with XML Miner. Strucfind This is a control that displays the structure of an XML document in tree form. Each element type in the source document appears only once in the strucfind tree, no matter how many times it occurs in the source. The result is very much like a graphical display of the schema for the source document, except that optional elements defined in the schema that do not appear in the data set also do not appear in the tree. Using strucfind it is easy to select nodes for training, inputs and an output. Strucfind produces a definition file that can be loaded directly into XML Miner. Metarule.xsl This XSL style sheet converts Metarule documents to HTML form for display. The rule set and the input and output definitions are displayed.

Introduction. Aleksandar Rakić Contents

Introduction. Aleksandar Rakić Contents Beograd ETF Fuzzy logic Introduction Aleksandar Rakić rakic@etf.rs Contents Definitions Bit of History Fuzzy Applications Fuzzy Sets Fuzzy Boundaries Fuzzy Representation Linguistic Variables and Hedges

More information

Classification with Diffuse or Incomplete Information

Classification with Diffuse or Incomplete Information Classification with Diffuse or Incomplete Information AMAURY CABALLERO, KANG YEN Florida International University Abstract. In many different fields like finance, business, pattern recognition, communication

More information

Why Fuzzy? Definitions Bit of History Component of a fuzzy system Fuzzy Applications Fuzzy Sets Fuzzy Boundaries Fuzzy Representation

Why Fuzzy? Definitions Bit of History Component of a fuzzy system Fuzzy Applications Fuzzy Sets Fuzzy Boundaries Fuzzy Representation Contents Why Fuzzy? Definitions Bit of History Component of a fuzzy system Fuzzy Applications Fuzzy Sets Fuzzy Boundaries Fuzzy Representation Linguistic Variables and Hedges INTELLIGENT CONTROLSYSTEM

More information

CPS331 Lecture: Fuzzy Logic last revised October 11, Objectives: 1. To introduce fuzzy logic as a way of handling imprecise information

CPS331 Lecture: Fuzzy Logic last revised October 11, Objectives: 1. To introduce fuzzy logic as a way of handling imprecise information CPS331 Lecture: Fuzzy Logic last revised October 11, 2016 Objectives: 1. To introduce fuzzy logic as a way of handling imprecise information Materials: 1. Projectable of young membership function 2. Projectable

More information

Fuzzy Reasoning. Outline

Fuzzy Reasoning. Outline Fuzzy Reasoning Outline Introduction Bivalent & Multivalent Logics Fundamental fuzzy concepts Fuzzification Defuzzification Fuzzy Expert System Neuro-fuzzy System Introduction Fuzzy concept first introduced

More information

Machine Learning Chapter 2. Input

Machine Learning Chapter 2. Input Machine Learning Chapter 2. Input 2 Input: Concepts, instances, attributes Terminology What s a concept? Classification, association, clustering, numeric prediction What s in an example? Relations, flat

More information

Lecture notes. Com Page 1

Lecture notes. Com Page 1 Lecture notes Com Page 1 Contents Lectures 1. Introduction to Computational Intelligence 2. Traditional computation 2.1. Sorting algorithms 2.2. Graph search algorithms 3. Supervised neural computation

More information

Machine Learning: Algorithms and Applications Mockup Examination

Machine Learning: Algorithms and Applications Mockup Examination Machine Learning: Algorithms and Applications Mockup Examination 14 May 2012 FIRST NAME STUDENT NUMBER LAST NAME SIGNATURE Instructions for students Write First Name, Last Name, Student Number and Signature

More information

FUZZY INFERENCE SYSTEMS

FUZZY INFERENCE SYSTEMS CHAPTER-IV FUZZY INFERENCE SYSTEMS Fuzzy inference is the process of formulating the mapping from a given input to an output using fuzzy logic. The mapping then provides a basis from which decisions can

More information

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A.

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A. Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Input: Concepts, instances, attributes Terminology What s a concept?

More information

Input: Concepts, Instances, Attributes

Input: Concepts, Instances, Attributes Input: Concepts, Instances, Attributes 1 Terminology Components of the input: Concepts: kinds of things that can be learned aim: intelligible and operational concept description Instances: the individual,

More information

A Systematic Overview of Data Mining Algorithms. Sargur Srihari University at Buffalo The State University of New York

A Systematic Overview of Data Mining Algorithms. Sargur Srihari University at Buffalo The State University of New York A Systematic Overview of Data Mining Algorithms Sargur Srihari University at Buffalo The State University of New York 1 Topics Data Mining Algorithm Definition Example of CART Classification Iris, Wine

More information

CITS5501 Software Testing and Quality Assurance Formal methods

CITS5501 Software Testing and Quality Assurance Formal methods CITS5501 Software Testing and Quality Assurance Formal methods Unit coordinator: Arran Stewart May 1, 2018 1 / 49 Sources Pressman, R., Software Engineering: A Practitioner s Approach, McGraw-Hill, 2005

More information

DATA MINING INTRODUCTION TO CLASSIFICATION USING LINEAR CLASSIFIERS

DATA MINING INTRODUCTION TO CLASSIFICATION USING LINEAR CLASSIFIERS DATA MINING INTRODUCTION TO CLASSIFICATION USING LINEAR CLASSIFIERS 1 Classification: Definition Given a collection of records (training set ) Each record contains a set of attributes and a class attribute

More information

6.001 Notes: Section 1.1

6.001 Notes: Section 1.1 6.001 Notes: Section 1.1 Slide 1.1.1 This first thing we need to do is discuss the focus of 6.001. What is this course all about? This seems quite obvious -- this is a course about computer science. But

More information

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A.

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A. Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Output: Knowledge representation Tables Linear models Trees Rules

More information

Fuzzy Logic. Sourabh Kothari. Asst. Prof. Department of Electrical Engg. Presentation By

Fuzzy Logic. Sourabh Kothari. Asst. Prof. Department of Electrical Engg. Presentation By Fuzzy Logic Presentation By Sourabh Kothari Asst. Prof. Department of Electrical Engg. Outline of the Presentation Introduction What is Fuzzy? Why Fuzzy Logic? Concept of Fuzzy Logic Fuzzy Sets Membership

More information

Styles, Style Sheets, the Box Model and Liquid Layout

Styles, Style Sheets, the Box Model and Liquid Layout Styles, Style Sheets, the Box Model and Liquid Layout This session will guide you through examples of how styles and Cascading Style Sheets (CSS) may be used in your Web pages to simplify maintenance of

More information

FUZZY LOGIC TECHNIQUES. on random processes. In such situations, fuzzy logic exhibits immense potential for

FUZZY LOGIC TECHNIQUES. on random processes. In such situations, fuzzy logic exhibits immense potential for FUZZY LOGIC TECHNIQUES 4.1: BASIC CONCEPT Problems in the real world are quite often very complex due to the element of uncertainty. Although probability theory has been an age old and effective tool to

More information

Data Mining Practical Machine Learning Tools and Techniques

Data Mining Practical Machine Learning Tools and Techniques Input: Concepts, instances, attributes Data ining Practical achine Learning Tools and Techniques Slides for Chapter 2 of Data ining by I. H. Witten and E. rank Terminology What s a concept z Classification,

More information

Lecture 1: Introduction

Lecture 1: Introduction Lecture 1: Introduction Doina Precup With many thanks to Prakash Panagaden and Mathieu Blanchette January 6, 2014 1 What is an algorithm? First of all, welcome to this course! Our goal will be to introduce

More information

Basic Concepts Weka Workbench and its terminology

Basic Concepts Weka Workbench and its terminology Changelog: 14 Oct, 30 Oct Basic Concepts Weka Workbench and its terminology Lecture Part Outline Concepts, instances, attributes How to prepare the input: ARFF, attributes, missing values, getting to know

More information

ARTIFICIAL INTELLIGENCE. Uncertainty: fuzzy systems

ARTIFICIAL INTELLIGENCE. Uncertainty: fuzzy systems INFOB2KI 2017-2018 Utrecht University The Netherlands ARTIFICIAL INTELLIGENCE Uncertainty: fuzzy systems Lecturer: Silja Renooij These slides are part of the INFOB2KI Course Notes available from www.cs.uu.nl/docs/vakken/b2ki/schema.html

More information

Fuzzy Sets and Systems. Lecture 1 (Introduction) Bu- Ali Sina University Computer Engineering Dep. Spring 2010

Fuzzy Sets and Systems. Lecture 1 (Introduction) Bu- Ali Sina University Computer Engineering Dep. Spring 2010 Fuzzy Sets and Systems Lecture 1 (Introduction) Bu- Ali Sina University Computer Engineering Dep. Spring 2010 Fuzzy sets and system Introduction and syllabus References Grading Fuzzy sets and system Syllabus

More information

Neural Networks Lesson 9 - Fuzzy Logic

Neural Networks Lesson 9 - Fuzzy Logic Neural Networks Lesson 9 - Prof. Michele Scarpiniti INFOCOM Dpt. - Sapienza University of Rome http://ispac.ing.uniroma1.it/scarpiniti/index.htm michele.scarpiniti@uniroma1.it Rome, 26 November 2009 M.

More information

A Decision-Theoretic Rough Set Model

A Decision-Theoretic Rough Set Model A Decision-Theoretic Rough Set Model Yiyu Yao and Jingtao Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao,jtyao}@cs.uregina.ca Special Thanks to Professor

More information

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis Informatics 1: Data & Analysis Lecture 9: Trees and XML Ian Stark School of Informatics The University of Edinburgh Tuesday 11 February 2014 Semester 2 Week 5 http://www.inf.ed.ac.uk/teaching/courses/inf1/da

More information

CHAPTER 3 FUZZY RULE BASED MODEL FOR FAULT DIAGNOSIS

CHAPTER 3 FUZZY RULE BASED MODEL FOR FAULT DIAGNOSIS 39 CHAPTER 3 FUZZY RULE BASED MODEL FOR FAULT DIAGNOSIS 3.1 INTRODUCTION Development of mathematical models is essential for many disciplines of engineering and science. Mathematical models are used for

More information

A Neuro-Fuzzy Classifier and Its Applications

A Neuro-Fuzzy Classifier and Its Applications A Neuro-Fuzzy Classifier and Its Applications Chuen-Tsai Sun Department of Computer and Information Science National Chiao Tung University Hsinchu, Taiwan 30050 E-mail: ctsun@webercisnctuedutw Jyh-Shing

More information

Boolean expressions. Elements of Programming Languages. Conditionals. What use is this?

Boolean expressions. Elements of Programming Languages. Conditionals. What use is this? Boolean expressions Elements of Programming Languages Lecture 3: Booleans, conditionals, and types James Cheney So far we ve considered only a trivial arithmetic language L Arith Let s extend L Arith with

More information

Data Mining Practical Machine Learning Tools and Techniques

Data Mining Practical Machine Learning Tools and Techniques Output: Knowledge representation Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter of Data Mining by I. H. Witten and E. Frank Decision tables Decision trees Decision rules

More information

Binary Representation. Jerry Cain CS 106AJ October 29, 2018 slides courtesy of Eric Roberts

Binary Representation. Jerry Cain CS 106AJ October 29, 2018 slides courtesy of Eric Roberts Binary Representation Jerry Cain CS 106AJ October 29, 2018 slides courtesy of Eric Roberts Once upon a time... Claude Shannon Claude Shannon was one of the pioneers who shaped computer science in its early

More information

Content Management for the Defense Intelligence Enterprise

Content Management for the Defense Intelligence Enterprise Gilbane Beacon Guidance on Content Strategies, Practices and Technologies Content Management for the Defense Intelligence Enterprise How XML and the Digital Production Process Transform Information Sharing

More information

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset. Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied

More information

DATA ANALYSIS I. Types of Attributes Sparse, Incomplete, Inaccurate Data

DATA ANALYSIS I. Types of Attributes Sparse, Incomplete, Inaccurate Data DATA ANALYSIS I Types of Attributes Sparse, Incomplete, Inaccurate Data Sources Bramer, M. (2013). Principles of data mining. Springer. [12-21] Witten, I. H., Frank, E. (2011). Data Mining: Practical machine

More information

Goals: Define the syntax of a simple imperative language Define a semantics using natural deduction 1

Goals: Define the syntax of a simple imperative language Define a semantics using natural deduction 1 Natural Semantics Goals: Define the syntax of a simple imperative language Define a semantics using natural deduction 1 1 Natural deduction is an instance of first-order logic; that is, it is the formal

More information

Chapter 4 Fuzzy Logic

Chapter 4 Fuzzy Logic 4.1 Introduction Chapter 4 Fuzzy Logic The human brain interprets the sensory information provided by organs. Fuzzy set theory focus on processing the information. Numerical computation can be performed

More information

Machine Learning - Clustering. CS102 Fall 2017

Machine Learning - Clustering. CS102 Fall 2017 Machine Learning - Fall 2017 Big Data Tools and Techniques Basic Data Manipulation and Analysis Performing well-defined computations or asking well-defined questions ( queries ) Data Mining Looking for

More information

CISC 4631 Data Mining

CISC 4631 Data Mining CISC 4631 Data Mining Lecture 03: Introduction to classification Linear classifier Theses slides are based on the slides by Tan, Steinbach and Kumar (textbook authors) Eamonn Koegh (UC Riverside) 1 Classification:

More information

k Nearest Neighbors Super simple idea! Instance-based learning as opposed to model-based (no pre-processing)

k Nearest Neighbors Super simple idea! Instance-based learning as opposed to model-based (no pre-processing) k Nearest Neighbors k Nearest Neighbors To classify an observation: Look at the labels of some number, say k, of neighboring observations. The observation is then classified based on its nearest neighbors

More information

Unit V. Neural Fuzzy System

Unit V. Neural Fuzzy System Unit V Neural Fuzzy System 1 Fuzzy Set In the classical set, its characteristic function assigns a value of either 1 or 0 to each individual in the universal set, There by discriminating between members

More information

Session Booklet Social Media & Facebook

Session Booklet Social Media & Facebook Session Booklet Social Media & Facebook Social networking refers to the use of online social networks such as Facebook to communicate with other people. A social network can include blogs and other ways

More information

Mobile Computing Professor Pushpendra Singh Indraprastha Institute of Information Technology Delhi Java Basics Lecture 02

Mobile Computing Professor Pushpendra Singh Indraprastha Institute of Information Technology Delhi Java Basics Lecture 02 Mobile Computing Professor Pushpendra Singh Indraprastha Institute of Information Technology Delhi Java Basics Lecture 02 Hello, in this lecture we will learn about some fundamentals concepts of java.

More information

Author Prediction for Turkish Texts

Author Prediction for Turkish Texts Ziynet Nesibe Computer Engineering Department, Fatih University, Istanbul e-mail: admin@ziynetnesibe.com Abstract Author Prediction for Turkish Texts The main idea of authorship categorization is to specify

More information

The New Document Digital Polymorphic Ubiquitous Actionable Patrick P. Bergmans University of Ghent

The New Document Digital Polymorphic Ubiquitous Actionable Patrick P. Bergmans University of Ghent X X The New Document Digital Polymorphic Ubiquitous Actionable Patrick P. Bergmans University of Ghent The Traditional Document Documents have been around for thousands of years The Bible is a document

More information

HTML and CSS COURSE SYLLABUS

HTML and CSS COURSE SYLLABUS HTML and CSS COURSE SYLLABUS Overview: HTML and CSS go hand in hand for developing flexible, attractively and user friendly websites. HTML (Hyper Text Markup Language) is used to show content on the page

More information

Neural Network Exchange Format

Neural Network Exchange Format Copyright Khronos Group 2017 - Page 1 Neural Network Exchange Format Deploying Trained Networks to Inference Engines Viktor Gyenes, specification editor Copyright Khronos Group 2017 - Page 2 Outlook The

More information

Representing structural patterns: Reading Material: Chapter 3 of the textbook by Witten

Representing structural patterns: Reading Material: Chapter 3 of the textbook by Witten Representing structural patterns: Plain Classification rules Decision Tree Rules with exceptions Relational solution Tree for Numerical Prediction Instance-based presentation Reading Material: Chapter

More information

KS3 Progression Map: Number

KS3 Progression Map: Number KS3 Progression Map: Number This progression map expands upon the statements of subject content in the DfE document Mathematics programmes of study: Key Stage 3 published September 2013. Suggested allocation

More information

Introduction... ix. Chapter 1: Exploring Fundamental Programming Concepts... 1

Introduction... ix. Chapter 1: Exploring Fundamental Programming Concepts... 1 Table of Contents Introduction... ix Chapter 1: Exploring Fundamental Programming Concepts... 1 1.1 Exploring the Editors... 2 History of Editors... 2 Exploring the Types of Text Editors... 3 Describing

More information

Performance Evaluation of Various Classification Algorithms

Performance Evaluation of Various Classification Algorithms Performance Evaluation of Various Classification Algorithms Shafali Deora Amritsar College of Engineering & Technology, Punjab Technical University -----------------------------------------------------------***----------------------------------------------------------

More information

CS 354R: Computer Game Technology

CS 354R: Computer Game Technology CS 354R: Computer Game Technology AI Fuzzy Logic and Neural Nets Fall 2018 Fuzzy Logic Philosophical approach Decisions based on degree of truth Is not a method for reasoning under uncertainty that s probability

More information

CPSC 340: Machine Learning and Data Mining. Probabilistic Classification Fall 2017

CPSC 340: Machine Learning and Data Mining. Probabilistic Classification Fall 2017 CPSC 340: Machine Learning and Data Mining Probabilistic Classification Fall 2017 Admin Assignment 0 is due tonight: you should be almost done. 1 late day to hand it in Monday, 2 late days for Wednesday.

More information

JOB SCHEDULER JOB DOCUMENTATION EDITOR

JOB SCHEDULER JOB DOCUMENTATION EDITOR Job Scheduler - Job Documentation Editor JOB SCHEDULER JOB DOCUMENTATION EDITOR Instructions for Creating Job Documentation August 2006 Software- und Organisations-Service GmbH Giesebrechtstr. 15 D-10629

More information

Chapter 2 XML, XML Schema, XSLT, and XPath

Chapter 2 XML, XML Schema, XSLT, and XPath Summary Chapter 2 XML, XML Schema, XSLT, and XPath Ryan McAlister XML stands for Extensible Markup Language, meaning it uses tags to denote data much like HTML. Unlike HTML though it was designed to carry

More information

AP Computer Science 4325

AP Computer Science 4325 4325 Instructional Unit Algorithm Design Techniques -divide-and-conquer The students will be -Decide whether an algorithm -classroom discussion -backtracking able to classify uses divide-and-conquer, -worksheets

More information

Collage: A Declarative Programming Model for Compositional Development and Evolution of Cross-Organizational Applications

Collage: A Declarative Programming Model for Compositional Development and Evolution of Cross-Organizational Applications Collage: A Declarative Programming Model for Compositional Development and Evolution of Cross-Organizational Applications Bruce Lucas, IBM T J Watson Research Center (bdlucas@us.ibm.com) Charles F Wiecha,

More information

Exam Advanced Data Mining Date: Time:

Exam Advanced Data Mining Date: Time: Exam Advanced Data Mining Date: 11-11-2010 Time: 13.30-16.30 General Remarks 1. You are allowed to consult 1 A4 sheet with notes written on both sides. 2. Always show how you arrived at the result of your

More information

4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency 4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

More information

Data Mining: Exploring Data

Data Mining: Exploring Data Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar But we start with a brief discussion of the Friedman article and the relationship between Data

More information

Stats 170A: Project in Data Science Exploratory Data Analysis: Clustering Algorithms

Stats 170A: Project in Data Science Exploratory Data Analysis: Clustering Algorithms Stats 170A: Project in Data Science Exploratory Data Analysis: Clustering Algorithms Padhraic Smyth Department of Computer Science Bren School of Information and Computer Sciences University of California,

More information

Introduction to Fuzzy Logic and Fuzzy Systems Adel Nadjaran Toosi

Introduction to Fuzzy Logic and Fuzzy Systems Adel Nadjaran Toosi Introduction to Fuzzy Logic and Fuzzy Systems Adel Nadjaran Toosi Fuzzy Slide 1 Objectives What Is Fuzzy Logic? Fuzzy sets Membership function Differences between Fuzzy and Probability? Fuzzy Inference.

More information

Which Role for an Ontology of Uncertainty?

Which Role for an Ontology of Uncertainty? Which Role for an Ontology of Uncertainty? Paolo Ceravolo, Ernesto Damiani, Marcello Leida Dipartimento di Tecnologie dell Informazione - Università degli studi di Milano via Bramante, 65-26013 Crema (CR),

More information

From the Grade 8, High School, Functions Progression Document, pp. 7-8:

From the Grade 8, High School, Functions Progression Document, pp. 7-8: Time Frame: Approximately 2-3 weeks Connections to Previous Learning: Students use their knowledge of expressions and equations to evaluate functions and to rewrite when necessary. They also use rigid

More information

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs DATA PRESENTATION At the end of the chapter, you will learn to: Present data in textual form Construct different types of table and graphs Identify the characteristics of a good table and graph Identify

More information

MITOCW watch?v=kz7jjltq9r4

MITOCW watch?v=kz7jjltq9r4 MITOCW watch?v=kz7jjltq9r4 PROFESSOR: We're going to look at the most fundamental of all mathematical data types, namely sets, and let's begin with the definitions. So informally, a set is a collection

More information

Nesnelerin İnternetinde Veri Analizi

Nesnelerin İnternetinde Veri Analizi Nesnelerin İnternetinde Veri Analizi Bölüm 3. Classification in Data Streams w3.gazi.edu.tr/~suatozdemir Supervised vs. Unsupervised Learning (1) Supervised learning (classification) Supervision: The training

More information

Chapter 3 Analyzing Normal Quantitative Data

Chapter 3 Analyzing Normal Quantitative Data Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing

More information

A Systematic Overview of Data Mining Algorithms

A Systematic Overview of Data Mining Algorithms A Systematic Overview of Data Mining Algorithms 1 Data Mining Algorithm A well-defined procedure that takes data as input and produces output as models or patterns well-defined: precisely encoded as a

More information

COSC 6397 Big Data Analytics. Fuzzy Clustering. Some slides based on a lecture by Prof. Shishir Shah. Edgar Gabriel Spring 2015.

COSC 6397 Big Data Analytics. Fuzzy Clustering. Some slides based on a lecture by Prof. Shishir Shah. Edgar Gabriel Spring 2015. COSC 6397 Big Data Analytics Fuzzy Clustering Some slides based on a lecture by Prof. Shishir Shah Edgar Gabriel Spring 215 Clustering Clustering is a technique for finding similarity groups in data, called

More information

2. You are required to enter a password of up to 100 characters. The characters must be lower ASCII, printing characters.

2. You are required to enter a password of up to 100 characters. The characters must be lower ASCII, printing characters. BLACK BOX SOFTWARE TESTING SPRING 2005 DOMAIN TESTING LAB PROJECT -- GRADING NOTES For all of the cases below, do the traditional equivalence class and boundary analysis. Draw one table and use a new line

More information

Diagonalization. The cardinality of a finite set is easy to grasp: {1,3,4} = 3. But what about infinite sets?

Diagonalization. The cardinality of a finite set is easy to grasp: {1,3,4} = 3. But what about infinite sets? Diagonalization Cardinalities The cardinality of a finite set is easy to grasp: {1,3,4} = 3. But what about infinite sets? We say that a set S has at least as great cardinality as set T, written S T, if

More information

Introduction to Automata Theory. BİL405 - Automata Theory and Formal Languages 1

Introduction to Automata Theory. BİL405 - Automata Theory and Formal Languages 1 Introduction to Automata Theory BİL405 - Automata Theory and Formal Languages 1 Automata, Computability and Complexity Automata, Computability and Complexity are linked by the question: What are the fundamental

More information

CITS3001. Algorithms, Agents and Artificial Intelligence. Semester 2, 2016

CITS3001. Algorithms, Agents and Artificial Intelligence. Semester 2, 2016 CITS3001 Algorithms, Agents and Artificial Intelligence Semester 2, 2016 Tim French School of Computer Science & Software Eng. The University of Western Australia 2. Review of algorithmic concepts CLRS,

More information

Elliotte Rusty Harold August From XML to Flat Buffers: Markup in the Twenty-teens

Elliotte Rusty Harold August From XML to Flat Buffers: Markup in the Twenty-teens Elliotte Rusty Harold elharo@ibiblio.org August 2018 From XML to Flat Buffers: Markup in the Twenty-teens Warning! The Contenders XML JSON YAML EXI Protobufs Flat Protobufs XML JSON YAML EXI Protobuf Flat

More information

Lotfi Zadeh (professor at UC Berkeley) wrote his original paper on fuzzy set theory. In various occasions, this is what he said

Lotfi Zadeh (professor at UC Berkeley) wrote his original paper on fuzzy set theory. In various occasions, this is what he said FUZZY LOGIC Fuzzy Logic Lotfi Zadeh (professor at UC Berkeley) wrote his original paper on fuzzy set theory. In various occasions, this is what he said Fuzzy logic is a means of presenting problems to

More information

Bolero Comments. The guiding principles for bolero XML tools and methodologies are:

Bolero Comments. The guiding principles for bolero XML tools and methodologies are: Bolero Comments The guiding principles for bolero XML tools and methodologies are: Compliance with standard UML Compliance with standard XML Compliance with ebxml where relevant Technology independence

More information

Neural Networks Laboratory EE 329 A

Neural Networks Laboratory EE 329 A Neural Networks Laboratory EE 329 A Introduction: Artificial Neural Networks (ANN) are widely used to approximate complex systems that are difficult to model using conventional modeling techniques such

More information

Decision Trees: Representation

Decision Trees: Representation Decision Trees: Representation Machine Learning Fall 2018 Some slides from Tom Mitchell, Dan Roth and others 1 Key issues in machine learning Modeling How to formulate your problem as a machine learning

More information

Similarity Measures of Pentagonal Fuzzy Numbers

Similarity Measures of Pentagonal Fuzzy Numbers Volume 119 No. 9 2018, 165-175 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Similarity Measures of Pentagonal Fuzzy Numbers T. Pathinathan 1 and

More information

CHAPTER 4 FREQUENCY STABILIZATION USING FUZZY LOGIC CONTROLLER

CHAPTER 4 FREQUENCY STABILIZATION USING FUZZY LOGIC CONTROLLER 60 CHAPTER 4 FREQUENCY STABILIZATION USING FUZZY LOGIC CONTROLLER 4.1 INTRODUCTION Problems in the real world quite often turn out to be complex owing to an element of uncertainty either in the parameters

More information

Case-Based Reasoning

Case-Based Reasoning 0/0/ Case-Based Reasoning In this lecture, we turn to another popular form of reasoning system: case based reasoning (CBR) Unlike Rule-based systems and Fuzzy Logic, CBR does not use any rules or logical

More information

Lesson #17 Function Introduction

Lesson #17 Function Introduction Lesson #17 Function Introduction A.A.37 A.A.40 A.A.41 Define a relation and function Write functions in functional notation Use functional notation to evaluate functions for given values in the domain

More information

CS Machine Learning

CS Machine Learning CS 60050 Machine Learning Decision Tree Classifier Slides taken from course materials of Tan, Steinbach, Kumar 10 10 Illustrating Classification Task Tid Attrib1 Attrib2 Attrib3 Class 1 Yes Large 125K

More information

Understanding Geospatial Data Models

Understanding Geospatial Data Models Understanding Geospatial Data Models 1 A geospatial data model is a formal means of representing spatially referenced information. It is a simplified view of physical entities and a conceptualization of

More information

Model Selection Introduction to Machine Learning. Matt Gormley Lecture 4 January 29, 2018

Model Selection Introduction to Machine Learning. Matt Gormley Lecture 4 January 29, 2018 10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Model Selection Matt Gormley Lecture 4 January 29, 2018 1 Q&A Q: How do we deal

More information

University of Florida CISE department Gator Engineering. Visualization

University of Florida CISE department Gator Engineering. Visualization Visualization Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida What is visualization? Visualization is the process of converting data (information) in to

More information

Fuzzy Classification of Facial Component Parameters

Fuzzy Classification of Facial Component Parameters Fuzzy Classification of Facial Component Parameters S. alder 1,. Bhattacherjee 2,. Nasipuri 2,. K. Basu 2* and. Kundu 2 1 epartment of Computer Science and Engineering, RCCIIT, Kolkata -, India Email:

More information

Course on Artificial Intelligence and Intelligent Systems

Course on Artificial Intelligence and Intelligent Systems Course on Artificial Intelligence and Intelligent Systems Example and exercise using an Excel based Neural Network package Henning Christiansen Roskilde University, Computer Science Dept. c 2007 Version

More information

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395 Data Mining Introduction Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 21 Table of contents 1 Introduction 2 Data mining

More information

REASONING UNDER UNCERTAINTY: FUZZY LOGIC

REASONING UNDER UNCERTAINTY: FUZZY LOGIC REASONING UNDER UNCERTAINTY: FUZZY LOGIC Table of Content What is Fuzzy Logic? Brief History of Fuzzy Logic Current Applications of Fuzzy Logic Overview of Fuzzy Logic Forming Fuzzy Set Fuzzy Set Representation

More information

Measures of Dispersion

Measures of Dispersion Lesson 7.6 Objectives Find the variance of a set of data. Calculate standard deviation for a set of data. Read data from a normal curve. Estimate the area under a curve. Variance Measures of Dispersion

More information

Data organization. So what kind of data did we collect?

Data organization. So what kind of data did we collect? Data organization Suppose we go out and collect some data. What do we do with it? First we need to figure out what kind of data we have. To illustrate, let s do a simple experiment and collect the height

More information

Tutorial 1 Answers. Question 1

Tutorial 1 Answers. Question 1 Tutorial 1 Answers Question 1 Complexity Software in it what is has to do, is often essentially complex. We can think of software which is accidentally complex such as a large scale e-commerce system (simple

More information

Intro to Programming. Unit 7. What is Programming? What is Programming? Intro to Programming

Intro to Programming. Unit 7. What is Programming? What is Programming? Intro to Programming Intro to Programming Unit 7 Intro to Programming 1 What is Programming? 1. Programming Languages 2. Markup vs. Programming 1. Introduction 2. Print Statement 3. Strings 4. Types and Values 5. Math Externals

More information

Midterms Save the Dates!

Midterms Save the Dates! University of British Columbia CPSC 111, Intro to Computation Alan J. Hu Primitive Data Types Arithmetic Operators Readings Your textbook is Big Java (3rd Ed). This Week s Reading: Ch 2.1-2.5, Ch 4.1-4.2.

More information

PharmaSUG China 2018 Paper AD-62

PharmaSUG China 2018 Paper AD-62 PharmaSUG China 2018 Paper AD-62 Decomposition and Reconstruction of TLF Shells - A Simple, Fast and Accurate Shell Designer Chengeng Tian, dmed Biopharmaceutical Co., Ltd., Shanghai, China ABSTRACT Table/graph

More information

Bézier Surface Modeling for Neutrosophic Data Problems

Bézier Surface Modeling for Neutrosophic Data Problems Neutrosophic Sets and Systems, Vol. 19, 2018 19 University of New Mexico Bézier Surface Modeling for Neutrosophic Data Problems Selçuk Topal 1 * and Ferhat Taş 2 1 Department of Mathematics, Faculty of

More information

Instances and Classes. SOFTWARE ENGINEERING Christopher A. Welty David A. Ferrucci. 24 Summer 1999 intelligence

Instances and Classes. SOFTWARE ENGINEERING Christopher A. Welty David A. Ferrucci. 24 Summer 1999 intelligence Instances and Classes in SOFTWARE ENGINEERING Christopher A. Welty David A. Ferrucci 24 Summer 1999 intelligence Software Engineering Over the past decade or so, one of the many areas that artificial intelligence

More information

STAT10010 Introductory Statistics Lab 2

STAT10010 Introductory Statistics Lab 2 STAT10010 Introductory Statistics Lab 2 1. Aims of Lab 2 By the end of this lab you will be able to: i. Recognize the type of recorded data. ii. iii. iv. Construct summaries of recorded variables. Calculate

More information