User Configurable Semantic Natural Language Processing

Size: px
Start display at page:

Download "User Configurable Semantic Natural Language Processing"

Transcription

1 User Configurable Semantic Natural Language Processing Jason Hedges CEO and Founder Edgetide LLC (443)

2 Table of Contents Bridging the Gap between Human and Machine Language... 3 Human and Machine Languages... 3 Ordenite: The Missing Link Between Human and Machine Language... 3 What is Semantic Natural Language Processing?... 3 How Ordenite Works... 4 User Configuration... 4 Lenses... 5 Entities and the Entity Ontology... 5 Activities and the Activity Ontology... 5 Building the Graph Data Structure... 7 Ordenite Implementations... 8 Question Answer System... 8 Populating a Triple Store... 9 Data Mining Facet Generation Data for Machine Analysis

3 Bridging the Gap between Human and Machine Language It is estimated that unstructured information accounts for 70-90% of the data within most organizations. As computer systems advance, so too does the amount of unstructured data within the digital world. Despite the overwhelming majority of unstructured text within an enterprise, there are few tools that allow a computer system to have a deep understanding of what the text describes. Human and Machine Languages Human languages describe entities and activities and their relationship to each other. Whether someone is describing a complex scientific reaction between particles or the latest blockbuster movies, they are describing entities and activities or things and things that are happening. This is how humans experience the world, with objects (including the intangible) and events. Machine languages describe logic, processes, and algorithms. Computer systems excel with structured data where they can easily use within computer programs, apply statistical models, easily search and discover the data, and display to a user in a variety of formats. However, much of the data that humans create is unstructured. This creates a gap between the majority of data and the type of data a computer system excels with. Ordenite: The Missing Link Between Human and Machine Language Edgetide spent several years researching and prototyping different tools that would allow a system to obtain a deeper understanding of unstructured text. We determined that, to bridge the gap between human and machine, a highly configurable topic based entity and activity extraction system is required because all human languages describe entities and activities. The system would also need to understand the relationships between entities and the activities. Finally, the system would need to convert the unstructured text to a data structure that a computer system could easily understand without losing any meaning from the text in the conversion. What is Semantic Natural Language Processing? Semantic Natural Language Processing (NLP) is the ability to capture the meaning of unstructured text in a way that a computer system can understand and fully take advantage of. Ordenite offers a highly configurable Semantic NLP Extraction platform that orders and unites unstructured data by determining the semantic meaning of text and building linked node graph-based data structures from the content. These data structures enable computer systems to query and analyze unstructured content. Ordenite goes beyond typical syntactic comparison of words to interpret the meaning of statements. Ordenite s design allows users to extract objects and graphs by configuring or extending user-defined lenses without software code or statistical training. Thus 3

4 providing a machine friendly format that captures the meaning of the text within the perspective of the configured topic area. How Ordenite Works Ordenite s patent pending methods and algorithms empower an organization to unlock their unstructured content for machine evaluation, search, and analysis. Ordenite does ship with certain lenses (configurations for specific topics) for extraction but a great advantage for the customer is the ability to modify or create new lenses. In this section we ll briefly walk through how Ordenite s configuration works and how it relates to text extraction. User Configuration One of Ordenite s most versatile features is the ability to create new configurations based on area of interest. We call these lenses because they offer a different view of the data specific to the scope of desired topic area. Each lens consists of an ontology for entities and another for activities. Ordenite has an easy to use web interface for creating new lenses or modifying existing ones. A graphical web interface (as shown in figure 1) makes it easy to create and modify ontologies for activities and entities. Users can also attach rules and operations to each entity or activity directly in the web interface. Figure 1 Entity Ontology with the Human entity highlighted 4

5 Lenses Ordenite divides different extraction configurations by user configurable topics of interest. We call these lenses. A lens is a configuration for a specific subject of interest. User-defined lenses allow Ordenite to provide multiple perspectives to a given corpora of input. This means that different users can interpret content based on specific interests, which enhances flexibility. Lenses can range in topics as different as sports to terrorism to finance and can be as specific or generic as needed. A lens is comprised of an entity ontology and an activity ontology. The two ontologies represent things and things that happen within a subject of interest. Users can create multiple lenses and even derive new lenses from existing ones. Entities and the Entity Ontology An entity is some sort of distinct and independent thing. As mentioned above, Ordenite s purpose is to be a highly configurable system that recognizes activities and entities and how they are related to each other. Many NLP products and tools include entities in some form. Ordenite takes the idea of entities much farther than most other because it allows you to relate the entities to their attributes and also to associated activities. Making it even more practical for custom use, Ordenite gives control of the configuration of the entities to the user per topic area. These are called lenses and are described in greater detail above. Entities can contain a set of attributes and also inherit the attributes of its parents. They can be a portion of the entity s text or another entity all together. For example, The red car was parked at the store. The entity Vehicle could have the attribute color which in this sentence would be red. The ability to attach attributes to entities is important so that the meaning of the text is maintained when translated to a graph data structure. There are many advantages of using an ontology to configure entities for a lens. An ontology is a method of modeling knowledge around a domain. Specifying the relationships between entities is valuable for recognition and graph data construction. In addition entities can inherit from parent entities within the ontology. Inheriting a parent s attributes can reduce a lot of configuration needed for an entity. For example, consider the previous example sentence The red car was parked at the store. If we were to have a parent entity named Tangible which has a definition for the attribute color, the entity Vehicle could inherit the attribute definition since it would be a child of Tangible. The real advantage is that any other entity that also inherits from Tangible would also inherit the rule. Activities and the Activity Ontology The section above regarding entities is how Ordenite extracts things. Activities are how Ordenite extracts and understands things that happen. Ordenite uses fully 5

6 configurable activity ontologies where rules can be attached to each activity. The activity ontology follows a hierarchical structure that allows children to inherit rules and attributes from parent objects, greatly reducing the amount of configuration for subtypes. Ordenite uses a combination of lexical items and attribute rules to determine the semantic meaning of the statement. To better explain how it works, consider the three following simple example sentences: 1) Sally Smith made Joy walk to the park. 2) Sally Smith made Joy some cookies. 3) Sally Smith made Joy happy. Each of the three above sentences has the same lexical item, the verb to make. However, each sentence has a very different meaning. Humans can determine the difference in the meaning because of context. Ordenite can do the same. The activity Force Person is identified in the first sentence. The configuration specifies 4 required rules to be met: to make or to force as the lexical item Human entity in the subject position of the statement as the Actor Human entity in the object position of the statement as the Affected Verb phrase with the Affected as the subject as the Action attribute. Ordenite can determine the position of a word in relation to the lexical item. A word or entity in the subject position is what is performing the lexical item. In the example Sally Smith is the entity performing the to make. The object position is the word or entity in which the lexical item is affecting. In the example Joy is the entity affected by the lexical item. Ordenite can determine the correct position regardless of the multiple ways a statement can be constructed. 6

7 Figure 2 Ordenite graph data output of Sally Smith made Joy walk to the park Building the Graph Data Structure The building blocks of a Semantic Web graph are triples, which consist of a subject, predicate, and object. To build a graph data structure the entities and activities are first extracted from the text. Once entities are populated from the rules found within the entity ontology in the lens, triples are constructed from the related attributes. Similarly, once activities have been extracted from the text they are converted into triples. The name of the activity is the subject, the name of the attribute is the predicate, and the value of the attribute is the object. The value of an activity attribute is most likely to be an entity, which enables the activities and entities to be related to one another. When the triples are merged together they create a group of interconnected nodes or graph data structure. Ordenite generates this graph data structure automatically from unstructured text based on the lens configuration. The graph can be outputted in open standard formats like RDF, N-Quads, or as JSON for easier integration into certain software code. 7

8 Figure 3 Graph data visualization for multiple terrorism narratives Ordenite Implementations Ordenite is and has been used in a wide variety of implementations. Ordenite is based on open standards so that it can be quickly and easily integrated into existing enterprise architectures with minimal integration. Below some interesting uses of Ordenite are highlighted. Question Answer System Ordenite was used to create a system that essentially allows a user to type a what, where, or when question and receive an answer with a snippet of the original text for reference. As shown in multiple sections above, Ordenite can convert unstructured text into a graph data structure based on lens configuration. Ordenite also has the ability to convert a human language question to a graph data query for a specific topic area (lens). To conform with open standards, Ordenite uses SPARQL as the query syntax. The ability to convert a question to SPARQL empowers users to perform complex graph queries without needing to know the syntax or even the data store ontology. While only SPARQL is supported, an API for question conversion exists to extract query parameters for other query syntaxes. 8

9 Figure 4 Screen shot of question answer system with the question What buildings were damaged from dynamite Populating a Triple Store As mentioned throughout this document, Ordenite outputs a graph data structure based on a user defined lens configuration. The graph data structure is outputted as JSON, RDF, or N-Quads. RDF and N-Quads can be directly inserted into most triple stores. In the figure below the Open Source triple store Sesame is shown with data ingested from Ordenite. Ordenite was used to create graph data structures from tens of thousands of narratives describing terrorism events. The graph was outputted as N-Quads so that the triple plus the context could be inserted into Sesame. Ordenite has been used to ingest unstructured text into Sesame using a variety of lenses from a wide range of unstructured sources. 9

10 Ordenite: User Configurable Semantic Natural Language Processing Figure 5 Screen shot of the Sesame Workbench with Ordenite ingested triples Data Mining While the actual output of Ordenite extraction is a graph data structure, Ordenite has the ability to convert the graph to single or interrelated tables. This is useful for data mining instances where the desired product might be an excel spreadsheet or even the population of a traditional relational database. Below is an example of extracting criminal activity along with the details of the crime. In this example, news stories were used as the source and Ordenite was used to mine the desired details. Figure 6 Snippet of a crimes committed table generated from news feeds Facet Generation Ordenite is easily integrated with Solr, the popular open source enterprise search platform. Ordenite has built in features to populate Solr fields, which are used in faceted search. In addition to populating facets Ordenite comes with an Open Source Solr visualization platform for user friendly Solr Search. Ordenite can populate Solr fields from entities, entity attributes, activities, and activity attributes. Ordenite can be configured to extract locations and time as well as text. Below is an example of an Ordenite and Solr integration using records describing terrorist events. The facets generated by Ordenite in this example are activity, location, date of incident, actor, victim, actor, weapon, relief organization, and terrorist group. Edgetide LLC 2015 All Rights Reserved 10

11 Figure 7 Screen shot of Solr Interface with Ordenite generated fields for facets Data for Machine Analysis Ordenite has extracted unstructured data for uses ranging from data science to dashboards. When text has the ability to be structured in a way that machines easily understand, it s trivial to use unstructured data in commercial and open source products and libraries that normally would be impossible to use with text. Ordenite has empowered text to be used in several proprietary and open source products and libraries. Figure 8 Screen shots of visualizations using Ordenite extracted data 11

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

Profiles Research Networking Software API Guide

Profiles Research Networking Software API Guide Profiles Research Networking Software API Guide Documentation Version: March 13, 2013 Software Version: ProfilesRNS_1.0.3 Table of Contents Overview... 2 PersonID, URI, and Aliases... 3 1) Profiles RNS

More information

Text Mining. Representation of Text Documents

Text Mining. Representation of Text Documents Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,

More information

Text Mining for Software Engineering

Text Mining for Software Engineering Text Mining for Software Engineering Faculty of Informatics Institute for Program Structures and Data Organization (IPD) Universität Karlsruhe (TH), Germany Department of Computer Science and Software

More information

Knowledge Base for Business Intelligence

Knowledge Base for Business Intelligence Knowledge Base for Business Intelligence System for population and linking of knowledge bases dealing with data, information and knowledge comming from heterogeneous data sources to provide pluggable Business

More information

Natural Language Processing with PoolParty

Natural Language Processing with PoolParty Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle

More information

Object-oriented Compiler Construction

Object-oriented Compiler Construction 1 Object-oriented Compiler Construction Extended Abstract Axel-Tobias Schreiner, Bernd Kühl University of Osnabrück, Germany {axel,bekuehl}@uos.de, http://www.inf.uos.de/talks/hc2 A compiler takes a program

More information

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 93-94

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 93-94 ه عا ی Semantic Web Ontology Engineering and Evaluation Morteza Amini Sharif University of Technology Fall 93-94 Outline Ontology Engineering Class and Class Hierarchy Ontology Evaluation 2 Outline Ontology

More information

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual IT1105 Information Systems and Technology BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing Student Manual Lesson 3: Organizing Data and Information (6 Hrs) Instructional Objectives Students

More information

Whitepaper on a 360 Degree Strategy for Text Analysis

Whitepaper on a 360 Degree Strategy for Text Analysis Whitepaper on a 360 Degree Strategy for Text Analysis Cohan Sujay Carlos Researcher, Aiaioo Labs Benson Town, Bangalore, India http://www.aiaioo.com cohan@aiaioo.com Abstract We propose a strategy for

More information

Query-Time JOIN for Active Intelligence Engine (AIE)

Query-Time JOIN for Active Intelligence Engine (AIE) Query-Time JOIN for Active Intelligence Engine (AIE) Ad hoc JOINing of Structured Data and Unstructured Content: An Attivio-Patented Breakthrough in Information- Centered Business Agility An Attivio Technology

More information

Enhancing applications with Cognitive APIs IBM Corporation

Enhancing applications with Cognitive APIs IBM Corporation Enhancing applications with Cognitive APIs After you complete this section, you should understand: The Watson Developer Cloud offerings and APIs The benefits of commonly used Cognitive services 2 Watson

More information

IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN

IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN Movie Related Information Retrieval Using Ontology Based Semantic Search Tarjni Vyas, Hetali Tank, Kinjal Shah Nirma University, Ahmedabad tarjni.vyas@nirmauni.ac.in, tank92@gmail.com, shahkinjal92@gmail.com

More information

SemSearch 2008, CEUR Workshop Proceedings, ISSN , online at CEUR-WS.org/Vol-334/ QuiKey a Demo. Heiko Haller

SemSearch 2008, CEUR Workshop Proceedings, ISSN , online at CEUR-WS.org/Vol-334/ QuiKey a Demo. Heiko Haller QuiKey a Demo Heiko Haller Forschungszentrum Informatik (FZI), Germany heiko.haller@fzi.de Abstract. QuiKey is a light-weight tool that can act as an interactive command-line for a semantic knowledge base.

More information

STS Infrastructural considerations. Christian Chiarcos

STS Infrastructural considerations. Christian Chiarcos STS Infrastructural considerations Christian Chiarcos chiarcos@uni-potsdam.de Infrastructure Requirements Candidates standoff-based architecture (Stede et al. 2006, 2010) UiMA (Ferrucci and Lally 2004)

More information

Enterprise Knowledge Map: Toward Subject Centric Computing. March 21st, 2007 Dmitry Bogachev

Enterprise Knowledge Map: Toward Subject Centric Computing. March 21st, 2007 Dmitry Bogachev Enterprise Knowledge Map: Toward Subject Centric Computing March 21st, 2007 Dmitry Bogachev Are we ready?...the idea of an application is an artificial one, convenient to the programmer but not to the

More information

Jumpstarting the Semantic Web

Jumpstarting the Semantic Web Jumpstarting the Semantic Web Mark Watson. Copyright 2003, 2004 Version 0.3 January 14, 2005 This work is licensed under the Creative Commons Attribution-NoDerivs-NonCommercial License. To view a copy

More information

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS CHAPTER 6 DATABASE MANAGEMENT SYSTEMS Management Information Systems, 10 th edition, By Raymond McLeod, Jr. and George P. Schell 2007, Prentice Hall, Inc. 1 Learning Objectives Understand the hierarchy

More information

Building the News Search Engine

Building the News Search Engine Building the News Search Engine Ramkumar Aiyengar Team Leader, R&D News Search, Bloomberg L.P. andyetitmoves@apache.org A technology company Our strength and focus is data The Terminal, vertical portals

More information

Fractal Data Modeling

Fractal Data Modeling Fractal Data Modeling Fractal geometry creates beautiful patterns from simple recursive algorithms. One of the things we find so appealing is their self- similarity at different scales. That is, as you

More information

Requirements. Chapter Learning objectives of this chapter. 2.2 Definition and syntax

Requirements. Chapter Learning objectives of this chapter. 2.2 Definition and syntax Chapter 2 Requirements A requirement is a textual description of system behaviour. A requirement describes in plain text, usually English, what a system is expected to do. This is a basic technique much

More information

True Natural Language Understanding: How Does Kyndi Numeric Mapping Work?

True Natural Language Understanding: How Does Kyndi Numeric Mapping Work? True Natural Language Understanding: How Does Kyndi Numeric Mapping Work? Kyndi is pioneering methods that bring natural language understanding to the enterprise toolkit. Kyndi converts text resources

More information

Universal Model Framework -- An Introduction

Universal Model Framework -- An Introduction Universal Model Framework -- An Introduction By Visible Systems Corporation www.visible.com This document provides an introductory description of the Universal Model Framework an overview of its construct

More information

Customisable Curation Workflows in Argo

Customisable Curation Workflows in Argo Customisable Curation Workflows in Argo Rafal Rak*, Riza Batista-Navarro, Andrew Rowley, Jacob Carter and Sophia Ananiadou National Centre for Text Mining, University of Manchester, UK *Corresponding author:

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

SADT Structured Analysis & Design Technique

SADT Structured Analysis & Design Technique 1 SADT Structured Analysis & Design Technique Yuling Li 12/5/16 2 How to Make a Pizza? 3 4 How to Make a Pizza (Process/Activities) Systematically? Analysis Determine what the system will do Design Define

More information

Unlocking the full potential of location-based services: Linked Data driven Web APIs

Unlocking the full potential of location-based services: Linked Data driven Web APIs Unlocking the full potential of location-based services: Linked Data driven Web APIs Open Standards for Linked Organisations about Raf Buyle Ziggy Vanlishout www.vlaanderen.be/informatievlaanderen 6.4

More information

Developing SQL Data Models

Developing SQL Data Models Developing SQL Data Models 20768B; 3 Days; Instructor-led Course Description The focus of this 3-day instructor-led course is on creating managed enterprise BI solutions. It describes how to implement

More information

OWL as a Target for Information Extraction Systems

OWL as a Target for Information Extraction Systems OWL as a Target for Information Extraction Systems Clay Fink, Tim Finin, James Mayfield and Christine Piatko Johns Hopkins University Applied Physics Laboratory and the Human Language Technology Center

More information

Introduction to RDF and the Semantic Web for the life sciences

Introduction to RDF and the Semantic Web for the life sciences Introduction to RDF and the Semantic Web for the life sciences Simon Jupp Sample Phenotypes and Ontologies Team European Bioinformatics Institute jupp@ebi.ac.uk Practical sessions Converting data to RDF

More information

Semantic Annotation, Search and Analysis

Semantic Annotation, Search and Analysis Semantic Annotation, Search and Analysis Borislav Popov, Ontotext Ontology A machine readable conceptual model a common vocabulary for sharing information machine-interpretable definitions of concepts in

More information

Finding Topic-centric Identified Experts based on Full Text Analysis

Finding Topic-centric Identified Experts based on Full Text Analysis Finding Topic-centric Identified Experts based on Full Text Analysis Hanmin Jung, Mikyoung Lee, In-Su Kang, Seung-Woo Lee, Won-Kyung Sung Information Service Research Lab., KISTI, Korea jhm@kisti.re.kr

More information

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96 ه عا ی Semantic Web Ontology Engineering and Evaluation Morteza Amini Sharif University of Technology Fall 95-96 Outline Ontology Engineering Class and Class Hierarchy Ontology Evaluation 2 Outline Ontology

More information

Orchestrating Music Queries via the Semantic Web

Orchestrating Music Queries via the Semantic Web Orchestrating Music Queries via the Semantic Web Milos Vukicevic, John Galletly American University in Bulgaria Blagoevgrad 2700 Bulgaria +359 73 888 466 milossmi@gmail.com, jgalletly@aubg.bg Abstract

More information

FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA

FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA MODELDR & MARKLOGIC - DATA POINT MODELING MARKLOGIC WHITE PAPER JUNE 2015 CHRIS ATKINSON Contents Regulatory Satisfaction is Increasingly Difficult

More information

Semantic Web Mining and its application in Human Resource Management

Semantic Web Mining and its application in Human Resource Management International Journal of Computer Science & Management Studies, Vol. 11, Issue 02, August 2011 60 Semantic Web Mining and its application in Human Resource Management Ridhika Malik 1, Kunjana Vasudev 2

More information

Semantics In Action For Proactive Policing

Semantics In Action For Proactive Policing Semantics In Action For Proactive Policing Jen Shorten Technical Delivery Architect, Consulting Services Jon Williams Senior Sales Engineer, UK Public Sector The Nature of Policing Is Changing The increasing

More information

Model driven paradigm

Model driven paradigm Model driven paradigm a) The system is defined as an executable specification which is an object-oriented analysis model. b) The system is validated at the analysis model level. c) A software and execution

More information

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective George Bruseker CIDOC 2017 Tblisi, Georgia 27/09/2017 Researcher, Interpreter Goal: A Semantic

More information

A semantic approach for discovering egovernment services

A semantic approach for discovering egovernment services 6th WSEAS International Conference on SYSTEM SCIENCE and SIMULATION in ENGINEERING, Venice, Italy, November 21-23, 2007 153 A semantic approach for discovering egovernment services EFTHIMIOS TAMBOURIS,

More information

CA ERwin Data Modeler

CA ERwin Data Modeler CA ERwin Data Modeler Implementation Guide Service Pack 9.5.2 This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to only and is subject

More information

Precise Medication Extraction using Agile Text Mining

Precise Medication Extraction using Agile Text Mining Precise Medication Extraction using Agile Text Mining Chaitanya Shivade *, James Cormack, David Milward * The Ohio State University, Columbus, Ohio, USA Linguamatics Ltd, Cambridge, UK shivade@cse.ohio-state.edu,

More information

Electronic Health Records with Cleveland Clinic and Oracle Semantic Technologies

Electronic Health Records with Cleveland Clinic and Oracle Semantic Technologies Electronic Health Records with Cleveland Clinic and Oracle Semantic Technologies David Booth, Ph.D., Cleveland Clinic (contractor) Oracle OpenWorld 20-Sep-2010 Latest version of these slides: http://dbooth.org/2010/oow/

More information

GROW YOUR BUSINESS WITH AN ALL-IN-ONE REAL ESTATE PLATFORM

GROW YOUR BUSINESS WITH AN ALL-IN-ONE REAL ESTATE PLATFORM GROW YOUR BUSINESS WITH AN ALL-IN-ONE REAL ESTATE PLATFORM ZipperAgent TABLE OF CONTENTS 1. Introduction: How valuable is your CRM? 2. Online Lead Capture: Online lead capture builds your business 3. Timely

More information

Data Sheet: ITTIA ODBC. Copyright 2005 ITTIA LLC All rights reserved

Data Sheet: ITTIA ODBC. Copyright 2005 ITTIA LLC All rights reserved Data Sheet: ITTIA ODBC Copyright 2005 ITTIA LLC All rights reserved Data Sheet: ITTIA ODBC ITTIA ODBC meets the needs of developers who need to connect their db.* Open Source database or proprietary Birdstep

More information

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities 112 Outline Morning program Preliminaries Semantic matching Learning to rank Afternoon program Modeling user behavior Generating responses Recommender systems Industry insights Q&A 113 are polysemic Finding

More information

Data and Information Integration: Information Extraction

Data and Information Integration: Information Extraction International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Data and Information Integration: Information Extraction Varnica Verma 1 1 (Department of Computer Science Engineering, Guru Nanak

More information

Developing SQL Data Models

Developing SQL Data Models Course 20768B: Developing SQL Data Models Page 1 of 5 Developing SQL Data Models Course 20768B: 2 days; Instructor-Led Introduction The focus of this 2-day instructor-led course is on creating managed

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

2 The IBM Data Governance Unified Process

2 The IBM Data Governance Unified Process 2 The IBM Data Governance Unified Process The benefits of a commitment to a comprehensive enterprise Data Governance initiative are many and varied, and so are the challenges to achieving strong Data Governance.

More information

CTL.SC4x Technology and Systems

CTL.SC4x Technology and Systems in Supply Chain Management CTL.SC4x Technology and Systems Key Concepts Document This document contains the Key Concepts for the SC4x course, Weeks 1 and 2. These are meant to complement, not replace,

More information

Business Intelligence

Business Intelligence Business Intelligence The Metadata Layer Asroni Ver. 01 asroni@umy.ac.id Part IV Business Intelligence Applications 345 Applications In This Part Chapter 12: The Metadata Layer Chapter 13: Using the Pentaho

More information

Ontology Extraction from Heterogeneous Documents

Ontology Extraction from Heterogeneous Documents Vol.3, Issue.2, March-April. 2013 pp-985-989 ISSN: 2249-6645 Ontology Extraction from Heterogeneous Documents Kirankumar Kataraki, 1 Sumana M 2 1 IV sem M.Tech/ Department of Information Science & Engg

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems & University of Koblenz Landau, Germany Semantic Search examples: Swoogle and Watson Steffen Staad credit: Tim Finin (swoogle), Mathieu d Aquin (watson) and their groups 2009-07-17

More information

Web Ontology for Software Package Management

Web Ontology for Software Package Management Proceedings of the 8 th International Conference on Applied Informatics Eger, Hungary, January 27 30, 2010. Vol. 2. pp. 331 338. Web Ontology for Software Package Management Péter Jeszenszky Debreceni

More information

BUILDING THE SEMANTIC WEB

BUILDING THE SEMANTIC WEB BUILDING THE SEMANTIC WEB You might have come across the term Semantic Web Applications often, during talks about the future of Web apps. Check out what this is all about There are two aspects to the possible

More information

SharePoint 2013 Site Owner

SharePoint 2013 Site Owner SharePoint 2013 Site Owner Effective Content and Document Collaboration with Axalta Teams 9 May 2014 Instructor: Jason Christie Site Owner Course Topics to be Covered Content Management Creating and configuring

More information

Business Modelling. PRACTICAL OBJECT-ORIENTED DESIGN WITH UML 2e. Early phase of development Inputs: Activities: informal specification

Business Modelling. PRACTICAL OBJECT-ORIENTED DESIGN WITH UML 2e. Early phase of development Inputs: Activities: informal specification PRACTICAL OBJECT-ORIENTED DESIGN WITH UML 2e Chapter 4: Restaurant System: Business Modelling Slide 1/1 Business Modelling Early phase of development Inputs: informal specification Activities: create use

More information

Finding Sentiment and the Value Within

Finding Sentiment and the Value Within WHITE PAPER Finding Sentiment and the Value Within by W H Inmon, Forest Rim Technology Nearly every corporation has a call center. Ask an executive if his/her organization has a call center and the odds

More information

Implementing a Knowledge Database for Scientific Control Systems. Daniel Gresh Wheatland-Chili High School LLE Advisor: Richard Kidder Summer 2006

Implementing a Knowledge Database for Scientific Control Systems. Daniel Gresh Wheatland-Chili High School LLE Advisor: Richard Kidder Summer 2006 Implementing a Knowledge Database for Scientific Control Systems Abstract Daniel Gresh Wheatland-Chili High School LLE Advisor: Richard Kidder Summer 2006 A knowledge database for scientific control systems

More information

PRINCIPLES OF COMPILER DESIGN UNIT I INTRODUCTION TO COMPILERS

PRINCIPLES OF COMPILER DESIGN UNIT I INTRODUCTION TO COMPILERS Objective PRINCIPLES OF COMPILER DESIGN UNIT I INTRODUCTION TO COMPILERS Explain what is meant by compiler. Explain how the compiler works. Describe various analysis of the source program. Describe the

More information

Semantic Parsing for Location Intelligence

Semantic Parsing for Location Intelligence Semantic Parsing for Location Intelligence Voicebox s advanced system for helping you navigate the world Voicebox s Location Intelligence system combines cutting-edge Semantic Parsing for natural language

More information

Xton Access Manager GETTING STARTED GUIDE

Xton Access Manager GETTING STARTED GUIDE Xton Access Manager GETTING STARTED GUIDE XTON TECHNOLOGIES, LLC PHILADELPHIA Copyright 2017. Xton Technologies LLC. Contents Introduction... 2 Technical Support... 2 What is Xton Access Manager?... 3

More information

Is Linked Data the future of data integration in the enterprise?

Is Linked Data the future of data integration in the enterprise? Is Linked Data the future of data integration in the enterprise? John Walker Email: john.walker@nxp.com Twitter: @NXPdata Pilot Linked Open Data NXP is a semiconductor (microchip) manufacturer Established:

More information

SC32 WG2 Metadata Standards Tutorial

SC32 WG2 Metadata Standards Tutorial SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China WG2 Viewpoint Big Data magnifies the existing challenges and issues of managing and interpreting

More information

A Scotas white paper September Scotas Push Connector

A Scotas white paper September Scotas Push Connector A Scotas white paper September 2013 Scotas Push Connector Introduction When you have to perform searches over big data, you need specialized solutions that can deal with the velocity, variety and volume

More information

WELCOME TO TECH IMMERSION

WELCOME TO TECH IMMERSION WELCOME TO TECH IMMERSION Track: SQL/BI PowerPivot with Excel 2010 Presenter: Jeff Jones Outline o BI EcoSystem o PowerPivot for Excel What, Why, Who? o Using PowerPivot o New Powerful Formulas o Using

More information

Question #1: 1. The assigned readings use the phrase "Database Approach." In your own words, what is the essence of a database approach?

Question #1: 1. The assigned readings use the phrase Database Approach. In your own words, what is the essence of a database approach? Question #1: 1. The assigned readings use the phrase "Database Approach." In your own words, what is the essence of a database approach? Respondant: I understand the essence of "database approach" to be

More information

Natural Language Requirements

Natural Language Requirements Natural Language Requirements Software Verification and Validation Laboratory Requirement Elaboration Heuristic Domain Model» Requirement Relationship Natural Language is elaborated via Requirement application

More information

Xcelerated Business Insights (xbi): Going beyond business intelligence to drive information value

Xcelerated Business Insights (xbi): Going beyond business intelligence to drive information value KNOWLEDGENT INSIGHTS volume 1 no. 5 October 7, 2011 Xcelerated Business Insights (xbi): Going beyond business intelligence to drive information value Today s growing commercial, operational and regulatory

More information

Event Stores (I) [Source: DB-Engines.com, accessed on August 28, 2016]

Event Stores (I) [Source: DB-Engines.com, accessed on August 28, 2016] Event Stores (I) Event stores are database management systems implementing the concept of event sourcing. They keep all state changing events for an object together with a timestamp, thereby creating a

More information

The necessity of hypermedia RDF and an approach to achieve it

The necessity of hypermedia RDF and an approach to achieve it The necessity of hypermedia RDF and an approach to achieve it Kjetil Kjernsmo 1 Department of Informatics, Postboks 1080 Blindern, 0316 Oslo, Norway kjekje@ifi.uio.no Abstract. This paper will give an

More information

Semantic Web in a Constrained Environment

Semantic Web in a Constrained Environment Semantic Web in a Constrained Environment Laurens Rietveld and Stefan Schlobach Department of Computer Science, VU University Amsterdam, The Netherlands {laurens.rietveld,k.s.schlobach}@vu.nl Abstract.

More information

QuickSpecs. ISG Navigator for Universal Data Access M ODELS OVERVIEW. Retired. ISG Navigator for Universal Data Access

QuickSpecs. ISG Navigator for Universal Data Access M ODELS OVERVIEW. Retired. ISG Navigator for Universal Data Access M ODELS ISG Navigator from ISG International Software Group is a new-generation, standards-based middleware solution designed to access data from a full range of disparate data sources and formats.. OVERVIEW

More information

Rapid Information Discovery System (RAID)

Rapid Information Discovery System (RAID) Int'l Conf. Artificial Intelligence ICAI'17 321 Rapid Information Discovery System (RAID) B. Gopal, P. Benjamin, and K. Madanagopal Knowledge Based Systems, Inc. (KBSI), College Station, TX, USA Summary

More information

THE GETTY VOCABULARIES TECHNICAL UPDATE

THE GETTY VOCABULARIES TECHNICAL UPDATE AAT TGN ULAN CONA THE GETTY VOCABULARIES TECHNICAL UPDATE International Working Group Meetings January 7-10, 2013 Joan Cobb Gregg Garcia Information Technology Services J. Paul Getty Trust International

More information

Chapter. Relational Database Concepts COPYRIGHTED MATERIAL

Chapter. Relational Database Concepts COPYRIGHTED MATERIAL Chapter Relational Database Concepts 1 COPYRIGHTED MATERIAL Every organization has data that needs to be collected, managed, and analyzed. A relational database fulfills these needs. Along with the powerful

More information

Mining the Biomedical Research Literature. Ken Baclawski

Mining the Biomedical Research Literature. Ken Baclawski Mining the Biomedical Research Literature Ken Baclawski Data Formats Flat files Spreadsheets Relational databases Web sites XML Documents Flexible very popular text format Self-describing records XML Documents

More information

Multi-agent and Semantic Web Systems: Linked Open Data

Multi-agent and Semantic Web Systems: Linked Open Data Multi-agent and Semantic Web Systems: Linked Open Data Fiona McNeill School of Informatics 14th February 2013 Fiona McNeill Multi-agent Semantic Web Systems: *lecture* Date 0/27 Jena Vcard 1: Triples Fiona

More information

Ivan Herman. F2F Meeting of the W3C Business Group on Oil, Gas, and Chemicals Houston, February 13, 2012

Ivan Herman. F2F Meeting of the W3C Business Group on Oil, Gas, and Chemicals Houston, February 13, 2012 Ivan Herman F2F Meeting of the W3C Business Group on Oil, Gas, and Chemicals Houston, February 13, 2012 (2) (3) } An intelligent system manipulating and analyzing knowledge bases e.g., via big ontologies,

More information

Automile User Guide. Last updated May 2017

Automile User Guide. Last updated May 2017 Automile User Guide Last updated May 2017 Welcome to Automile We are excited to have you onboard. In this guide, you will find detailed instructions on how to install your Automile Box and how to use both

More information

IBM Research Report. Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI)

IBM Research Report. Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI) RC24074 (W0610-047) October 10, 2006 Computer Science IBM Research Report Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI) David Ferrucci, J. William Murdock, Christopher

More information

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data June 2006 Note: This document is for informational purposes. It is not a commitment to deliver any material, code, or functionality,

More information

A Scotas white paper September Scotas OLS

A Scotas white paper September Scotas OLS A Scotas white paper September 2013 Scotas OLS Introduction When you have to perform searches over big data, you need specialized solutions that can deal with the velocity, variety and volume of this valuable

More information

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015 Library of Congress BIBFRAME Pilot NOTSL Fall Meeting October 30, 2015 THE BIBFRAME EDITOR AND THE LC PILOT The Semantic Web and Linked Data : a Recap of the Key Concepts Learning Objectives Describe the

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

Parallel SQL and Streaming Expressions in Apache Solr 6. Shalin Shekhar Lucidworks Inc.

Parallel SQL and Streaming Expressions in Apache Solr 6. Shalin Shekhar Lucidworks Inc. Parallel SQL and Streaming Expressions in Apache Solr 6 Shalin Shekhar Mangar @shalinmangar Lucidworks Inc. Introduction Shalin Shekhar Mangar Lucene/Solr Committer PMC Member Senior Solr Consultant with

More information

Towards a Semantic Wiki Experience Desktop Integration and Interactivity in WikSAR

Towards a Semantic Wiki Experience Desktop Integration and Interactivity in WikSAR Towards a Semantic Wiki Experience Desktop Integration and Interactivity in WikSAR David Aumueller, Sören Auer Department of Computer Science University of Leipzig, Augustusplatz 10-11, 04103 Leipzig,

More information

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases LIDER Survey Overview Participant profile (organisation type, industry sector) Relevant use-cases Discovering and extracting information Understanding opinion Content and data (Data Management) Monitoring

More information

SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES

SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES Jeremy Carroll, Ralph Hodgson, {jeremy,ralph}@topquadrant.com This paper is submitted to The W3C Workshop on Semantic Web in Energy Industries

More information

BPMN Working Draft. 1. Introduction

BPMN Working Draft. 1. Introduction 1. Introduction The Business Process Management Initiative (BPMI) has developed a standard Business Process Modeling Notation (BPMN). The primary goal of BPMN is to provide a notation that is readily understandable

More information

Fundamentals of STEP Implementation

Fundamentals of STEP Implementation Fundamentals of STEP Implementation David Loffredo loffredo@steptools.com STEP Tools, Inc., Rensselaer Technology Park, Troy, New York 12180 A) Introduction The STEP standard documents contain such a large

More information

SELF-SERVICE SEMANTIC DATA FEDERATION

SELF-SERVICE SEMANTIC DATA FEDERATION SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical

More information

4) DAVE CLARKE. OASIS: Constructing knowledgebases around high resolution images using ontologies and Linked Data

4) DAVE CLARKE. OASIS: Constructing knowledgebases around high resolution images using ontologies and Linked Data require a change in development culture and thus training. 5. Impact and Benefits The project was delivered on time and on budget unusual for a project of this scale and the project was hailed as a great

More information

Ontology-based Architecture Documentation Approach

Ontology-based Architecture Documentation Approach 4 Ontology-based Architecture Documentation Approach In this chapter we investigate how an ontology can be used for retrieving AK from SA documentation (RQ2). We first give background information on the

More information

2012 Microsoft Corporation. All rights reserved. Microsoft, Active Directory, Excel, Lync, Outlook, SharePoint, Silverlight, SQL Server, Windows,

2012 Microsoft Corporation. All rights reserved. Microsoft, Active Directory, Excel, Lync, Outlook, SharePoint, Silverlight, SQL Server, Windows, 2012 Microsoft Corporation. All rights reserved. Microsoft, Active Directory, Excel, Lync, Outlook, SharePoint, Silverlight, SQL Server, Windows, Windows Server, and other product names are or may be registered

More information

F08: Intro to Composition

F08: Intro to Composition F08: Intro to Composition Semantics - Ling 331/731 University of Kansas 1 erbs as functions (1). Here is a sadly simplified sentence structure: S P P There are two lexical items that feed this structure:

More information

Furl Furled Furling. Social on-line book marking for the masses. Jim Wenzloff Blog:

Furl Furled Furling. Social on-line book marking for the masses. Jim Wenzloff Blog: Furl Furled Furling Social on-line book marking for the masses. Jim Wenzloff jwenzloff@misd.net Blog: http://www.visitmyclass.com/blog/wenzloff February 7, 2005 This work is licensed under a Creative Commons

More information