KDD- Service based Numerical Entity Searcher (KSNES) Presentation 3 on April 14 th, Naga Sowjanya Karumuri. CIS 895 MSE PROJECT

Size: px
Start display at page:

Download "KDD- Service based Numerical Entity Searcher (KSNES) Presentation 3 on April 14 th, Naga Sowjanya Karumuri. CIS 895 MSE PROJECT"

Transcription

1 KDD- Service based Numerical Entity Searcher (KSNES) Presentation 3 on April 14 th, 2009 Naga Sowjanya Karumuri sowji@ksu.edu 1 CIS 895 MSE PROJECT

2 OUTLINE Introduction Terms Motivation Goal Project Overview Project Data Flow Diagram Component Design Project Evaluation Future Work Prototype Demonstration Questions / Comments 2

3 TERMS[1] Knowledge Discovery in Databases (KDD) a group headed by Dr. Hsu primary focus is machine learning, data mining, human-computer intelligent interaction Natural Language Processing (NLP) To allow computers to process and understand human languages Some areas like Text Segmentation (identify word boundaries) Part-of-speech tagging Word sense disambiguation (words with more than one meaning) 3

4 TERMS[2] Named Entity Recognition (NER) Locating and classifying atomic elements (single part of speech) in text into predefined categories such as Names of Persons Names of Locations Names of Organizations Names of Miscellaneous Entities Example Dr. William H. Hsu is a Professor at Kansas State University located in Manhattan, Kansas. Dr. [PER William H. Hsu ] is a Professor at [ORG Kansas State University ] located in [LOC Manhattan ], [LOC Kansas ]. 4

5 TERMS[3] Shallow Parsing/Chunking NLP technique that attempts to look for key phrases but not to fully parse into a parse tree. Output - series of words mostly nouns, verbs, preposition phrases etc., Example Chunker: [NP He ] [VP reckons ] [NP the current account deficit ] [VP will narrow ] [PP to ] [NP only L1.8 billion ] Full Parser: (PRP)He (VBZ)reckons (DT)the (JJ)current (NN)account (NN)deficit (MD)will (VB)narrow (TO)to (RB)only (L)L (CD)1.8 (CD)billion 5

6 PROJECT OVERVIEW[1] Motivation Occurrence of events is naturally anchored in time within the narrative text Is Bush currently the President of America? When was India attacked by Pakistan in last century? To know the quantities of entities How many Oscar awards are won by Steven Spielberg? What was the highest temperature recorded in the year 2008? 6

7 PROJECT OVERVIEW[2] Goal To develop a system that extracts Numerical Phrases from raw text displays value unit unit-type System is set as a service on the web server User interacts through a webpage Numerical Phrase: Types Number Phrase 33 dollars, 100 Watts, 13 years, two miles Date Phrase Aug 1998, Nov 10 th 1984, between 1989 and

8 PROJECT OVERVIEW[3] Purpose To understand the timestamp of an event To understand the order of occurrence of events To understand the persistence of an event i.e., the time period over which the event occurred and continued For KDD Group To gather certain statistical information from the data they gather by crawling different web pages How many cattle have been affected by the virus? When did the disease break out? Sample NABC (National Agricultural Bio-Security Centre) data is given to the system for testing 8

9 APPLICATION AREAS Textual Entailment (TE) Recognition Given two fragments, whether the meaning of one text can be inferred from another text. Question Answering (QA) System Identifies text that entails the expected answer. Ex: During 1997, 10,000 cattle were killed because of the RVF. Possible inferences (TE) 10,000 cattle were killed because of RVF. RVF occurred during Possible Questions (QA) How many cattle were killed during 1997 RVF outbreak? When did RVF occur? 9

10 SYSTEM OVERVIEW 10

11 PROJECT DATA FLOW DIAGRAM: NUMERICAL ENTITY SEARCHER 11

12 MODULES IN THE PROJECT Webpage (JSP): For requesting and receiving information from the service. POS Tagger (Java): Stanford POS Tagger Numerical Phrase Extractor (Java): Implemented using Shallow Parsing Technique Number-Unit/Date Pattern Recognizer (Java): Implemented based on the Numerical Quantifier developed by Benjamin Sapp, UIUC. 12

13 POS TAGGER TAGSET 13

14 IMPLEMENTING NUMERICAL PHRASE EXTRACTOR Input: Tagged Text I/PRP lost/vbd thirty-three/jj dollars/nns in/in 1998/CD Regular expressions (regex) are used to determine the numerical patterns in the input. thirty-three/jj dollars/nns in/in 1998/CD Output: Numerical Phrases thirty-three dollars in

15 SOME PATTERNS "\\d+-\\d+(/jj /CD) [a-za-z]+/nn" parses \\d+-\\d+(/jj /CD) 3-2/JJ 20-20/JJ [a-za-z]+/nn lead/nn match/nn "(between Between from From In in since Since during During)/IN.../CD (([a-za- Z]+/CC [a-z]+/to).../cd)? parses 'between 1987 and 1997', 'in 2007 and

16 COMPONENT DESIGN Contains class variables and functions Added separate table to describe the roles of functions 16

17 COMPONENT DESIGN (MYPATTERNS)[1] Patterns p_words Matching Numerical Phrases about, around, approximately, more than, nearly, almost, no more than, at least, less than, no fewer than p_tnl p_inl p_words + p_abtfrac p_words + p_age p_words + p_ampm p_and p_tnl + p_anydate this, next, last, since, in between, from, in, since, during about two-thirds of the vote, millions of books 27 year-old bachelor, 27-year-old bachelor About 3:00 a.m., 4:15 p.m. CST 3,792 children and adolescents Oct 1st 1987, Nov 5, December 21,

18 COMPONENT DESIGN (MYPATTERNS)[2] Patterns p_inl + p_btwfrm p_inl + p_btwfrmd Matching Numerical Phrases between 1987 and 1997, in 2007 and 2008 from 200 to 300 miles, from 7.5 percent to 6.85 percent p_date 18 April 2008 p_tnl + p_days p_centuary p_words + p_hyphenww p_hyphennumn um p_in p_mids p_months this Monday, next Saturday, last Friday, Tuesday, Wednesday, 17 th century, 17 th -centuary million-dollar home, six-bedroom home, thirty-three dollars the match, a 3-2 lead 9 in 10 people, 1 in every 8 women mid-1990s, the early 1990s, 1970s January, February, December, Jan, Feb, Sept, Dec 18

19 COMPONENT DESIGN (MYPATTERNS)[3] Patterns p_words + p_numunit p_words + p_per p_words + p_percentinches p_ratio p_tty p_twmy p_xbits p_words + p_yrange Matching Numerical Phrases 33 USD, about 34 miles, 33,333 tons, 3.3 million dollars, one thing, 3.4 billion $33 per day, about 100 miles per hour 39%, 0.5-1%, about 90 %, 20" one of the five people, 89 percent of people, 3 out of 5 people today, tomorrow, yesterday, noon this year, this month, next year, next month, last week, last year, last month 1024KB, 8MB, 320GB, 1TB In , during

20 SAMPLE SENTENCES[1] Sentence I have lost 33,000 dinars in 1998 At just 12-years-old, he enrolled as a freshman at F.I.U. in Miami. The 20" imac is cheaper at $1200 and it has a 320GB hard drive. Volunteers bring in a heavy crane for work on a bridge last month. As for those who do not invest, around 40% say capitalism is better. As of 7 January 2007, about 75 people have died and another 183 infected. Patterns p_numnit p_btwfrm p_age p_percentinches p_numunit p_xbits p_twmy p_percentinches p_date p_numunit 20

21 SAMPLE SENTENCES[2] Sentence Approximately 1% of human sufferers die of the disease. Current listings of 2,000 children and adults who are reported missing, including in-depth coverage of high-profile cases. 38 of the 62 patients who provided blood samples tested positive. She became an exotic dancer at Scores in New York City in the mid-1990s. Peterson's three capped the surge, giving New Orleans a lead. Patterns p_percentinches p_and p_ratio p_mids p_numunit p_hyphennumnum 21

22 PROBLEMS ENCOUNTERED Determining the Patterns Lots of Numerical Phrases found Designed Patterns to filter more than one kind of Numerical Pattern Prioritizing the Patterns More than one pattern may match the same Numerical Phrase To avoid clashes between the Patterns 22

23 PROJECT EVALUATION[1] Test Case Main Functionality Tested Pass/Fail Test Case 1 Application Functionality Pass Test Case 2 POS Tagger Functionality Pass Test Case 3 Numerical Phrase Extractor Functionality Pass Test Case 4 Number-Unit/Date Pattern Recognizer Functionality Pass 23

24 PROJECT EVALUATION[2] Phase Expected Completion Phase Actual Completion Phase 1 February 26, 2009 February 24, March 26, 2009 March 31, April 17, 2009 April 14,

25 PROJECT EVALUATION[3] Phase 2 took more time since Implementation and Testing are done simultaneously 25

26 PROJECT EVALUATION[4] More time for Coding and the Documentation 26

27 PROJECT EVALUATION[5] More time spent in discussing since it s the initial phase 27

28 PROJECT EVALUATION[6] More time is spent in Coding after gather the requirements in the first phase. 28

29 PROJECT EVALUATION[7] Lot of time spent on Documenting the things as per the ETDR standards. 29

30 FUTURE WORK Adding more Patterns To filter more different kinds of numerical phrases Improving the Output Display By displaying the number and date phrases in different colors To make it more readable for the user 30

31 LESSONS LEARNED Java Tool Usage Java Eclipse IDE Design Development MS Visio SDLC Documentation 31

32 PROTOTYPE DEMONSTRATION KSNES Project Set up as a Service on the CIS Server A webpage is set up: 32

33 FINAL STEPS Final Examination Ballot Make necessary changes to the MSE Portfolio Deliver the Portfolio 33

34 Questions?? Suggestions!! THANK YOU 34

KDD- Service based Numerical Entity Searcher (KSNES) Presentation 2 on March 31 st, Naga Sowjanya Karumuri. CIS 895 MSE PROJECT

KDD- Service based Numerical Entity Searcher (KSNES) Presentation 2 on March 31 st, Naga Sowjanya Karumuri. CIS 895 MSE PROJECT KDD- Service based Numerical Entity Searcher (KSNES) Presentation 2 on March 31 st, 2009 Naga Sowjanya Karumuri sowji@ksu.edu 1 CIS 895 MSE PROJECT OUTLINE Project Data Flow Diagram Action Items Architectural

More information

Vision Plan. For KDD- Service based Numerical Entity Searcher (KSNES) Version 2.0

Vision Plan. For KDD- Service based Numerical Entity Searcher (KSNES) Version 2.0 Vision Plan For KDD- Service based Numerical Entity Searcher (KSNES) Version 2.0 Submitted in partial fulfillment of the Masters of Software Engineering Degree. Naga Sowjanya Karumuri CIS 895 MSE Project

More information

Component Design. For KDD- Service based Numerical Entity Searcher (KSNES) Version 1.0

Component Design. For KDD- Service based Numerical Entity Searcher (KSNES) Version 1.0 Component Design For KDD- Service based Numerical Entity Searcher (KSNES) Version 1.0 Submitted in partial fulfillment of the Masters of Software Engineering degree. Naga Sowjanya Karumuri CIS 895 MSE

More information

Assignment #1: Named Entity Recognition

Assignment #1: Named Entity Recognition Assignment #1: Named Entity Recognition Dr. Zornitsa Kozareva USC Information Sciences Institute Spring 2013 Task Description: You will be given three data sets total. First you will receive the train

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

AIMMS Function Reference - Date Time Related Identifiers

AIMMS Function Reference - Date Time Related Identifiers AIMMS Function Reference - Date Time Related Identifiers This file contains only one chapter of the book. For a free download of the complete book in pdf format, please visit www.aimms.com Aimms 3.13 Date-Time

More information

Calendar PPF Production Cycles Non-Production Activities and Events

Calendar PPF Production Cycles Non-Production Activities and Events 20-207 Calendar PPF Production Cycles Non-Production Activities and Events Four Productions For non-holiday productions 7 Week Stage Cycles 36 Uses plus strike (as in prior years and per agreement with

More information

Natural Language Processing

Natural Language Processing Natural Language Processing Machine Learning Potsdam, 26 April 2012 Saeedeh Momtazi Information Systems Group Introduction 2 Machine Learning Field of study that gives computers the ability to learn without

More information

Pearson Edexcel Award

Pearson Edexcel Award Pearson Edexcel Award January 2018 Examination Timetable FINAL For more information on Edexcel qualifications please visit http://qualifications.pearson.com Pearson Edexcel Award January 2018 Examination

More information

Software Quality Assurance Plan

Software Quality Assurance Plan Software Quality Assurance Plan For KDD-Research Entity Search Tool (KREST) Version 1.2 Submitted in partial fulfillment of the Masters of Software Engineering degree. Eric Davis CIS 895 MSE Project Department

More information

INFORMATION TECHNOLOGY SPREADSHEETS. Part 1

INFORMATION TECHNOLOGY SPREADSHEETS. Part 1 INFORMATION TECHNOLOGY SPREADSHEETS Part 1 Page: 1 Created by John Martin Exercise Built-In Lists 1. Start Excel Spreadsheet 2. In cell B1 enter Mon 3. In cell C1 enter Tue 4. Select cell C1 5. At the

More information

Annotating Spatio-Temporal Information in Documents

Annotating Spatio-Temporal Information in Documents Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de

More information

Project Evaluation Online Book Store Phase-III. Vamsi Krishna Mummaneni

Project Evaluation Online Book Store Phase-III. Vamsi Krishna Mummaneni Project Evaluation Online Book Store Phase-III Submitted in partial fulfillment of the requirements of the degree of Master of Software Engineering Vamsi Krishna Mummaneni CIS 895 MSE Project Kansas State

More information

Section 1.2: What is a Function? y = 4x

Section 1.2: What is a Function? y = 4x Section 1.2: What is a Function? y = 4x y is the dependent variable because it depends on what x is. x is the independent variable because any value can be chosen to replace x. Domain: a set of values

More information

Data and Information Integration: Information Extraction

Data and Information Integration: Information Extraction International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Data and Information Integration: Information Extraction Varnica Verma 1 1 (Department of Computer Science Engineering, Guru Nanak

More information

CIMA Certificate BA Interactive Timetable

CIMA Certificate BA Interactive Timetable CIMA Certificate BA Interactive Timetable 2018 Nottingham & Leicester Version 3.2 Information last updated 09/03/18 Please note: Information and dates in this timetable are subject to change. Introduction

More information

Use Case Study: Reducing Patient No-Shows. Geisinger Health System Central and Northeastern Pennsylvania

Use Case Study: Reducing Patient No-Shows. Geisinger Health System Central and Northeastern Pennsylvania Use Case Study: Reducing Patient No-Shows Geisinger Health System Central and Northeastern Pennsylvania February 2014 Geisinger is a leading integrated health services organization widely recognized for

More information

Text mining tools for semantically enriching the scientific literature

Text mining tools for semantically enriching the scientific literature Text mining tools for semantically enriching the scientific literature Sophia Ananiadou Director National Centre for Text Mining School of Computer Science University of Manchester Need for enriching the

More information

Data Transfers in the Grid: Workload Analysis of Globus GridFTP

Data Transfers in the Grid: Workload Analysis of Globus GridFTP Data Transfers in the Grid: Workload Analysis of Globus GridFTP Nicolas Kourtellis, Lydia Prieto, Gustavo Zarrate, Adriana Iamnitchi University of South Florida Dan Fraser Argonne National Laboratory Objective

More information

corenlp-xml-reader Documentation

corenlp-xml-reader Documentation corenlp-xml-reader Documentation Release 0.0.4 Edward Newell Feb 07, 2018 Contents 1 Purpose 1 2 Install 3 3 Example 5 3.1 Instantiation............................................... 5 3.2 Sentences.................................................

More information

Ranking in a Domain Specific Search Engine

Ranking in a Domain Specific Search Engine Ranking in a Domain Specific Search Engine CS6998-03 - NLP for the Web Spring 2008, Final Report Sara Stolbach, ss3067 [at] columbia.edu Abstract A search engine that runs over all domains must give equal

More information

Pearson Edexcel Award

Pearson Edexcel Award Pearson Edexcel Award May June 2018 Examination Timetable FINAL For more information on Edexcel qualifications please visit http://qualifications.pearson.com v3 Pearson Edexcel Award 2018 Examination View

More information

Fast and Effective System for Name Entity Recognition on Big Data

Fast and Effective System for Name Entity Recognition on Big Data International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-3, Issue-2 E-ISSN: 2347-2693 Fast and Effective System for Name Entity Recognition on Big Data Jigyasa Nigam

More information

Math 2280: Introduction to Differential Equations- Syllabus

Math 2280: Introduction to Differential Equations- Syllabus Math 2280: Introduction to Differential Equations- Syllabus University of Utah Spring 2013 1 Basic Information Instructor - Patrick Dylan Zwick Email - zwick@math.utah.edu Phone - 801-651-8768 Office Hour

More information

ACE (Automatic Content Extraction) English Annotation Guidelines for Values. Version

ACE (Automatic Content Extraction) English Annotation Guidelines for Values. Version ACE (Automatic Content Extraction) English Annotation Guidelines for Values Version 1.2.4 Linguistic Data Consortium http://www.ldc.upenn.edu/projects/ace/ 1. Basic Concepts...3 1.1 What is a Value?...3

More information

Lab II - Product Specification Outline. CS 411W Lab II. Prototype Product Specification For CLASH. Professor Janet Brunelle Professor Hill Price

Lab II - Product Specification Outline. CS 411W Lab II. Prototype Product Specification For CLASH. Professor Janet Brunelle Professor Hill Price Lab II - Product Specification Outline CS 411W Lab II Prototype Product Specification For CLASH Professor Janet Brunelle Professor Hill Price Prepared by: Artem Fisan Date: 04/20/2015 Table of Contents

More information

name name C S M E S M E S M E Block S Block M C-Various October Sunday

name name C S M E S M E S M E Block S Block M C-Various October Sunday 01.10.2017 October Sunday 1 10 2017 02.10.2017 October Monday 2 10 2017 03.10.2017 October Tuesday 3 10 2017 Tag der Deutschen Einheit 04.10.2017 October Wednesday 4 10 2017 05.10.2017 October Thursday

More information

Question Answering System for Yioop

Question Answering System for Yioop Question Answering System for Yioop Advisor Dr. Chris Pollett Committee Members Dr. Thomas Austin Dr. Robert Chun By Niravkumar Patel Problem Statement Question Answering System Yioop Proposed System Triplet

More information

Math in Focus Vocabulary. Kindergarten

Math in Focus Vocabulary. Kindergarten Math in Focus Vocabulary Kindergarten Chapter Word Definition 1 one 1 * 1 two 2 * * 1 three 3 * * * 1 four 4 * * * * 1 five 5 * * * * * 1 same things that have a common property 1 different things that

More information

Example. Section: PS 709 Examples of Calculations of Reduced Hours of Work Last Revised: February 2017 Last Reviewed: February 2017 Next Review:

Example. Section: PS 709 Examples of Calculations of Reduced Hours of Work Last Revised: February 2017 Last Reviewed: February 2017 Next Review: Following are three examples of calculations for MCP employees (undefined hours of work) and three examples for MCP office employees. Examples use the data from the table below. For your calculations use

More information

Question Answering Using XML-Tagged Documents

Question Answering Using XML-Tagged Documents Question Answering Using XML-Tagged Documents Ken Litkowski ken@clres.com http://www.clres.com http://www.clres.com/trec11/index.html XML QA System P Full text processing of TREC top 20 documents Sentence

More information

B.2 Measures of Central Tendency and Dispersion

B.2 Measures of Central Tendency and Dispersion Appendix B. Measures of Central Tendency and Dispersion B B. Measures of Central Tendency and Dispersion What you should learn Find and interpret the mean, median, and mode of a set of data. Determine

More information

Mobley, Jenna W. Monday, February 23, :48 AM

Mobley, Jenna W. Monday, February 23, :48 AM DeLancy From: Sent: To: Subject: Mobley, Jenna W. Monday, February 23, 2015 7:48 AM DeLancy RE: Confirming next Monday's meeting Good morning, and thank you. We look forward

More information

Private Swimming Lessons

Private Swimming Lessons Private Swimming Lessons Private Lessons Designed for participants who would like a 1:1 ratio. Participants will receive individual attention to improve their swimming technique and have the convenience

More information

Rocky Hock Easter Musical

Rocky Hock Easter Musical Rocky Hock Easter Musical 3/12/13 Vol. 1314... We're in rehearsals now for our upcoming Easter production of Come, Follow Me and wanted to introduce you to a new member of the cast. Her name is Suzanne

More information

Natural Language Processing Tutorial May 26 & 27, 2011

Natural Language Processing Tutorial May 26 & 27, 2011 Cognitive Computation Group Natural Language Processing Tutorial May 26 & 27, 2011 http://cogcomp.cs.illinois.edu So why aren t words enough? Depends on the application more advanced task may require more

More information

PRIS at TAC2012 KBP Track

PRIS at TAC2012 KBP Track PRIS at TAC2012 KBP Track Yan Li, Sijia Chen, Zhihua Zhou, Jie Yin, Hao Luo, Liyin Hong, Weiran Xu, Guang Chen, Jun Guo School of Information and Communication Engineering Beijing University of Posts and

More information

ITSMR Research Note KEY FINDINGS. Crash Analyses: Ticket Analyses:

ITSMR Research Note KEY FINDINGS. Crash Analyses: Ticket Analyses: December 2018 KEY FINDINGS Crash Analyses: 2013-2017 Less than 1% of police-reported fatal and personal injury (F & PI) crashes involved the use of a cell phone over the five years, 2013-2017. 15 persons

More information

Schedule/BACnet Schedule

Schedule/BACnet Schedule Object Dictionary 1 Schedule/BACnet Schedule Introduction Note: The Johnson Controls Schedule object is considered a BACnet Schedule object because it supports BACnet functionality. In addition, this object

More information

Murdock-Portal Band Parent Handbook

Murdock-Portal Band Parent Handbook Murdock-Portal Band Parent Handbook for the 14/15 School Year *****PLEASE READ***** This handbook contains your child s CLASS SCHEDULE and THE TOP TEN THINGS BAND PARENTS NEED TO KNOW Top Ten Things Band

More information

INFORMATION EXTRACTION

INFORMATION EXTRACTION COMP90042 LECTURE 13 INFORMATION EXTRACTION INTRODUCTION Given this: Brasilia, the Brazilian capital, was founded in 1960. Obtain this: capital(brazil, Brasilia) founded(brasilia, 1960) Main goal: turn

More information

Automatic Metadata Extraction for Archival Description and Access

Automatic Metadata Extraction for Archival Description and Access Automatic Metadata Extraction for Archival Description and Access WILLIAM UNDERWOOD Georgia Tech Research Institute Abstract: The objective of the research reported is this paper is to develop techniques

More information

Named Entity Detection and Entity Linking in the Context of Semantic Web

Named Entity Detection and Entity Linking in the Context of Semantic Web [1/52] Concordia Seminar - December 2012 Named Entity Detection and in the Context of Semantic Web Exploring the ambiguity question. Eric Charton, Ph.D. [2/52] Concordia Seminar - December 2012 Challenge

More information

GreenThumb Garden Registration

GreenThumb Garden Registration GreenThumb Garden Registration 2015-2019 Garden Name Block Lot CB Jurisdiction Two members must provide full contact information on the license agreement, including phone numbers, addresses and emails.

More information

Meta-Content framework for back index generation

Meta-Content framework for back index generation Meta-Content framework for back index generation Tripti Sharma, Assistant Professor Department of computer science Chhatrapati Shivaji Institute of Technology. Durg, India triptisharma@csitdurg.in Sarang

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

QI TALK TIME. Run Charts. Speaker: Dr Michael Carton. Connect Improve Innovate. 19 th Dec Building an Irish Network of Quality Improvers

QI TALK TIME. Run Charts. Speaker: Dr Michael Carton. Connect Improve Innovate. 19 th Dec Building an Irish Network of Quality Improvers QI TALK TIME Building an Irish Network of Quality Improvers Run Charts Speaker: Dr Michael Carton 19 th Dec 2017 Connect Improve Innovate Speaker Speaker: Dr Michael Carton Michael s background is as a

More information

Tokyo Institute of Technology Style Guide

Tokyo Institute of Technology Style Guide Tokyo Institute of Technology Style Guide For Written Communication in English By English Documentation Support Services About the Style Guide Tokyo Institute of Technology (Tokyo Tech) presents an image

More information

Hands-Free Internet using Speech Recognition

Hands-Free Internet using Speech Recognition Introduction Trevor Donnell December 7, 2001 6.191 Preliminary Thesis Proposal Hands-Free Internet using Speech Recognition The hands-free Internet will be a system whereby a user has the ability to access

More information

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) CONTEXT SENSITIVE TEXT SUMMARIZATION USING HIERARCHICAL CLUSTERING ALGORITHM

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) CONTEXT SENSITIVE TEXT SUMMARIZATION USING HIERARCHICAL CLUSTERING ALGORITHM INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & 6367(Print), ISSN 0976 6375(Online) Volume 3, Issue 1, January- June (2012), TECHNOLOGY (IJCET) IAEME ISSN 0976 6367(Print) ISSN 0976 6375(Online) Volume

More information

How It All Stacks Up - or - Bar Charts with Plotly. ISC1057 Janet Peterson and John Burkardt Computational Thinking Fall Semester 2016

How It All Stacks Up - or - Bar Charts with Plotly. ISC1057 Janet Peterson and John Burkardt Computational Thinking Fall Semester 2016 * How It All Stacks Up - or - Bar Charts with Plotly ISC1057 Janet Peterson and John Burkardt Computational Thinking Fall Semester 2016 In a game of poker, players bet by tossing chips into the center

More information

V. Thulasinath M.Tech, CSE Department, JNTU College of Engineering Anantapur, Andhra Pradesh, India

V. Thulasinath M.Tech, CSE Department, JNTU College of Engineering Anantapur, Andhra Pradesh, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2017 IJSRCSEIT Volume 2 Issue 5 ISSN : 2456-3307 Natural Language Interface to Database Using Modified

More information

The Anatomy of a Large-Scale Hypertextual Web Search Engine

The Anatomy of a Large-Scale Hypertextual Web Search Engine The Anatomy of a Large-Scale Hypertextual Web Search Engine Article by: Larry Page and Sergey Brin Computer Networks 30(1-7):107-117, 1998 1 1. Introduction The authors: Lawrence Page, Sergey Brin started

More information

WEBSITE AUDIT CHECKLIST. Branding

WEBSITE AUDIT CHECKLIST. Branding WEBSITE AUDIT CHECKLIST Branding Colour Typography Images Videos About Us Page Active Voice Abbreviations/acronyms Alumni Building Names Use the Western University and the Schulich School of Medicine &

More information

Dependency grammar and dependency parsing

Dependency grammar and dependency parsing Dependency grammar and dependency parsing Syntactic analysis (5LN455) 2015-12-09 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Activities - dependency parsing

More information

Style Guide. For Athletics logos and standards, please contact Athletics or visit

Style Guide. For Athletics logos and standards, please contact Athletics or visit Style Guide This guide is produced by the Office of Communications & Public Relations. Updates and suggestions can be directed to lucomm@lincoln.edu. This document is available online at the Communications

More information

Susan had $50 to spend at the carnival. She spent $12 on food and twice as much on rides. How many dollars did she have left to spend?

Susan had $50 to spend at the carnival. She spent $12 on food and twice as much on rides. How many dollars did she have left to spend? Susan had $50 to spend at the carnival. She spent $12 on food and twice as much on rides. How many dollars did she have left to spend? (A) 12 (B) 14 (C) 26 (D) 38 (E) 50 2008 AMC 8, Problem #1 Susan spent

More information

Cybersecurity is a Team Sport

Cybersecurity is a Team Sport Cybersecurity is a Team Sport Cyber Security Summit at Loyola Marymount University - October 22 2016 Dr. Robert Pittman, CISM Chief Information Security Officer National Cyber Security Awareness Month

More information

Leadership Training Winter

Leadership Training Winter Cancellation Policy: Pre-registration and payment are required. Hopefully you won't need to cancel, but if you do, please do so at least three business days prior to the course for a full refund. No refunds

More information

The Optical Receipt Management Application. Design Document

The Optical Receipt Management Application. Design Document The Optical Receipt Management Application Design Document Version 1.0 Garry Ledford Roberto Vieras John Klein Charles Reed Advisor: Professor Jeff Salvage 1 Document History... 4 1. Introduction... 5

More information

I Just Missed a $450 Paypay I am Playing 958 This Week 955 Hit Yesterday in New York

I Just Missed a $450 Paypay I am Playing 958 This Week 955 Hit Yesterday in New York I Just Missed a $450 Paypay I am Playing 958 This Week 955 Hit Yesterday in New York 1 2 I am still working on the 7 Day Coding System report. In all likelihood, this will be my final solution to the Pick

More information

TIC: A Topic-based Intelligent Crawler

TIC: A Topic-based Intelligent Crawler 2011 International Conference on Information and Intelligent Computing IPCSIT vol.18 (2011) (2011) IACSIT Press, Singapore TIC: A Topic-based Intelligent Crawler Hossein Shahsavand Baghdadi and Bali Ranaivo-Malançon

More information

Dependency grammar and dependency parsing

Dependency grammar and dependency parsing Dependency grammar and dependency parsing Syntactic analysis (5LN455) 2016-12-05 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Activities - dependency parsing

More information

Apache UIMA and Mayo ctakes

Apache UIMA and Mayo ctakes Apache and Mayo and how it is used in the clinical domain March 16, 2012 Apache and Mayo Outline 1 Apache and Mayo Outline 1 2 Introducing Pipeline Modules Apache and Mayo What is? (You - eee - muh) Unstructured

More information

Entity-centric Topic Extraction and Exploration: A Network-based Approach

Entity-centric Topic Extraction and Exploration: A Network-based Approach Entity-centric Topic Extraction and Exploration: A Network-based Approach Andreas Spitz and Michael Gertz March 27, 2018 ECIR 2018, Grenoble Heidelberg University, Germany Database Systems Research Group

More information

Food service training & certification

Food service training & certification Food service training & certification Required by the State of South Dakota Training Schedule 2012 South Dakota Retailers Association Serving safe food is not an option... The state of South Dakota requires

More information

Information Extraction

Information Extraction Information Extraction Tutor: Rahmad Mahendra rahmad.mahendra@cs.ui.ac.id Slide by: Bayu Distiawan Trisedya Main Reference: Stanford University Natural Language Processing & Text Mining Short Course Pusat

More information

The KNIME Text Processing Plugin

The KNIME Text Processing Plugin The KNIME Text Processing Plugin Kilian Thiel Nycomed Chair for Bioinformatics and Information Mining, University of Konstanz, 78457 Konstanz, Deutschland, Kilian.Thiel@uni-konstanz.de Abstract. This document

More information

Natural Language Processing. SoSe Question Answering

Natural Language Processing. SoSe Question Answering Natural Language Processing SoSe 2017 Question Answering Dr. Mariana Neves July 5th, 2017 Motivation Find small segments of text which answer users questions (http://start.csail.mit.edu/) 2 3 Motivation

More information

American Board of Addiction Medicine

American Board of Addiction Medicine American Board of Addiction Medicine Frequently Asked Questions 1. What are the requirements for taking the certification or recertification exam? Initial certification requires verification of a. Graduation

More information

Reshaping Text Data for Efficient Processing on Amazon EC2. Gabriela Turcu, Ian Foster, Svetlozar Nestorov

Reshaping Text Data for Efficient Processing on Amazon EC2. Gabriela Turcu, Ian Foster, Svetlozar Nestorov Reshaping Text Data for Efficient Processing on Amazon EC2 Gabriela Turcu, Ian Foster, Svetlozar Nestorov Outline Motivation Goals: Determine empirically simple application performance model Statically

More information

Let s get parsing! Each component processes the Doc object, then passes it on. doc.is_parsed attribute checks whether a Doc object has been parsed

Let s get parsing! Each component processes the Doc object, then passes it on. doc.is_parsed attribute checks whether a Doc object has been parsed Let s get parsing! SpaCy default model includes tagger, parser and entity recognizer nlp = spacy.load('en ) tells spacy to use "en" with ["tagger", "parser", "ner"] Each component processes the Doc object,

More information

Department Highlights. Annie Rosenfeld, Director of Risk Management & Real Property- April 2018

Department Highlights. Annie Rosenfeld, Director of Risk Management & Real Property- April 2018 Department Highlights Annie Rosenfeld, Director of Risk Management & Real Property- April 2018 Covenants Highlights Covenants Enforcement: 7-days a week Monday Thursday 8:00 a.m. 5:00 p.m. Friday Sunday

More information

Cambridge English Dates and Fees for 2018

Cambridge English Dates and Fees for 2018 Cambridge English Dates and Fees for 2018 Cambridge English: Key (KET) KET 10,900.00 10,500.00 10,300.00 Saturday, 17 March Thursday, 01 February 9 March 18 March Saturday, 12 May Friday, 6 April 4 May

More information

For personal use only. Update Event & nearmap Solar

For personal use only. Update Event & nearmap Solar Update Event & nearmap Solar Update Event and nearmap Solar Paul Peterson Senior VP Product & Engineering 2 Current Clear Change Current What s on the ground now Clear Unrivalled clarity Change Monitor

More information

Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules

Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules Xiaoshi Zhong, Aixin Sun, and Erik Cambria Computer Science and Engineering Nanyang Technological University

More information

Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras

Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras Lecture - 25 Tutorial 5: Analyzing text using Python NLTK Hi everyone,

More information

Vehicle Network Seminar Series

Vehicle Network Seminar Series CAN and Higher Layer Protocols One of our most popular seminars, this course is relevant for passenger cars and light duty trucks. CAN (Controller Area Network) is the worldwide standard for automotive

More information

ITSMR Research Note. Crashes Involving Cell Phone Use and Distracted Driving KEY FINDINGS ABSTRACT INTRODUCTION. Crash Analyses.

ITSMR Research Note. Crashes Involving Cell Phone Use and Distracted Driving KEY FINDINGS ABSTRACT INTRODUCTION. Crash Analyses. December 2016 KEY FINDINGS Crash Analyses Less than 1% of police-reported fatal and personal injury (F & PI) crashes involved the use of a cell phone over the five years, 2011-2015. 12 persons were killed

More information

CS 572: Information Retrieval. Lecture 1: Course Overview and Introduction 11 January 2016

CS 572: Information Retrieval. Lecture 1: Course Overview and Introduction 11 January 2016 CS 572: Information Retrieval Lecture 1: Course Overview and Introduction 11 January 2016 1/11/2016 CS 572: Information Retrieval. Spring 2016 1 Lecture Plan What is IR? (the big questions) Course overview

More information

Spiegel Research 3.0 The Mobile App Story

Spiegel Research 3.0 The Mobile App Story Spiegel Research 3.0 The Mobile App Story The effects of adopting and using a brand s mobile application on purchase behaviors SU JUNG KIM THE PROJECT Smartphone penetration in the U.S. has reached 68

More information

Database Design with Entity Relationship Model

Database Design with Entity Relationship Model Database Design with Entity Relationship Model Vijay Kumar SICE, Computer Networking University of Missouri-Kansas City Kansas City, MO kumarv@umkc.edu Database Design Process Database design process integrates

More information

Programming with CUDA

Programming with CUDA Programming with CUDA Jens K. Mueller jkm@informatik.uni-jena.de Department of Mathematics and Computer Science Friedrich-Schiller-University Jena Monday 4 th April, 2011 Today s lecture: Organization

More information

Plan Smart: Don t Leave Your End of Year Campaigns to Chance Convio, Inc. Page 1

Plan Smart: Don t Leave Your End of Year Campaigns to Chance Convio, Inc. Page 1 Plan Smart: Don t Leave Your End of Year Campaigns to Chance 2009 Convio, Inc. Page 1 July 2009 Q&A You can ask a question at any time using the Q&A chat box at the top of your screen. All questions will

More information

Training of BRs/NCs reviewers and experts for Biennial Update Reports technical analysis. 5 th BRs and NCs lead reviewers meeting

Training of BRs/NCs reviewers and experts for Biennial Update Reports technical analysis. 5 th BRs and NCs lead reviewers meeting Training of BRs/NCs reviewers and experts for Biennial Update Reports technical analysis 5 th BRs and NCs lead reviewers meeting Kyoko Miwa, Haike Stephen Mitigation Data Analysis programme, UNFCCC secretariat

More information

A tool for Cross-Language Pair Annotations: CLPA

A tool for Cross-Language Pair Annotations: CLPA A tool for Cross-Language Pair Annotations: CLPA August 28, 2006 This document describes our tool called Cross-Language Pair Annotator (CLPA) that is capable to automatically annotate cognates and false

More information

Néonaute: mining web archives for linguistic analysis

Néonaute: mining web archives for linguistic analysis Néonaute: mining web archives for linguistic analysis Sara Aubry, Bibliothèque nationale de France Emmanuel Cartier, LIPN, University of Paris 13 Peter Stirling, Bibliothèque nationale de France IIPC Web

More information

AP Statistics Assignments Mr. Kearns José Martí MAST 6-12 Academy

AP Statistics Assignments Mr. Kearns José Martí MAST 6-12 Academy AP Statistics Assignments Mr. Kearns José Martí MAST 6-12 Academy 2016-2017 Date Assigned Assignments Interested in Join the Edmodo group 2017 Summer Work Group for community service Green Club using the

More information

3 Steps To Create A Pipeline Full of Your Ideal Corporate Decision Makers Using LinkedIn

3 Steps To Create A Pipeline Full of Your Ideal Corporate Decision Makers Using LinkedIn 3 Steps To Create A Pipeline Full of Your Ideal Corporate Decision Makers Using LinkedIn by Ana Melikian, Paul G. McManus, & JoAnne Henein Copyright 2017 MORE CLIENTS MORE FUN LLC 1 How to Quickly Bypass

More information

Conditional Formatting

Conditional Formatting Microsoft Excel 2013: Part 5 Conditional Formatting, Viewing, Sorting, Filtering Data, Tables and Creating Custom Lists Conditional Formatting This command can give you a visual analysis of your raw data

More information

Scheduling. Scheduling Tasks At Creation Time CHAPTER

Scheduling. Scheduling Tasks At Creation Time CHAPTER CHAPTER 13 This chapter explains the scheduling choices available when creating tasks and when scheduling tasks that have already been created. Tasks At Creation Time The tasks that have the scheduling

More information

Pulse of The Industry Periodicals Volume PAG Initiatives. Incentives & Promotions. Open Discussion. Agenda

Pulse of The Industry Periodicals Volume PAG Initiatives. Incentives & Promotions. Open Discussion. Agenda November 19, 2014 Agenda Pulse of The Industry Periodicals Volume PAG Initiatives Incentives & Promotions 2014 Promotions Saturation & High Density Incentive Every Door Direct Mail Alternate Postage Proposed

More information

Information and Enrolment Session

Information and Enrolment Session CPA Information Session for Session 1 2019 Master of Accounting (CPA Program) Master of Accounting (Advanced) (CPA Program) Master of Advanced Professional Accounting Information and Enrolment Session

More information

Information Extraction Techniques in Terrorism Surveillance

Information Extraction Techniques in Terrorism Surveillance Information Extraction Techniques in Terrorism Surveillance Roman Tekhov Abstract. The article gives a brief overview of what information extraction is and how it might be used for the purposes of counter-terrorism

More information

Admin. ! Assignment 3. ! due Monday at 11:59pm! one small error in 5b (fast division) that s been fixed. ! Midterm next Thursday in-class (10/1)

Admin. ! Assignment 3. ! due Monday at 11:59pm! one small error in 5b (fast division) that s been fixed. ! Midterm next Thursday in-class (10/1) Admin CS4B MACHINE David Kauchak CS 5 Fall 5! Assignment 3! due Monday at :59pm! one small error in 5b (fast division) that s been fixed! Midterm next Thursday in-class (/)! Comprehensive! Closed books,

More information

General course information

General course information General Instructor: B. Hyle Park (MSE 243 / Bourns B207, hylepark@engr.ucr.edu) Teaching assistant: Junchao Wang (MSE 217, jwang071@ucr.edu) Reader: Michael Xiong (MSE 217, zhehao.xiong@email.ucr.edu)

More information

Opportunity: BaltimoreLink

Opportunity: BaltimoreLink Opportunity: BaltimoreLink Cities and Transit: Reimagined, Redesigned, and Reborn Rail~Volution October 11, 2016 Joshua B Diamond 51 Monroe St, Suite 1103 Rockville, MD 20850 301-774-4566 X 410 jdiamond@foursquareitp.com

More information

CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING

CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING 94 CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING 5.1 INTRODUCTION Expert locator addresses the task of identifying the right person with the appropriate skills and knowledge. In large organizations, it

More information

SOUTH DAKOTA BOARD OF REGENTS. Board Work ******************************************************************************

SOUTH DAKOTA BOARD OF REGENTS. Board Work ****************************************************************************** SOUTH DAKOTA BOARD OF REGENTS Board Work AGENDA ITEM: 1 G DATE: August 7-9, 2018 ****************************************************************************** SUBJECT Rolling Calendar CONTROLLING STATUTE,

More information

Question Answering Approach Using a WordNet-based Answer Type Taxonomy

Question Answering Approach Using a WordNet-based Answer Type Taxonomy Question Answering Approach Using a WordNet-based Answer Type Taxonomy Seung-Hoon Na, In-Su Kang, Sang-Yool Lee, Jong-Hyeok Lee Department of Computer Science and Engineering, Electrical and Computer Engineering

More information