KDD-Service based Numerical Entity Searcher (KSNES): Presentation 3 on April 14th, 2009. Naga Sowjanya Karumuri. CIS 895 MSE PROJECT
1 KDD-Service based Numerical Entity Searcher (KSNES)
Presentation 3 on April 14th, 2009
Naga Sowjanya Karumuri, sowji@ksu.edu
CIS 895 MSE PROJECT
2 OUTLINE
- Introduction
- Terms
- Motivation
- Goal
- Project Overview
- Project Data Flow Diagram
- Component Design
- Project Evaluation
- Future Work
- Prototype Demonstration
- Questions / Comments
3 TERMS[1]
- Knowledge Discovery in Databases (KDD)
  - A group headed by Dr. Hsu
  - Primary focus is machine learning, data mining, and human-computer intelligent interaction
- Natural Language Processing (NLP)
  - Allows computers to process and understand human languages
  - Some areas:
    - Text segmentation (identifying word boundaries)
    - Part-of-speech tagging
    - Word sense disambiguation (words with more than one meaning)
4 TERMS[2]
- Named Entity Recognition (NER)
  - Locating and classifying atomic elements (single part of speech) in text into predefined categories such as:
    - Names of Persons
    - Names of Locations
    - Names of Organizations
    - Names of Miscellaneous Entities
- Example
  - Dr. William H. Hsu is a Professor at Kansas State University located in Manhattan, Kansas.
  - Dr. [PER William H. Hsu ] is a Professor at [ORG Kansas State University ] located in [LOC Manhattan ], [LOC Kansas ].
5 TERMS[3]
- Shallow Parsing/Chunking
  - An NLP technique that looks for key phrases without fully parsing the sentence into a parse tree.
  - Output: a series of phrases, mostly noun, verb, and prepositional phrases.
- Example
  - Chunker: [NP He ] [VP reckons ] [NP the current account deficit ] [VP will narrow ] [PP to ] [NP only L1.8 billion ]
  - Full Parser: (PRP)He (VBZ)reckons (DT)the (JJ)current (NN)account (NN)deficit (MD)will (VB)narrow (TO)to (RB)only (L)L (CD)1.8 (CD)billion
6 PROJECT OVERVIEW[1]
- Motivation
  - The occurrence of events is naturally anchored in time within narrative text
    - Is Bush currently the President of America?
    - When was India attacked by Pakistan in the last century?
  - To know the quantities of entities
    - How many Oscar awards have been won by Steven Spielberg?
    - What was the highest temperature recorded in the year 2008?
7 PROJECT OVERVIEW[2]
- Goal
  - To develop a system that
    - extracts numerical phrases from raw text
    - displays value, unit, and unit-type
  - The system is set up as a service on the web server
  - The user interacts through a webpage
- Numerical Phrase: Types
  - Number Phrase: 33 dollars, 100 Watts, 13 years, two miles
  - Date Phrase: Aug 1998, Nov 10th 1984, between 1989 and
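The value / unit / unit-type output described on this slide could look roughly like the following. This is a minimal sketch, not the project's code: the class name `NumberPhraseSketch`, the regex, and the unit-to-type table are all assumptions made for illustration, since the slides do not show the real ones.

```java
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NumberPhraseSketch {
    // Hypothetical unit-to-type table; the slides do not list the real mapping.
    static final Map<String, String> UNIT_TYPES = Map.of(
        "dollars", "currency",
        "watts", "power",
        "years", "time",
        "miles", "length");

    // Assumed shape: a value (digits or a number word) followed by one unit word.
    static final Pattern PHRASE =
        Pattern.compile("(\\d[\\d,.]*|[a-zA-Z-]+)\\s+([a-zA-Z]+)");

    /** Returns {value, unit, unitType}, or null if the phrase does not fit the shape. */
    static String[] split(String phrase) {
        Matcher m = PHRASE.matcher(phrase.trim());
        if (!m.matches()) {
            return null;
        }
        String unit = m.group(2);
        String type = UNIT_TYPES.getOrDefault(unit.toLowerCase(), "unknown");
        return new String[] { m.group(1), unit, type };
    }

    public static void main(String[] args) {
        // The four number phrases listed on the slide.
        for (String p : new String[] { "33 dollars", "100 Watts", "13 years", "two miles" }) {
            System.out.println(String.join(" | ", split(p)));
        }
    }
}
```

Date phrases would need their own handling; this sketch only covers the simple "value unit" shape.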
8 PROJECT OVERVIEW[3]
- Purpose
  - To understand the timestamp of an event
  - To understand the order of occurrence of events
  - To understand the persistence of an event, i.e., the time period over which the event occurred and continued
- For the KDD Group
  - To gather certain statistical information from the data they collect by crawling different web pages
    - How many cattle have been affected by the virus?
    - When did the disease break out?
  - Sample NABC (National Agricultural Bio-Security Centre) data is given to the system for testing
9 APPLICATION AREAS
- Textual Entailment (TE) Recognition
  - Given two text fragments, decides whether the meaning of one can be inferred from the other.
- Question Answering (QA) System
  - Identifies text that entails the expected answer.
- Example: During 1997, 10,000 cattle were killed because of the RVF.
  - Possible inferences (TE)
    - 10,000 cattle were killed because of RVF.
    - RVF occurred during
  - Possible questions (QA)
    - How many cattle were killed during the 1997 RVF outbreak?
    - When did RVF occur?
10 SYSTEM OVERVIEW
11 PROJECT DATA FLOW DIAGRAM: NUMERICAL ENTITY SEARCHER
12 MODULES IN THE PROJECT
- Webpage (JSP): for requesting and receiving information from the service.
- POS Tagger (Java): Stanford POS Tagger.
- Numerical Phrase Extractor (Java): implemented using the shallow parsing technique.
- Number-Unit/Date Pattern Recognizer (Java): implemented based on the Numerical Quantifier developed by Benjamin Sapp, UIUC.
13 POS TAGGER TAGSET
14 IMPLEMENTING NUMERICAL PHRASE EXTRACTOR
- Input: tagged text
  - I/PRP lost/VBD thirty-three/JJ dollars/NNS in/IN 1998/CD
- Regular expressions (regex) are used to determine the numerical patterns in the input.
  - thirty-three/JJ dollars/NNS in/IN 1998/CD
- Output: numerical phrases
  - thirty-three dollars in
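The extraction step on this slide (regex over POS-tagged text, then tags dropped from the match) can be sketched as below. The class name `TaggedPhraseSketch` and the regex are assumptions for illustration only; the project's actual patterns are the ones listed on the later slides. The sketch assumes tokens of the form `word/TAG` separated by single spaces.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class TaggedPhraseSketch {
    // Assumed pattern: a hyphenated adjective or a cardinal number, then a noun,
    // optionally followed by a preposition and another cardinal number.
    static final Pattern NUMERIC = Pattern.compile(
        "(?:[a-zA-Z]+-[a-zA-Z]+/JJ|\\S+/CD) [a-zA-Z]+/NNS?(?: [a-zA-Z]+/IN \\S+/CD)?");

    /** Finds numerical phrases in tagged text and returns them without tags. */
    static List<String> extract(String tagged) {
        List<String> phrases = new ArrayList<>();
        Matcher m = NUMERIC.matcher(tagged);
        while (m.find()) {
            // Drop the "/TAG" suffix from every token of the matched span.
            phrases.add(m.group().replaceAll("/[A-Z$]+", ""));
        }
        return phrases;
    }

    public static void main(String[] args) {
        String tagged = "I/PRP lost/VBD thirty-three/JJ dollars/NNS in/IN 1998/CD";
        System.out.println(extract(tagged)); // prints [thirty-three dollars in 1998]
    }
}
```

Matching on the tagged string rather than the raw text lets a pattern constrain both the word shape and its part of speech in one expression.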
15 SOME PATTERNS
- "\\d+-\\d+(/JJ|/CD) [a-zA-Z]+/NN" parses:
  - \\d+-\\d+(/JJ|/CD): 3-2/JJ, 20-20/JJ
  - [a-zA-Z]+/NN: lead/NN, match/NN
- "(between|Between|from|From|In|in|since|Since|during|During)/IN.../CD (([a-zA-Z]+/CC|[a-z]+/TO).../CD)?" parses:
  - 'between 1987 and 1997', 'in 2007 and
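A minimal sketch of how patterns of this kind behave under `java.util.regex`. Two assumptions are made and should be read as such: the alternation bars in the patterns appear to have been lost in the transcript and are restored here as `|`, and the elided `...` part of the date-range pattern is replaced by a four-digit year (`\\d{4}`) purely for illustration. The class name `SlidePatternSketch` is hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class SlidePatternSketch {
    // Score-like phrase: "3-2/JJ lead/NN", "20-20/JJ match/NN".
    static final Pattern SCORE =
        Pattern.compile("\\d+-\\d+(/JJ|/CD) [a-zA-Z]+/NN");

    // Date range: the \\d{4} year is an assumption standing in for the elided part.
    static final Pattern RANGE = Pattern.compile(
        "(between|Between|from|From|In|in|since|Since|during|During)/IN \\d{4}/CD"
        + "( ([a-zA-Z]+/CC|[a-z]+/TO) \\d{4}/CD)?");

    /** Returns the first span of the tagged text matched by the pattern, or null. */
    static String firstMatch(Pattern p, String tagged) {
        Matcher m = p.matcher(tagged);
        return m.find() ? m.group() : null;
    }

    public static void main(String[] args) {
        System.out.println(firstMatch(SCORE, "a/DT 3-2/JJ lead/NN"));
        System.out.println(firstMatch(RANGE, "between/IN 1987/CD and/CC 1997/CD"));
        System.out.println(firstMatch(RANGE, "in/IN 2007/CD"));
    }
}
```

The trailing group in `RANGE` is optional, so a single-year phrase such as `in/IN 2007/CD` still matches; the second year is consumed only when a conjunction or `to` follows.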
16 COMPONENT DESIGN
- Contains class variables and functions
- Added a separate table to describe the roles of functions
17 COMPONENT DESIGN (MYPATTERNS)[1]
Patterns: Matching Numerical Phrases
- p_words: about, around, approximately, more than, nearly, almost, no more than, at least, less than, no fewer than
- p_tnl: this, next, last
- p_inl: since, in between, from, in, since, during
- p_words + p_abtfrac: about two-thirds of the vote, millions of books
- p_words + p_age: 27 year-old bachelor, 27-year-old bachelor
- p_words + p_ampm: About 3:00 a.m., 4:15 p.m. CST
- p_and: 3,792 children and adolescents
- p_tnl + p_anydate: Oct 1st 1987, Nov 5, December 21,
18 COMPONENT DESIGN (MYPATTERNS)[2]
Patterns: Matching Numerical Phrases
- p_inl + p_btwfrm: between 1987 and 1997, in 2007 and 2008
- p_inl + p_btwfrmd: from 200 to 300 miles, from 7.5 percent to 6.85 percent
- p_date: 18 April 2008
- p_tnl + p_days: this Monday, next Saturday, last Friday, Tuesday, Wednesday
- p_centuary: 17th century, 17th-century
- p_words + p_hyphenww: million-dollar home, six-bedroom home, thirty-three dollars
- p_hyphennumnum: the match, a 3-2 lead
- p_in: 9 in 10 people, 1 in every 8 women
- p_mids: mid-1990s, the early 1990s, 1970s
- p_months: January, February, December, Jan, Feb, Sept, Dec
19 COMPONENT DESIGN (MYPATTERNS)[3]
Patterns: Matching Numerical Phrases
- p_words + p_numunit: 33 USD, about 34 miles, 33,333 tons, 3.3 million dollars, one thing, 3.4 billion
- p_words + p_per: $33 per day, about 100 miles per hour
- p_words + p_percentinches: 39%, 0.5-1%, about 90 %, 20"
- p_ratio: one of the five people, 89 percent of people, 3 out of 5 people
- p_tty: today, tomorrow, yesterday, noon
- p_twmy: this year, this month, next year, next month, last week, last year, last month
- p_xbits: 1024KB, 8MB, 320GB, 1TB
- p_words + p_yrange: In , during
20 SAMPLE SENTENCES[1]
Sentence: Patterns
- I have lost 33,000 dinars in 1998: p_numunit, p_btwfrm
- At just 12-years-old, he enrolled as a freshman at F.I.U. in Miami.: p_age
- The 20" iMac is cheaper at $1200 and it has a 320GB hard drive.: p_percentinches, p_numunit, p_xbits
- Volunteers bring in a heavy crane for work on a bridge last month.: p_twmy
- As for those who do not invest, around 40% say capitalism is better.: p_percentinches
- As of 7 January 2007, about 75 people have died and another 183 infected.: p_date, p_numunit
21 SAMPLE SENTENCES[2]
Sentence: Patterns
- Approximately 1% of human sufferers die of the disease.: p_percentinches
- Current listings of 2,000 children and adults who are reported missing, including in-depth coverage of high-profile cases.: p_and
- 38 of the 62 patients who provided blood samples tested positive.: p_ratio
- She became an exotic dancer at Scores in New York City in the mid-1990s.: p_mids
- Peterson's three capped the surge, giving New Orleans a lead.: p_numunit, p_hyphennumnum
22 PROBLEMS ENCOUNTERED
- Determining the patterns
  - A large number of numerical phrases were found
  - Patterns were designed so that each filters more than one kind of numerical phrase
- Prioritizing the patterns
  - More than one pattern may match the same numerical phrase
  - Patterns are prioritized to avoid clashes between them
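One common way to resolve the clash described on this slide is to try patterns in a fixed priority order and accept the first that matches. The sketch below assumes that approach; the slides do not say how KSNES actually prioritizes. The pattern names are borrowed from the component design slides, but the regexes here are simplified stand-ins written for this example, and the class name `PatternPrioritySketch` is hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Pattern;

class PatternPrioritySketch {
    // LinkedHashMap keeps insertion order, which serves as the priority order here.
    static final Map<String, Pattern> ORDERED = new LinkedHashMap<>();
    static {
        // More specific patterns come first; the regexes are illustrative only.
        ORDERED.put("p_xbits", Pattern.compile("\\d+(KB|MB|GB|TB)"));
        ORDERED.put("p_percentinches", Pattern.compile("\\d+(\\.\\d+)? ?%"));
        // General number-unit pattern last: it would also match "320GB".
        ORDERED.put("p_numunit", Pattern.compile("\\d[\\d,.]* ?[a-zA-Z]+"));
    }

    /** Returns the name of the highest-priority pattern matching the whole phrase. */
    static String classify(String phrase) {
        for (Map.Entry<String, Pattern> e : ORDERED.entrySet()) {
            if (e.getValue().matcher(phrase).matches()) {
                return e.getKey();
            }
        }
        return "unmatched";
    }

    public static void main(String[] args) {
        System.out.println(classify("320GB"));      // p_xbits, although p_numunit also matches
        System.out.println(classify("33 dollars")); // p_numunit
        System.out.println(classify("39%"));        // p_percentinches
    }
}
```

Putting the general pattern last means a phrase such as `320GB` is labeled by the specific pattern even though both match it.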
23 PROJECT EVALUATION[1]
Test Case | Main Functionality Tested | Pass/Fail
Test Case 1 | Application Functionality | Pass
Test Case 2 | POS Tagger Functionality | Pass
Test Case 3 | Numerical Phrase Extractor Functionality | Pass
Test Case 4 | Number-Unit/Date Pattern Recognizer Functionality | Pass
24 PROJECT EVALUATION[2] Phase Expected Completion Phase Actual Completion Phase 1 February 26, 2009 February 24, March 26, 2009 March 31, April 17, 2009 April 14,
25 PROJECT EVALUATION[3] Phase 2 took more time since implementation and testing were done simultaneously.
26 PROJECT EVALUATION[4] More time was spent on coding and documentation.
27 PROJECT EVALUATION[5] More time was spent on discussion since it was the initial phase.
28 PROJECT EVALUATION[6] More time was spent on coding after gathering the requirements in the first phase.
29 PROJECT EVALUATION[7] A lot of time was spent on documentation as per the ETDR standards.
30 FUTURE WORK
- Adding more patterns
  - To filter more kinds of numerical phrases
- Improving the output display
  - By displaying the number and date phrases in different colors
  - To make it more readable for the user
31 LESSONS LEARNED
- Java
- Tool Usage: Java Eclipse IDE
- Design Development: MS Visio
- SDLC
- Documentation
32 PROTOTYPE DEMONSTRATION
- The KSNES project is set up as a service on the CIS server
- A webpage is set up:
33 FINAL STEPS
- Final Examination Ballot
- Make necessary changes to the MSE Portfolio
- Deliver the Portfolio
34 Questions? Suggestions? THANK YOU
3 Steps To Create A Pipeline Full of Your Ideal Corporate Decision Makers Using LinkedIn by Ana Melikian, Paul G. McManus, & JoAnne Henein Copyright 2017 MORE CLIENTS MORE FUN LLC 1 How to Quickly Bypass
More informationConditional Formatting
Microsoft Excel 2013: Part 5 Conditional Formatting, Viewing, Sorting, Filtering Data, Tables and Creating Custom Lists Conditional Formatting This command can give you a visual analysis of your raw data
More informationScheduling. Scheduling Tasks At Creation Time CHAPTER
CHAPTER 13 This chapter explains the scheduling choices available when creating tasks and when scheduling tasks that have already been created. Tasks At Creation Time The tasks that have the scheduling
More informationPulse of The Industry Periodicals Volume PAG Initiatives. Incentives & Promotions. Open Discussion. Agenda
November 19, 2014 Agenda Pulse of The Industry Periodicals Volume PAG Initiatives Incentives & Promotions 2014 Promotions Saturation & High Density Incentive Every Door Direct Mail Alternate Postage Proposed
More informationInformation and Enrolment Session
CPA Information Session for Session 1 2019 Master of Accounting (CPA Program) Master of Accounting (Advanced) (CPA Program) Master of Advanced Professional Accounting Information and Enrolment Session
More informationInformation Extraction Techniques in Terrorism Surveillance
Information Extraction Techniques in Terrorism Surveillance Roman Tekhov Abstract. The article gives a brief overview of what information extraction is and how it might be used for the purposes of counter-terrorism
More informationAdmin. ! Assignment 3. ! due Monday at 11:59pm! one small error in 5b (fast division) that s been fixed. ! Midterm next Thursday in-class (10/1)
Admin CS4B MACHINE David Kauchak CS 5 Fall 5! Assignment 3! due Monday at :59pm! one small error in 5b (fast division) that s been fixed! Midterm next Thursday in-class (/)! Comprehensive! Closed books,
More informationGeneral course information
General Instructor: B. Hyle Park (MSE 243 / Bourns B207, hylepark@engr.ucr.edu) Teaching assistant: Junchao Wang (MSE 217, jwang071@ucr.edu) Reader: Michael Xiong (MSE 217, zhehao.xiong@email.ucr.edu)
More informationOpportunity: BaltimoreLink
Opportunity: BaltimoreLink Cities and Transit: Reimagined, Redesigned, and Reborn Rail~Volution October 11, 2016 Joshua B Diamond 51 Monroe St, Suite 1103 Rockville, MD 20850 301-774-4566 X 410 jdiamond@foursquareitp.com
More informationCHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING
94 CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING 5.1 INTRODUCTION Expert locator addresses the task of identifying the right person with the appropriate skills and knowledge. In large organizations, it
More informationSOUTH DAKOTA BOARD OF REGENTS. Board Work ******************************************************************************
SOUTH DAKOTA BOARD OF REGENTS Board Work AGENDA ITEM: 1 G DATE: August 7-9, 2018 ****************************************************************************** SUBJECT Rolling Calendar CONTROLLING STATUTE,
More informationQuestion Answering Approach Using a WordNet-based Answer Type Taxonomy
Question Answering Approach Using a WordNet-based Answer Type Taxonomy Seung-Hoon Na, In-Su Kang, Sang-Yool Lee, Jong-Hyeok Lee Department of Computer Science and Engineering, Electrical and Computer Engineering
More information