Last Week: Visualization Design II
|
|
- Charity Dalton
- 5 years ago
- Views:
Transcription
1 Last Week: Visualization Design II Chart Junks Vis Lies 1
2 Last Week: Visualization Design II Sensory representation Understand without learning Sensory immediacy Cross-cultural validity Arbitrary representation Hard to learn Easy to forget Embedded in culture and apps 汉字 : 一二三人 dog 山 Antidisestablishmentarianism 森 Euler diagram: circle for boundary Language: need to learn 2
3 Last Week: Data Model and Explorative Visual Analytics 1-D (Linear, Set and Sequences) SeeSoft, Info Mural 2-D (Map) GIS, ArcView, PageMaker 3-D (Shape, the World) CAD, Medical, Architecture n-d (Relational) Spotfire, Tableau Temporal LifeLines, Palantir Tree (Hierarchy) Cone/Cam/Hyperbolic Network (Graph) Pajek, JUNG 3
4 Last Week: Data Model and Explorative Visual Analytics 4
5 Text Visualization I IV Course Spring 14 Graduate Course of UCAS Mar. 28th,
6 InfoVis Pipeline Visualization Text Visualization Data Text Data Model User Potential Users? Tasks 6
7 Outline Text visualization background Examples User, tasks and text visualization pipeline Text visualization approaches Information Retrieval purpose Overview and sense-making purpose Text analytics basics Word/sentence-level Corpus-level 7
8 Text is Everywhere We use documents as primary information artifact in our lives Our access to documents has grown tremendously in recent years due to the Internet... WWW Digital libraries Web 2.0 8
9 Text Visualization Examples 9
10 Examples 10
11 Examples 11
12 Examples 12
13 Examples 13
14 Examples 14
15 Examples 15
16 Examples 16
17 Examples 17
18 Big gquestions What can information visualization provide to help users in understanding and gathering information from text and document collections? (Task) Who will be interested and benefit from text visualization? (User)... 18
19 Tasks & Goals Which h documents contain text t on topic XYZ? Which documents are of interest to me? Are there other documents that are similar to this one (so they are worthwhile)? How are different words used in a document or a document collection? What are the main themes and ideas in a document or a collection? Which documents have an angry tone? How are certain words or themes distributed through a document? Identify hidden messages or stories in this document collection. How does one set of documents differ from another set? Quickly gain an understanding of a document or collection in order to subsequently do XYZ. Understand the history of changes in a document. Find connections between documents. 19
20 Another Task: Ask Better Questions on Text Collections 20
21 Users of Text Visualization Government Intelligence Analysts? Literature researcher? Artist?...???? [To be answered in Assignment II] 21
22 Potential ti User: Parents y p 22
23 Text Visualization Pipeline 23
24 Text Visualization for Information Retrieval Which documents contain text on topic XYZ? Which documents are of interest to me? Are there other documents that are similar to this one (so they are worthwhile)?... 24
25 Text Visualization for Information Retrieval 25
26 TileBar Search engine query results do not include: How strong the match is How frequent each term is How each term is distributed in the document Overlap between terms Length of document Document ranking is opaque Inability to compare between results Input limits term relationships 26
27 TileBar Search Terms Query Result Visualization 27
28 TileBar 28
29 More Text Visualization for IR Visualize One query... query distance document 29
30 More Text Visualization for IR Multiple queries... 30
31 More Text Visualization for IR 31
32 Comparing Search Results Color represents different search engines 32
33 Text Visualization for Sensemaking How are different words used in a document or a document collection? What are the main themes and ideas in a document or a collection? on? Which documents have an angry tone? How are certain words or themes distributed through a document? Identify hidden messages or stories in this document collection. How does one set of documents differ from another set? Quickly gain an understanding of a document or collection in order to subsequently do XYZ. Understand the history of changes in a document. Find connections between documents
34 Text Visualization Method Taxonomy Document-level visualization: document distribution & summarization Text content-level visualization: overview & navigation Keyword frequency Associated facet: time, topic, sentiment, etc. Text entities in context: keyword occurrence Text entity relationship and/or internal text structure... 34
35 Document Visualization InfoSky & SPIRE: 2D projection of document vectors by PCA/MDS/etc. /... InfoSky SPIRE 35
36 Document Visualization Exemplar-based document visualization... Visualization of documents in 20 Newsgroups (18, documents, 20 topics) by EV. Each point represents a document; each color shape represents a news topic; and the corresponding big color shape indicates the mean of a news group. 36
37 Document Visualization Document Card InfoVis 08 Proceedings 37
38 Text Content Visualization: Keywords Bubble Chart 38
39 Text Content Visualization: Keywords Tag Cloud 39
40 Text Content Visualization: Keywords Ordered Tag Cloud 40
41 Text Content Visualization: Keywords Bi-gram 41
42 Text Content Visualization: Keywords Wordle 42
43 Text Content Visualization: Keywords Manipulating Wordle 43
44 Text Content Visualization with Facets TIARA & ThemeRiver & Context-Preserving Tag Cloud & Parallel TagCloud Temporal/topical/facet extension of TagCloud/Wordle Provide more interactions to drill-down to small document portions TIARA ThemeRiver Context-Preserving TagCloud Parallel TagCloud 44
45 Text Content Visualization with Facets Parallel Tag Cloud 45
46 Text Entities: Keyword in Context TAKMI & FeatureLens & TileBar Visualizing entity/feature/concept within the content Visualizing occurrence patterns within the content: temporal, topical, correlational Keyword + context paradigm for details FeatureLens TileBar TAKMI 46
47 Visual Readability Analysis 47
48 Jigsaw & WordTree Visualizing entity relationships Text Entity Relationship Extract natural relationships: co- occurrence, sequential Support navigation with focus redirection Jigsaw Word Tree 48
49 Text Entity Relationship PhraseNet & FacetAtlas Visualizing entity relationships with advanced analytics: WordNet, intermediate word, multi-faceted relationships Start t from a search item : relationship item or concept item Only visualization, few navigation PhraseNet DocuBurst FacetAtlas 49
50 Text Analytics Basics: Text Mining Text pre-processing (parsing) Remove stop words Keyword stemming Text feature extraction Keyword frequency Topic modeling Text feature measurement m Similarity Text clustering 50
51 Text Parsing "I have a dream that one day this nation will rise up and live out the true meaning of its creed: "We hold these truths to be self- evident, that t all men are created equal." Stop word removal: a, the, that, t etc. Keyword stemming: men->man, truths->truth Parsing result: I, dream, one, day, nation, rise, up, live, out, true, meaning, creed, hold, truth, be, self-evident, all, man, created, equal 51
52 Basic Text Modeling Bag-of-words model: vector representation Word I dream color skin nation slave injustice owner Frequency Text similarity:cosine similarity between two words TF-IDF weighting: term frequency * inverse document frequency 52
53 Topic Modeling Popular methods: Latent Semantic Indexing plsi, LDA 53
54 Background Examples Summary User, tasks and text visualization pipeline pp Text visualization methods IR purpose Overview and sense-making: 5 categories Text analytics basics Text parsing, measurement and topic modeling 54
55 Questions? What s Next -- Lecture 8: Text Visualization II 55
Multidimensional (Multivariate)
Multidimensional (Multivariate) Data Visualization IV Course Spring 14 Graduate Course of UCAS May 9th, 2014 1 Data by Dimensionality 1-D (Linear, Set and Sequences) SeeSoft, Info Mural 2-D (Map) GIS,
More informationText and Document Visualization
Text and Document Visualization CS 4460/7450 - Information Visualization March 26, 2009 John Stasko Text is Everywhere We use documents as primary information artifact in our lives Our access to documents
More informationCS 4460/ Information Visualization February 23, 2010 John Stasko
Text and Document Visualization CS 4460/7450 - Information Visualization February 23, 2010 John Stasko Text is Everywhere We use documents as primary information artifact in our lives Our access to documents
More informationUnstructured Data. CS102 Winter 2019
Winter 2019 Big Data Tools and Techniques Basic Data Manipulation and Analysis Performing well-defined computations or asking well-defined questions ( queries ) Data Mining Looking for patterns in data
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Apr 1, 2014 Text Analytics (Text Mining) Concepts and Algorithms Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer,
More informationInteractive Visual Text Analytics for Decision Making. Shixia Liu Microsoft Research Asia
Interactive Visual Text Analytics for Decision Making Shixia Liu Microsoft Research Asia 1 Text is Everywhere We use documents as primary information artifact in our lives Our access to documents has grown
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Text Analytics (Text Mining) Concepts and Algorithms Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko,
More informationIBM Research - China
TIARA: A Visual Exploratory Text Analytic System Furu Wei +, Shixia Liu +, Yangqiu Song +, Shimei Pan #, Michelle X. Zhou*, Weihong Qian +, Lei Shi +, Li Tan + and Qiang Zhang + + IBM Research China, Beijing,
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationFacetAtlas: Multifaceted Visualization for Rich Text Corpora
1172 IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 16, NO. 6, NOVEMBER/DECEMBER 2010 FacetAtlas: Multifaceted Visualization for Rich Text Corpora Nan Cao, Jimeng Sun, Yu-Ru Lin, David
More informationRuslan Salakhutdinov and Geoffrey Hinton. University of Toronto, Machine Learning Group IRGM Workshop July 2007
SEMANIC HASHING Ruslan Salakhutdinov and Geoffrey Hinton University of oronto, Machine Learning Group IRGM orkshop July 2007 Existing Methods One of the most popular and widely used in practice algorithms
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationVisual Analysis of Set Relations in a Graph
Visual Analysis of Set Relations in a Graph Panpan Xu 1, Fan Du 2, Nan Cao 3, Conglei Shi 1, Hong Zhou 4, Huamin Qu 1 1 Hong Kong University of Science and Technology, 2 Zhejiang University, 3 IBM T. J.
More informationNobody uploads till yesterday, difficult?
Survey Result 1 Assignment II! Nobody uploads till yesterday, difficult? 2 Last Week: Text Visualization 3 Interaction IV Course Spring 14 Graduate Course of UCAS April 4th, 2014 4 InfoVis Pipeline Visualization
More informationCS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University
CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationMODELS AND FRAMEWORKS. Information Visualization Fall 2009 Jinwook Seo SNU CSE
MODELS AND FRAMEWORKS Information Visualization Fall 2009 Jinwook Seo SNU CSE Wednesday Prof. Hee-Joon Bae, Seoul National University Bundang Hostpital blood pressure and END (early neurologic deterioration)
More informationIntroduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.
Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How
More informationTxt2vz: a new tool for generating graph clouds
Txt2vz: a new tool for generating graph clouds HIRSCH, L and TIAN, D Available from Sheffield Hallam University Research Archive (SHURA) at: http://shura.shu.ac.uk/6619/
More informationWhat is this Song About?: Identification of Keywords in Bollywood Lyrics
What is this Song About?: Identification of Keywords in Bollywood Lyrics by Drushti Apoorva G, Kritik Mathur, Priyansh Agrawal, Radhika Mamidi in 19th International Conference on Computational Linguistics
More informationRelevance Feedback and Query Reformulation. Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price. Outline
Relevance Feedback and Query Reformulation Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price IR on the Internet, Spring 2010 1 Outline Query reformulation Sources of relevance
More informationDeveloping Focused Crawlers for Genre Specific Search Engines
Developing Focused Crawlers for Genre Specific Search Engines Nikhil Priyatam Thesis Advisor: Prof. Vasudeva Varma IIIT Hyderabad July 7, 2014 Examples of Genre Specific Search Engines MedlinePlus Naukri.com
More informationPowering Knowledge Discovery. Insights from big data with Linguamatics I2E
Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural
More informationInformation Retrieval. (M&S Ch 15)
Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion
More informationUnderstanding Text Corpora with Multiple Facets
Understanding Text Corpora with Multiple Facets Lei Shi Furu Wei Shixia Liu Li Tan Xiaoxiao Lian IBM Research - China 19 Zhongguancun Software Park Beijing 100193, China Michelle X. Zhou IBM Research -
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationPart I: Data Mining Foundations
Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?
More informationLearning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search
1 / 33 Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search Bernd Wittefeld Supervisor Markus Löckelt 20. July 2012 2 / 33 Teaser - Google Web History http://www.google.com/history
More informationTag-based Social Interest Discovery
Tag-based Social Interest Discovery Xin Li / Lei Guo / Yihong (Eric) Zhao Yahoo!Inc 2008 Presented by: Tuan Anh Le (aletuan@vub.ac.be) 1 Outline Introduction Data set collection & Pre-processing Architecture
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationCS377: Database Systems Text data and information. Li Xiong Department of Mathematics and Computer Science Emory University
CS377: Database Systems Text data and information retrieval Li Xiong Department of Mathematics and Computer Science Emory University Outline Information Retrieval (IR) Concepts Text Preprocessing Inverted
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web
More informationTaming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island
Taming Text How to Find, Organize, and Manipulate It GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS 11 MANNING Shelter Island contents foreword xiii preface xiv acknowledgments xvii about this book
More informationInformation Retrieval (IR) Introduction to Information Retrieval. Lecture Overview. Why do we need IR? Basics of an IR system.
Introduction to Information Retrieval Ethan Phelps-Goodman Some slides taken from http://www.cs.utexas.edu/users/mooney/ir-course/ Information Retrieval (IR) The indexing and retrieval of textual documents.
More informationInformation Retrieval CSCI
Information Retrieval CSCI 4141-6403 My name is Anwar Alhenshiri My email is: anwar@cs.dal.ca I prefer: aalhenshiri@gmail.com The course website is: http://web.cs.dal.ca/~anwar/ir/main.html 5/6/2012 1
More informationSemantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.
Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...
More informationCANDIDATE LINK GENERATION USING SEMANTIC PHEROMONE SWARM
CANDIDATE LINK GENERATION USING SEMANTIC PHEROMONE SWARM Ms.Susan Geethu.D.K 1, Ms. R.Subha 2, Dr.S.Palaniswami 3 1, 2 Assistant Professor 1,2 Department of Computer Science and Engineering, Sri Krishna
More informationInformation Retrieval Using Context Based Document Indexing and Term Graph
Information Retrieval Using Context Based Document Indexing and Term Graph Mr. Mandar Donge ME Student, Department of Computer Engineering, P.V.P.I.T, Bavdhan, Savitribai Phule Pune University, Pune, Maharashtra,
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationInformation Retrieval
Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have
More informationUsing Semantic Similarity in Crawling-based Web Application Testing. (National Taiwan Univ.)
Using Semantic Similarity in Crawling-based Web Application Testing Jun-Wei Lin Farn Wang Paul Chu (UC-Irvine) (National Taiwan Univ.) (QNAP, Inc) Crawling-based Web App Testing the web app under test
More informationCS 572: Information Retrieval. Lecture 1: Course Overview and Introduction 11 January 2016
CS 572: Information Retrieval Lecture 1: Course Overview and Introduction 11 January 2016 1/11/2016 CS 572: Information Retrieval. Spring 2016 1 Lecture Plan What is IR? (the big questions) Course overview
More informationQuery Languages. Berlin Chen Reference: 1. Modern Information Retrieval, chapter 4
Query Languages Berlin Chen 2005 Reference: 1. Modern Information Retrieval, chapter 4 Data retrieval Pattern-based querying The Kinds of Queries Retrieve docs that contains (or exactly match) the objects
More informationInformation Retrieval: Retrieval Models
CS473: Web Information Retrieval & Management CS-473 Web Information Retrieval & Management Information Retrieval: Retrieval Models Luo Si Department of Computer Science Purdue University Retrieval Models
More informationINTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING
CS 7265 BIG DATA ANALYTICS INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, PhD Computer Science,
More informationChapter 2. Architecture of a Search Engine
Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them
More informationInformation Visualization: See Patterns, Gain Insights & Make Decisions
Information Visualization: See Patterns, Gain Insights & Make Decisions Ben Shneiderman ben@cs.umd.edu @benbendc Founding Director (1983-2000), Human-Computer Interaction Lab Professor, Department of Computer
More informationOverview of Web Mining Techniques and its Application towards Web
Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous
More informationIE in Context. Machine Learning Problems for Text/Web Data
Machine Learning Problems for Text/Web Data Lecture 24: Document and Web Applications Sam Roweis Document / Web Page Classification or Detection 1. Does this document/web page contain an example of thing
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationChrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO
Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO INDEX Proposal Recap Implementation Evaluation Future Works Proposal Recap Keyword Visualizer (chrome
More informationQuery-Time JOIN for Active Intelligence Engine (AIE)
Query-Time JOIN for Active Intelligence Engine (AIE) Ad hoc JOINing of Structured Data and Unstructured Content: An Attivio-Patented Breakthrough in Information- Centered Business Agility An Attivio Technology
More informationInformation Visualisation
Information Visualisation Computer Animation and Visualisation Lecture 18 Taku Komura tkomura@ed.ac.uk Institute for Perception, Action & Behaviour School of Informatics 1 Overview Information Visualisation
More informationClassroom Course Description. Course Outline. Tableau Intermediate & Advance. Audience
Classroom Course Description Tableau Intermediate & Advance Audience Tableau Fundamentals & Advance serves the beginner to intermediate Tableau user, targeted towards anyone who works with data regardless
More informationSummarizing Public Opinion on a Topic
Summarizing Public Opinion on a Topic 1 Abstract We present SPOT (Summarizing Public Opinion on a Topic), a new blog browsing web application that combines clustering with summarization to present an organized,
More informationmodern database systems lecture 4 : information retrieval
modern database systems lecture 4 : information retrieval Aristides Gionis Michael Mathioudakis spring 2016 in perspective structured data relational data RDBMS MySQL semi-structured data data-graph representation
More informationEveryday Activity. Course Content. Objectives of Lecture 13 Search Engine
Web Technologies and Applications Winter 2001 CMPUT 499: Search Engines Dr. Osmar R. Zaïane University of Alberta Everyday Activity We use search engines whenever we look for resources on the Internet
More informationVisualizing Translation Variation of Othello : A Survey of Text Visualization and Analysis Tools
Visualizing Translation Variation of Othello : A Survey of Text Visualization and Analysis Tools Zhao Geng 1, Robert S.Laramee 1, Tom Cheesman 2, Stephan Thiel 3 1 Visual Computing Group, Swansea University
More informationVisualization and text mining of patent and non-patent data
of patent and non-patent data Anton Heijs Information Solutions Delft, The Netherlands http://www.treparel.com/ ICIC conference, Nice, France, 2008 Outline Introduction Applications on patent and non-patent
More informationWeb Page Recommender System based on Folksonomy Mining for ITNG 06 Submissions
Web Page Recommender System based on Folksonomy Mining for ITNG 06 Submissions Satoshi Niwa University of Tokyo niwa@nii.ac.jp Takuo Doi University of Tokyo Shinichi Honiden University of Tokyo National
More informationContextual Search using Cognitive Discovery Capabilities
Contextual Search using Cognitive Discovery Capabilities In this exercise, you will work with a sample application that uses the Watson Discovery service API s for cognitive search use cases. Discovery
More informationImplementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky
Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky The Chinese University of Hong Kong Abstract Husky is a distributed computing system, achieving outstanding
More informationAppendix A Additional Information
Appendix A Additional Information In this appendix, we provide more information on building practical applications using the techniques discussed in the chapters of this book. In Sect. A.1, we discuss
More informationTopic Diversity Method for Image Re-Ranking
Topic Diversity Method for Image Re-Ranking D.Ashwini 1, P.Jerlin Jeba 2, D.Vanitha 3 M.E, P.Veeralakshmi M.E., Ph.D 4 1,2 Student, 3 Assistant Professor, 4 Associate Professor 1,2,3,4 Department of Information
More informationChapter 27. Other Approaches to Reasoning and Representation
Chapter 27. Other Approaches to Reasoning and Representation The Quest for Artificial Intelligence, Nilsson, N. J., 2009. Lecture Notes on Artificial Intelligence Summarized by Ha, Jung-Woo and Lee, Beom-Jin
More informationDIGIT.B4 Big Data PoC
DIGIT.B4 Big Data PoC RTD Health papers D02.02 Technological Architecture Table of contents 1 Introduction... 5 2 Methodological Approach... 6 2.1 Business understanding... 7 2.2 Data linguistic understanding...
More informationQlik Sense Desktop. Data, Discovery, Collaboration in minutes. Qlik Sense Desktop. Qlik Associative Model. Get Started for Free
Qlik Sense Desktop Data, Discovery, Collaboration in minutes With Qlik Sense Desktop making business decisions becomes faster, easier, and more collaborative than ever. Qlik Sense Desktop puts rapid analytics
More informationAutomated Classification. Lars Marius Garshol Topic Maps
Automated Classification Lars Marius Garshol Topic Maps 2007 2007-03-21 Automated classification What is it? Why do it? 2 What is automated classification? Create parts of a topic map
More informationMultimodal Information Spaces for Content-based Image Retrieval
Research Proposal Multimodal Information Spaces for Content-based Image Retrieval Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due
More informationMahout in Action MANNING ROBIN ANIL SEAN OWEN TED DUNNING ELLEN FRIEDMAN. Shelter Island
Mahout in Action SEAN OWEN ROBIN ANIL TED DUNNING ELLEN FRIEDMAN II MANNING Shelter Island contents preface xvii acknowledgments about this book xx xix about multimedia extras xxiii about the cover illustration
More informationCS54701: Information Retrieval
CS54701: Information Retrieval Basic Concepts 19 January 2016 Prof. Chris Clifton 1 Text Representation: Process of Indexing Remove Stopword, Stemming, Phrase Extraction etc Document Parser Extract useful
More informationClustering. Bruno Martins. 1 st Semester 2012/2013
Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2012/2013 Slides baseados nos slides oficiais do livro Mining the Web c Soumen Chakrabarti. Outline 1 Motivation Basic Concepts
More informationTopic Model Visualization with IPython
Topic Model Visualization with IPython Sergey Karpovich 1, Alexander Smirnov 2,3, Nikolay Teslya 2,3, Andrei Grigorev 3 1 Mos.ru, Moscow, Russia 2 SPIIRAS, St.Petersburg, Russia 3 ITMO University, St.Petersburg,
More informationCS6200 Information Retrieval. Jesse Anderton College of Computer and Information Science Northeastern University
CS6200 Information Retrieval Jesse Anderton College of Computer and Information Science Northeastern University Major Contributors Gerard Salton! Vector Space Model Indexing Relevance Feedback SMART Karen
More informationText Analytics (Text Mining)
http://poloclub.gatech.edu/cse6242 CSE6242 / CX4242: Data & Visual Analytics Text Analytics (Text Mining) Concepts, Algorithms, LSI/SVD Duen Horng (Polo) Chau Assistant Professor Associate Director, MS
More informationA Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2
A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,
More informationContents. About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2
Contents About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2 1 About SAS Sentiment Analysis Workbench...3 1.1 What Is SAS Sentiment Analysis Workbench?... 3 1.2 Benefits of Using SAS
More informationOutline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity
Outline Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Lecture 10 CS 410/510 Information Retrieval on the Internet Query reformulation Sources of relevance for feedback Using
More informationFall CS646: Information Retrieval. Lecture 2 - Introduction to Search Result Ranking. Jiepu Jiang University of Massachusetts Amherst 2016/09/12
Fall 2016 CS646: Information Retrieval Lecture 2 - Introduction to Search Result Ranking Jiepu Jiang University of Massachusetts Amherst 2016/09/12 More course information Programming Prerequisites Proficiency
More informationA BFS-BASED SIMILAR CONFERENCE RETRIEVAL FRAMEWORK
A BFS-BASED SIMILAR CONFERENCE RETRIEVAL FRAMEWORK Qing Guo 1, 2 1 Nanyang Technological University, Singapore 2 SAP Innovation Center Network,Singapore ABSTRACT Literature review is part of scientific
More informationPlan for today. CS276B Text Retrieval and Mining Winter Vector spaces and XML. Text-centric XML retrieval. Vector spaces and XML
CS276B Text Retrieval and Mining Winter 2005 Plan for today Vector space approaches to XML retrieval Evaluating text-centric retrieval Lecture 15 Text-centric XML retrieval Documents marked up as XML E.g.,
More informationCP SC 8810 Data Visualization. Joshua Levine
CP SC 8810 Data Visualization Joshua Levine levinej@clemson.edu Lecture 15 Text and Sets Oct. 14, 2014 Agenda Lab 02 Grades! Lab 03 due in 1 week Lab 2 Summary Preferences on x-axis label separation 10
More informationLecture 2 Map design. Dr. Zhang Spring, 2017
Lecture 2 Map design Dr. Zhang Spring, 2017 Model of the course Using and making maps Navigating GIS maps Map design Working with spatial data Geoprocessing Spatial data infrastructure Digitizing File
More informationInformation Retrieval. Information Retrieval and Web Search
Information Retrieval and Web Search Introduction to IR models and methods Information Retrieval The indexing and retrieval of textual documents. Searching for pages on the World Wide Web is the most recent
More informationText Mining. Representation of Text Documents
Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,
More informationInferring Variable Labels Considering Co-occurrence of Variable Labels in Data Jackets
2016 IEEE 16th International Conference on Data Mining Workshops Inferring Variable Labels Considering Co-occurrence of Variable Labels in Data Jackets Teruaki Hayashi Department of Systems Innovation
More informationPromoting Ranking Diversity for Biomedical Information Retrieval based on LDA
Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA Yan Chen, Xiaoshi Yin, Zhoujun Li, Xiaohua Hu and Jimmy Huang State Key Laboratory of Software Development Environment, Beihang
More informationInstructor: Stefan Savev
LECTURE 2 What is indexing? Indexing is the process of extracting features (such as word counts) from the documents (in other words: preprocessing the documents). The process ends with putting the information
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationWeek 6: Networks, Stories, Vis in the Newsroom
Week 6: Networks, Stories, Vis in the Newsroom Tamara Munzner Department of Computer Science University of British Columbia JRNL 520H, Special Topics in Contemporary Journalism: Data Visualization Week
More informationNatural Language Processing
Natural Language Processing Information Retrieval Potsdam, 14 June 2012 Saeedeh Momtazi Information Systems Group based on the slides of the course book Outline 2 1 Introduction 2 Indexing Block Document
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Text Analytics (Text Mining) Concepts, Algorithms, LSI/SVD Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John
More informationAutomated Tagging for Online Q&A Forums
1 Automated Tagging for Online Q&A Forums Rajat Sharma, Nitin Kalra, Gautam Nagpal University of California, San Diego, La Jolla, CA 92093, USA {ras043, nikalra, gnagpal}@ucsd.edu Abstract Hashtags created
More informationKristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok
Kristina Lerman University of Southern California This lecture is partly based on slides prepared by Anon Plangprasopchok Social Web is a platform for people to create, organize and share information Users
More informationInformation Retrieval & Text Mining
Information Retrieval & Text Mining Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References 2 Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
More informationCMSC 476/676 Information Retrieval Midterm Exam Spring 2014
CMSC 476/676 Information Retrieval Midterm Exam Spring 2014 Name: You may consult your notes and/or your textbook. This is a 75 minute, in class exam. If there is information missing in any of the question
More informationSearch Framework for a Large Digital Records Archive DLF SPRING 2007 April 23-25, 25, 2007 Dyung Le & Quyen Nguyen ERA Systems Engineering National Ar
Search Framework for a Large Digital Records Archive DLF SPRING 2007 April 23-25, 25, 2007 Dyung Le & Quyen Nguyen ERA Systems Engineering National Archives & Records Administration Agenda ERA Overview
More informationDealing with Data Especially Big Data
Dealing with Data Especially Big Data INFO-GB-2346.01 Fall 2017 Professor Norman White nwhite@stern.nyu.edu normwhite@twitter Teaching Assistant: Frenil Sanghavi fps241@stern.nyu.edu Administrative Assistant:
More information