Visualizing Translation Variation of Othello : A Survey of Text Visualization and Analysis Tools

Size: px
Start display at page:

Download "Visualizing Translation Variation of Othello : A Survey of Text Visualization and Analysis Tools"

Transcription

1 Visualizing Translation Variation of Othello : A Survey of Text Visualization and Analysis Tools Zhao Geng 1, Robert S.Laramee 1, Tom Cheesman 2, Stephan Thiel 3 1 Visual Computing Group, Swansea University cszg,r.s.laramee@swansea.ac.uk 2 College of Arts and Humanities, Swansea University, UK, t.cheesman@swansea.ac.uk and 3 April 30, Abstract Being a global icon, Shakespeare s plays have been translated into dozens of languages for about 300 years. Also, there are many re-translations to the same language, for example, there are more than 40 translation of Othello into German. Every translation is a different interpretation of the play. These large quantities of translations reflect changing culture or express individual thought by the authors. They build a wide connection between different regions and reveal a retrospective view of their histories. At the moment, researchers from Modern Languages collect a large number of translations of William Shakespeare s play, Othello. In recent years, since roughly 2005, we have witnessed a rapid increase in the number of off-the-shelf text visualization tools which can benefit this study. Here we set out to utilize existing text visualization techniques and tools in order to gain a better understanding of the various translations of the Shakespeare s work. In particular, we would like to learn more about which content varies highly with each translation, and which content remains table. We would also like to form hypothesis as to the implications behind this variations. 2 Introduction The goal of this project is to visualize the various translations of Shakespeare s work, Othello. The initial task is to identify and extract the nonsemantic features from the original text of a document corpus. The non-semantic features refer to the number of words, tokens and patterns in the concordance. Text pre-processing facilitates the construction of text concordance, term relations, document relevance and other properties of interest. Based on the extracted information, various visualizations can be applied. In this document, we present the result of our survey on the state-ofart techniques and free, off-the-shelf tools for text analysis and visualization. We conduct some experiments with a selection of tools using Shakespear s Othello as an example. We investigate if any of the freely available tools can provide clues into the variation of German translations of Othello. The rest of the paper is organized as follows: 3 Related Work Since 2005, we observe a rapid increase in the number of text visualization prototypes being developed. As a result, various visual representations for text streams and documents are proposed to effectively present and explore the text features. In this section, we present some interesting and novel text visualizations which are able to present some of the extracted text attributes. The ThemeRiver visualization, proposed by Havre, Hetzler, Whitney & Nowell (2002), depicts thematic variations over time within a large collection of documents. The thematic changes are shown in the context of a time line and corresponding external events. The focus on temporal thematic change within a context framework allows a user to discern patterns that suggest relationships or trends. A Document 1

2 Contrast Diagram, proposed by Clark (2008), is a visual summary of the content of two text documents that illustrates shared words, words that are unique to one document or the other, word frequency, relative size of the two documents, distribution of emotional tone within the documents, related words based on co-occurrence, and the most common word in each document segment. It uses the familiar bubble technique and effective use of colour to contrast topic usage in two bodies of text. Parallel Tag Clouds, proposed by Collins, B.Viegas & Wattenberg (2009), combines the parallel coordinates and tag clouds to provide a rich overview of a document collection. Each vertical axis represents a category. The words in each category are summarized in the form of tag clouds along the vertical axis. When clicking on a word, the same word appearing in other vertical axes is connected. Several filters can be defined to reduce the amount of text displayed in each category. This could help create more screen space and improve the clarity of the visualization. DocuBurst proposed by Collins, Carpendale & Penn (2009) uses a radial, space-filling layout to depict the document content by visualizing the structured text. The structured text in this visualization refers to the is-kind-of or is-type-of relationship. For example, robin and redbreast is kind of a bird. A bird is kind of an animal. An animal is kind of an organism or a living thing. A living thing is an entity. As we can see, such structured text can form a tree hierarchy, with the entity as the root and robin or redbreast as the leaf. The root node of DocBurst visualization is shown as a circle. All other nodes are assigned to a sector of an annulus. The angular width of each sector is mapped to the number of leaves or children. SparkClouds proposed by Lee, Riche, Karlson & Carpendale (2010) integrates sparklines into a tag cloud to convey trends between multiple tag clouds. The sparklines can be used to present the trend over time. From a controlled study that compares SparkClouds with two traditional trend visualizations, such as multiple line graphs, stacked bar charts and Parallel Tag Clouds, results show that SparkClouds is more effective at showing trends along the time. ManiWordle proposed by Koh, Lee, Kim & Seo (2010) provides flexible control such that user can directly manipulate on the original Wordle to change the layout, colour and etc. Figure 1: This figure shows the time and place of the collected 47 translations in scatterplots. The x-axis depicts the publication date and the y-axis depicts the place of the publication. The dot size corresponds to the year of publication, where the newer version has a larger size. 4 Background Data Discription The domain experts from Modern Languages have collected more than 47 different German translations of Shakepeare s play, Othello. These translations are published from 1766 to 2006 in 5 different countries, as shown in Figure 1. The detail of the authors and the title of each translation can be visualized by the Matrix Chart in ManyEyes Viegas, Wattenberg, van Ham, Kriss & Mckeon (2007), as shown in Figure 2. Apart from the main texts of the Othello being collected, the domain experts also list some of the important metadata, such as the originality score of each translation, which is manually calculated to give an relatively objective measurement of how similar each translation is to the original play. In the next few sections, we apply the 23 different versions of Othello translations on various available visualization tools. 5 Text Preprocessing Using of the text preprocessing tool introduced in Section 5, we can collect a wide range of text 2

3 Figure 2: This figure shows the detail of our Othello collection using Matrix Chart in ManyEyes developed by Viegas et al. (2007). The row is mapped to the publication country. The column is mapped to the authors and color represents the title of the book. attributes, such as word relationships, word frequency and sentence segmentation. The domain experts are of particular interest to the variations of different segments or paragraphs in various documents. The software WordSmith, developed by Word- Smith.org (1996), is able to generate various text attributes, such as word frequency, parts of speech and any other statistical information. The outcome of the analysis involves loads of statistical data about the word frequencies in the texts (both absolute values and compared with other texts, or compared with external corpora) and key words list (words which occur unusually frequently in comparison with some kind of reference corpus). A screen shot of the software is shown in Figure 3. Figure 4: This figure shows the interface of Concordance developed by Watt (2009). The software Concordance developed by Watt (2009) is created for people who need in-depth language or text analysis. It provides a free trial for the user. Concordance is able to generate indexes and word lists, count word frequencies, compare different usages of a word, analyse keywords and publish the analysis results on the web. The screen shot of the software is shown in Figure 4 6 Exploratory Specification Some questions that users may ask of a digital resource on Shakespeare in Translation : Where, when, into which languages, and into which kinds of text has Othello been translated? How have translations influenced on another? How do the versions vary in general? How do the versions vary in particular? How do translations deal with any specific term or phrase that the domain experts are interested in? Figure 3: This figure shows the interface of Word- Smith developed by WordSmith.org (1996). 7 State-of-art Text Visualization In this section, we investigate the text visualization tools which are free to the public. Our work 3

4 Figure 5: This figure shows the TextArc(Paley (2002)) visualization of the Shakepear s book Othello in English. The entire text is depicted as an ellipse. Each line is drawn on the outside of the ellipse. In the middle of an ellipse is rendered. can facilitate modern language experts search for visualizations that benefit most for the analysis of their collected Shakespeare s translations. In this section, we experiment with these freely available tools on the 23 German translations of Othello s speech to the senate appeared in Shakepeare s play Othello. The following list summarizes the tools and their visualizations we have tried: The ManyEyes(Viegas et al. (2007)) application with the following visualizations: Wordle, Tag Cloud, Phrase Net, Word Tree The TextArc(Paley (2002)) application with the following: Text Arc The NameVoyager(Wattenberg (2005)) application with the following visualizations: Stack Graph The Tagline Generator(Mehta (2006)): Cloud, Time Line Tag The Wordle(Jonathan Feinberg (2009)): Wordle A TextArc( Paley (2002)) is a visual representation of the entire text on a single page. It is an advanced combination of an index, concordance, and summary of the text. Animation is provided to enable the user keep track of the variations of relationship between different words, phrases and sentences. In TextArc, the entire text is depicted as an ellipse. Each line is drawn on the outside of the ellipse. It preserves the typographic structure of the text. A word appears in the middle of an ellipse. A word with high frequency is displayed in brighter color and larger size. If a word is used more than once, it appears at the center of all of its mentions. The accepted data for TextArc is only from the TextArc library. Figure 5 shows the visualization of the Shakepeare s play Othello generated by TextArc. The NameVoyager(Wattenberg (2005)) as a webbased visualization of historical trends in baby naming, has proved remarkably popular. The method used to visualize the data is straightforward: given a set of name popularity time series, a set of stacked graphs is produced. However, this tool does not accept user customized data sets. 4

5 Figure 6: This figure shows the visualizations of two German translations of Othello using Tagline Generator(Mehta (2006)). By moving the scroll bar, user is able to see the visualization of each individual document. We experimented more than 20 German translations of the Othello play using this tool. Figure 7: This figure shows the Tag Clouds(B.Scott. et al. (2008)) of Othello using ManyEyes(Viegas et al. (2007)). The left image depicts the tag clouds for every single word, whereas the right image shows the Tag Clouds of pairs of words staring with letter b. The ManyEyes does not provide the text preprocessing option in the Tag Cloud, such as removing the common words. The Tagline Generator(Mehta (2006)) is a simple PHP codebase that lets the user generate chronological tag clouds from simple text data sources without manually tagging the data entries. Once the users have populated the data source and configured the generator, it creates a list of all the unique words that have been used and counts their frequency. Next it identifies the different variations of words and combines them under the most common variation using the Porter Stemming Algorithm. The size of a word indicates its frequency in the document. The brightness indicates the year of the document, the newer document is brighter. The accepted data format of tagline generator is an XML file. Figure 6 shows the TagLine visualization of 23 German translations of Shakespeare s play, Othello. ManyEyes(Viegas et al. (2007)) is a free website where anyone can upload, visualize, and discuss data. It is an experiment created by the Visual Communication Lab. The input data of ManyEyes is not obtained by files, instead it accepts any forms of free text copied and pasted from any sources. It provides a number of text visualizations, such as Tag Clouds, Phrase Net and Word Tree. Again, we apply our Othello data, which contains 23 various German translations of the play, to the visualizations in this tool. The standard Tag Clouds( B.Scott. et al. (2008)) is a popular text visualization for depicting term frequencies. Tags are usually single words and are normally listed alphabetically, and the importance of each tag is shown with font size or color, as shown in Figure 7. Word Tree( Wattenberg & B.Viegas (2008)) is a graphical version of the traditional keyword-in- 5

6 Word Attribute Visual Mapping Figure 12: This table is a classification matrix with the column represents the visual mapping elements and the row with the text attribute. Each element of the matrix represents a visualization technique we have introduced in this paper. Figure 8: This image shows the Word Tree(Wattenberg & B.Viegas (2008)) of Othello data using ManyEyes(Viegas et al. (2007)). When we input the word liebte, all of sentences beginning with this word are shown. The size of a word represents its frequency. Figure 9: This image shows the Phrase Net(van Ham et al. (2009)) of our Othello data using ManyEyes(Viegas et al. (2007)). It depicts any two words connected with open space in the Othello play. The size of the words depict the word frequency. context method, and enables rapid querying and exploration of bodies of the text, as shown in Figure 8. It is a visual search tool for unstructured text, such as a book, article, speech or poem. It allows the user choose a word or phrase and shows them all the different contexts in which the word or phrase appears. The contexts are arranged in a tree-like branching structure to reveal recurrent themes and phrases. The size of a word represents its frequency. Phrase Nets( van Ham et al. (2009)) illustrates the relationships between different words used in a text. It uses a simple form of pattern matching to provide multiple views of the concepts contained in a book, speech, or poem. Such as given a network of words and connection pattern word and, where two words are connected if they appear together in a phrase of the form X and Y, as shown in Figure 9. TagCrowd(Steinbock (2008)) is a web application for visualizing word frequencies in any usersupplied text by creating a tag cloud or text cloud( B.Scott. et al. (2008)). The advantage of TagCrowd is that user can define the common words themselves and these common words will be automatically removed from the original text. The accepted input text is the same as ManyEyes. Figure 10 shows the Tag Cloud visualization of our Othello data sets. The common German words are removed. 6

7 Wordle(Jonathan Feinberg (2009)) is a tool for generating word clouds from text that the user provides. The accepted text format is the same as ManyEyes. Wordles are more artistically arranged (and often vibrantly colored) versions of a text. They tend to be less directly insightful as visualization, but often give a more personal feel to a document. The clouds give greater prominence to words that appear more frequently in the source text. The user can tweak their clouds with different fonts, layouts, and color schemes. As shown in Figure 11, is the wordle visualization of some passages from Othello. The common German words are removed. In this section, we also propose a classification matrix to categorize different visualization techniques. The column of our matrix depicts the encoded visual element. The size, orientation, shape and color are used to to encode an individual value, and lines / curves and trees are used to encode relationship and hierarchy. The row of the matrix depicts the text attributes including the individual word frequency and words relationship. An overview of the table is shown in Figure 12. Figure 10: This image shows the TagCrowd(Steinbock (2008)) visualization of a passage from Othello. The common German words or stop lists are manually defined and reduced from the original text. Figure 11: This image shows the Wordle visualization(jonathan Feinberg (2009)) of a passage from Othello data sets. The common German words are reduced. 8 Proposed Visualization In this section, we will briefly introduce our proposed visualization for our Othello data set. Parallel coordinates introduced by Inselberg (2009) is a widely used visualization technique for exploring large, multidimensional data sets. It is powerful in revealing a wide range of data characteristics such as different data distributions and functional dependencies as stated. The textual information of each document can be transformed into a vector. In our parallel coordinates, we encode the document dimensions as term frequencies. Domain experts from Arts and Humanities selected eight interesting translations according to their similarity score. For initial analysis, we chose a significant passage from the play, Othello s big speech to the Venetian Senate in Act1, Scene3: the longest single speech in the play (about 300 words in Shakespeare s text). Figure 13 shows an overview of our visualization. The column on the far left displays a list of selected keywords: these are most frequently occurring significant words in the document corpus. The parallel coordinates present a focused view of keyword frequencies. Each document is represented by a vertical axis. In order to maintain a unified scale, the height of each vertical axis is made proportional to the range between each document s minimal and maximal word frequencies. Zero frequency simply means that a keyword has not occurred in that document. The thickness of each vertical axis is mapped to the document s similarity with others in terms of LSI score: a thicker line means a higher similarity value. The number of occurrences of each keyword in each document is connected by a polyline. Each polyline is rendered in a different color to enable visual discrimination. The text boxes below the parallel coordinates provide context views for keywords selected by the user. Each text box repre- 7

8 Figure 13: This image shows an overview of our visualization. The parallel coordinates illustrates a focus view of the term frequency. The text boxes below the parallel coordinates show the context views. They present the entire sentences from the original text where each keyword appears. sents an individual document and shows the entire sentences from the original text where each selected keyword occurs. We also apply the edge bundling to enhance the visual clustering and user is able to control the curvature of the edge. Curves with the least curvature become a straight line. 9 Conclusion We have surveyed a range of off-the-shelf, freely available information visualization tools for the visual analysis and investigation of Othello data set. Although there are many options available, only a select few visualizations are useful for this particular application. Our study also serves as a useful tool for readers interested in gaining an overview of existing, free, state-of-the-art information visualization tools for text analysis. We also introduce our proposed visualization system designed for the analysis of Othello data set. References B.Scott., G.Carl. & N.Miguel. (2008), Seeing Things in the Clouds: The Effect of Visual Features on Tag Cloud Selections, in HT 08: Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, ACM, New York, NY, USA, pp Clark, J. (2008), Document contrast diagrams. Collins, C., B.Viegas, F. & Wattenberg, M. (2009), Parallel Tag Clouds to Explore and Analyze Facted Text Corpora, in IEEE Symposium on Visual Analytics Science and Technology, Computer Society, pp Collins, C., Carpendale, M. S. T. & Penn, G. (2009), DocuBurst: Visualizing Document Content using Language Structure, Computer Graphics Forum 28(3), Havre, S., Hetzler, E., Whitney, P. & Nowell, L. (2002), ThemeRiver: Visualizing Thematic Changes in Large Document Collections, IEEE Transactions on Visualization and Computer Graphics 8(1), Inselberg, A. (2009), Parallel Coordinates: Visual Multidimensional Geometry and Its Applications, Springer. Jonathan Feinberg (2009), Wordle: Beautiful Word Clouds. 8

9 Koh, K., Lee, B., Kim, B. H. & Seo, J. (2010), ManiWordle: Providing Flexible Control over Wordle, IEEE Transactions on Visualization and Computer Graphics 16(6), Lee, B., Riche, N. H., Karlson, A. K. & Carpendale, M. S. T. (2010), SparkClouds: Visualizing Trends in Tag Clouds, IEEE Transactions on Visualization and Computer Graphics 16(6), Mehta, C. (2006), Tagline Generator - Timeline-based Tag Clouds. Last Access Date: Paley, W. B. (2002), TextArc: An Alternative Way to View Text. Steinbock, D. (2008), TagCrowd: Joining the Crowd Together. van Ham, F., Wattenberg, M. & Viégas, F. B. (2009), Mapping Text with Phrase Nets, IEEE Transactions on Visualization and Computer Graphics 15(6), Viegas, F. B., Wattenberg, M., van Ham, F., Kriss, J. & Mckeon, M. (2007), ManyEyes: A Site for Visualization at Internet Scale., IEEE Transactions on Visualization and Computer Graphics 13(6), Watt, R. J. C. (2009), Concordance Wattenberg, M. (2005), Baby Names Visualization, and Social Data Analysis, in Proceedings of 2005 IEEE Symposium on Information Visualization (INFOVIS), pp Wattenberg, M. & B.Viegas, F. (2008), The Word Tree, an Interactive Visual Concordance, IEEE Transactions on Visualization and Computer Graphics 14(6), WordSmith.org (1996), WordSmith Tools. Last Access Date:

Txt2vz: a new tool for generating graph clouds

Txt2vz: a new tool for generating graph clouds Txt2vz: a new tool for generating graph clouds HIRSCH, L and TIAN, D Available from Sheffield Hallam University Research Archive (SHURA) at: http://shura.shu.ac.uk/6619/

More information

CP SC 8810 Data Visualization. Joshua Levine

CP SC 8810 Data Visualization. Joshua Levine CP SC 8810 Data Visualization Joshua Levine levinej@clemson.edu Lecture 15 Text and Sets Oct. 14, 2014 Agenda Lab 02 Grades! Lab 03 due in 1 week Lab 2 Summary Preferences on x-axis label separation 10

More information

VISUALIZATION OF VERSION VARIATION

VISUALIZATION OF VERSION VARIATION VISUALIZATION OF VERSION VARIATION Majedah Mohammad Alrehiely 715803@swansea.ac.uk October 2014 Project Dissertation document submitted to the University of Wales, Swansea, in fulfillment of the requirements

More information

Information Visualisation

Information Visualisation Information Visualisation Computer Animation and Visualisation Lecture 18 Taku Komura tkomura@ed.ac.uk Institute for Perception, Action & Behaviour School of Informatics 1 Overview Information Visualisation

More information

Interactive Visualization for Computational Linguistics

Interactive Visualization for Computational Linguistics Interactive Visualization for Computational Linguistics ESSLII 2009 2 Interaction and animation References 3 Slides in this section are based on: Yi et al., Toward a Deeper Understanding of the Role of

More information

Geometric Techniques. Part 1. Example: Scatter Plot. Basic Idea: Scatterplots. Basic Idea. House data: Price and Number of bedrooms

Geometric Techniques. Part 1. Example: Scatter Plot. Basic Idea: Scatterplots. Basic Idea. House data: Price and Number of bedrooms Part 1 Geometric Techniques Scatterplots, Parallel Coordinates,... Geometric Techniques Basic Idea Visualization of Geometric Transformations and Projections of the Data Scatterplots [Cleveland 1993] Parallel

More information

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student Organizing data Learning Outcome 1. make an array 2. divide the array into class intervals 3. describe the characteristics of a table 4. construct a frequency distribution table 5. constructing a composite

More information

This demonstration is aimed at anyone with lots of text, unstructured or multiformat data to analyse.

This demonstration is aimed at anyone with lots of text, unstructured or multiformat data to analyse. 1 2 This demonstration is aimed at anyone with lots of text, unstructured or multiformat data to analyse. This could be lots of Word, PDF and text file formats or in various databases or spreadsheets,

More information

Interactive Visualizations for Linguistic Analysis

Interactive Visualizations for Linguistic Analysis Interactive Visualizations for Linguistic Analysis Verena Lyding and Henrik Dittmann {verena.lyding/henrik.dittmann}@eurac.edu Institute for Specialised Communication and Multilingualism, EURAC, Bozen-Bolzano

More information

Last Week: Visualization Design II

Last Week: Visualization Design II Last Week: Visualization Design II Chart Junks Vis Lies 1 Last Week: Visualization Design II Sensory representation Understand without learning Sensory immediacy Cross-cultural validity Arbitrary representation

More information

Visualization of Streaming Data: Observing Change and Context in Information Visualization Techniques

Visualization of Streaming Data: Observing Change and Context in Information Visualization Techniques Visualization of Streaming Data: Observing Change and Context in Information Visualization Techniques Miloš Krstajić, Daniel A. Keim University of Konstanz Konstanz, Germany {milos.krstajic,daniel.keim}@uni-konstanz.de

More information

Contents. About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2

Contents. About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2 Contents About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2 1 About SAS Sentiment Analysis Workbench...3 1.1 What Is SAS Sentiment Analysis Workbench?... 3 1.2 Benefits of Using SAS

More information

PowerPoint XP Advanced Techniques

PowerPoint XP Advanced Techniques PowerPoint XP Advanced Techniques Tutorial will cover: 1. Statistical Chart (using PowerPoint with Excel) 2. Organization Chart 3. Table 4. Drawing 5. Clip Art 6. Custom Animation 7. Sound and movies 1.

More information

Text Mining. Representation of Text Documents

Text Mining. Representation of Text Documents Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,

More information

Social Visualization

Social Visualization Social Visualization CS 4460/7450 - Information Visualization April 16 19, 2009 John Stasko Casual InfoVis User population Everyday people Usage pattern Momentary, repeatable, contemplative Data type Often

More information

Interactive Visual Text Analytics for Decision Making. Shixia Liu Microsoft Research Asia

Interactive Visual Text Analytics for Decision Making. Shixia Liu Microsoft Research Asia Interactive Visual Text Analytics for Decision Making Shixia Liu Microsoft Research Asia 1 Text is Everywhere We use documents as primary information artifact in our lives Our access to documents has grown

More information

Visualizing Etymology: A Radial Graph Displaying Derivations and Origins

Visualizing Etymology: A Radial Graph Displaying Derivations and Origins Visualizing Etymology: A Radial Graph Displaying Derivations and Origins Chinmayi Dixit Stanford University cdixit@stanford.edu Filippa Karrfelt Stanford University filippak@stanford.edu ABSTRACT Study

More information

Data Visualization. Fall 2016

Data Visualization. Fall 2016 Data Visualization Fall 2016 Information Visualization Upon now, we dealt with scientific visualization (scivis) Scivisincludes visualization of physical simulations, engineering, medical imaging, Earth

More information

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs DATA PRESENTATION At the end of the chapter, you will learn to: Present data in textual form Construct different types of table and graphs Identify the characteristics of a good table and graph Identify

More information

CS Information Visualization Sep. 2, 2015 John Stasko

CS Information Visualization Sep. 2, 2015 John Stasko Multivariate Visual Representations 2 CS 7450 - Information Visualization Sep. 2, 2015 John Stasko Recap We examined a number of techniques for projecting >2 variables (modest number of dimensions) down

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 2 Summarizing and Graphing Data 2-1 Review and Preview 2-2 Frequency Distributions 2-3 Histograms

More information

CS 4460/ Information Visualization February 23, 2010 John Stasko

CS 4460/ Information Visualization February 23, 2010 John Stasko Text and Document Visualization CS 4460/7450 - Information Visualization February 23, 2010 John Stasko Text is Everywhere We use documents as primary information artifact in our lives Our access to documents

More information

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form. CHAPTER 2 Frequency Distributions and Graphs Objectives Organize data using frequency distributions. Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives.

More information

Advanced data visualization (charts, graphs, dashboards, fever charts, heat maps, etc.)

Advanced data visualization (charts, graphs, dashboards, fever charts, heat maps, etc.) Advanced data visualization (charts, graphs, dashboards, fever charts, heat maps, etc.) It is a graphical representation of numerical data. The right data visualization tool can present a complex data

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

TE Teacher s Edition PE Pupil Edition Page 1

TE Teacher s Edition PE Pupil Edition Page 1 Standard 4 WRITING: Writing Process Students discuss, list, and graphically organize writing ideas. They write clear, coherent, and focused essays. Students progress through the stages of the writing process

More information

Blog Pro for Magento 2 User Guide

Blog Pro for Magento 2 User Guide Blog Pro for Magento 2 User Guide Table of Contents 1. Blog Pro Configuration 1.1. Accessing the Extension Main Setting 1.2. Blog Index Page 1.3. Post List 1.4. Post Author 1.5. Post View (Related Posts,

More information

Chapter 2 - Graphical Summaries of Data

Chapter 2 - Graphical Summaries of Data Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense

More information

Click2Annotate: Automated Insight Externalization with Rich Semantics

Click2Annotate: Automated Insight Externalization with Rich Semantics Click2Annotate: Automated Insight Externalization with Rich Semantics Yang Chen Scott Barlowe Jing Yang Department of Computer Science UNC Charlotte ABSTRACT Insight Externalization (IE) refers to the

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

Keyword Extraction by KNN considering Similarity among Features

Keyword Extraction by KNN considering Similarity among Features 64 Int'l Conf. on Advances in Big Data Analytics ABDA'15 Keyword Extraction by KNN considering Similarity among Features Taeho Jo Department of Computer and Information Engineering, Inha University, Incheon,

More information

SAS Visual Analytics 8.2: Working with Report Content

SAS Visual Analytics 8.2: Working with Report Content SAS Visual Analytics 8.2: Working with Report Content About Objects After selecting your data source and data items, add one or more objects to display the results. SAS Visual Analytics provides objects

More information

Edge Equalized Treemaps

Edge Equalized Treemaps Edge Equalized Treemaps Aimi Kobayashi Department of Computer Science University of Tsukuba Ibaraki, Japan kobayashi@iplab.cs.tsukuba.ac.jp Kazuo Misue Faculty of Engineering, Information and Systems University

More information

INDEX UNIT 4 PPT SLIDES

INDEX UNIT 4 PPT SLIDES INDEX UNIT 4 PPT SLIDES S.NO. TOPIC 1. 2. Screen designing Screen planning and purpose arganizing screen elements 3. 4. screen navigation and flow Visually pleasing composition 5. 6. 7. 8. focus and emphasis

More information

Middle Years Data Analysis Display Methods

Middle Years Data Analysis Display Methods Middle Years Data Analysis Display Methods Double Bar Graph A double bar graph is an extension of a single bar graph. Any bar graph involves categories and counts of the number of people or things (frequency)

More information

Information Retrieval CSCI

Information Retrieval CSCI Information Retrieval CSCI 4141-6403 My name is Anwar Alhenshiri My email is: anwar@cs.dal.ca I prefer: aalhenshiri@gmail.com The course website is: http://web.cs.dal.ca/~anwar/ir/main.html 5/6/2012 1

More information

Text Mining: A Burgeoning technology for knowledge extraction

Text Mining: A Burgeoning technology for knowledge extraction Text Mining: A Burgeoning technology for knowledge extraction 1 Anshika Singh, 2 Dr. Udayan Ghosh 1 HCL Technologies Ltd., Noida, 2 University School of Information &Communication Technology, Dwarka, Delhi.

More information

An Introduction to Data Analysis, Statistics, and Graphing

An Introduction to Data Analysis, Statistics, and Graphing An Introduction to Data Analysis, Statistics, and Graphing What is a Graph? Present processes, relationships, and changes in a visual format that is easily understandable Attempts to engage viewers by

More information

Using Microsoft Word. Working With Objects

Using Microsoft Word. Working With Objects Using Microsoft Word Many Word documents will require elements that were created in programs other than Word, such as the picture to the right. Nontext elements in a document are referred to as Objects

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

MARKET RESEARCH AND EVALUATION2017. Reporting Guidelines

MARKET RESEARCH AND EVALUATION2017. Reporting Guidelines MARKET RESEARCH AND EVALUATION2017 Reporting Guidelines July 25, 2017 At NEEA, we are dedicated to providing strategic and relevant insight to our primary audiences through market research and evaluation

More information

Interactive Campaign Planning for Marketing Analysts

Interactive Campaign Planning for Marketing Analysts Interactive Campaign Planning for Marketing Analysts Fan Du University of Maryland College Park, MD, USA fan@cs.umd.edu Sana Malik Adobe Research San Jose, CA, USA sana.malik@adobe.com Eunyee Koh Adobe

More information

HYBRID FORCE-DIRECTED AND SPACE-FILLING ALGORITHM FOR EULER DIAGRAM DRAWING. Maki Higashihara Takayuki Itoh Ochanomizu University

HYBRID FORCE-DIRECTED AND SPACE-FILLING ALGORITHM FOR EULER DIAGRAM DRAWING. Maki Higashihara Takayuki Itoh Ochanomizu University HYBRID FORCE-DIRECTED AND SPACE-FILLING ALGORITHM FOR EULER DIAGRAM DRAWING Maki Higashihara Takayuki Itoh Ochanomizu University ABSTRACT Euler diagram drawing is an important problem because we may often

More information

The KNIME Text Processing Plugin

The KNIME Text Processing Plugin The KNIME Text Processing Plugin Kilian Thiel Nycomed Chair for Bioinformatics and Information Mining, University of Konstanz, 78457 Konstanz, Deutschland, Kilian.Thiel@uni-konstanz.de Abstract. This document

More information

Text and Document Visualization

Text and Document Visualization Text and Document Visualization CS 4460/7450 - Information Visualization March 26, 2009 John Stasko Text is Everywhere We use documents as primary information artifact in our lives Our access to documents

More information

UIS USER GUIDE SEPTEMBER 2013 USER GUIDE FOR UIS.STAT (BETA)

UIS USER GUIDE SEPTEMBER 2013 USER GUIDE FOR UIS.STAT (BETA) UIS USER GUIDE SEPTEMBER 2013 USER GUIDE FOR UIS.STAT (BETA) Published in 2013 by: UNESCO Institute for Statistics P.O. Box 6128, Succursale Centre-Ville Montreal, Quebec H3C 3J7 Canada Tel: (1 514) 343-6880

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

Using surface markings to enhance accuracy and stability of object perception in graphic displays

Using surface markings to enhance accuracy and stability of object perception in graphic displays Using surface markings to enhance accuracy and stability of object perception in graphic displays Roger A. Browse a,b, James C. Rodger a, and Robert A. Adderley a a Department of Computing and Information

More information

Microsoft Office PowerPoint 2013 Courses 24 Hours

Microsoft Office PowerPoint 2013 Courses 24 Hours Microsoft Office PowerPoint 2013 Courses 24 Hours COURSE OUTLINES FOUNDATION LEVEL COURSE OUTLINE Using PowerPoint 2013 Opening PowerPoint 2013 Opening a Presentation Navigating between Slides Using the

More information

Adaptive Medical Information Delivery Combining User, Task and Situation Models

Adaptive Medical Information Delivery Combining User, Task and Situation Models Adaptive Medical Information Delivery Combining User, Task and Situation s Luis Francisco-Revilla and Frank M. Shipman III Department of Computer Science Texas A&M University College Station, TX 77843-3112,

More information

Encoding Words into String Vectors for Word Categorization

Encoding Words into String Vectors for Word Categorization Int'l Conf. Artificial Intelligence ICAI'16 271 Encoding Words into String Vectors for Word Categorization Taeho Jo Department of Computer and Information Communication Engineering, Hongik University,

More information

A Mixed Method Framework for Evaluating Video Digital Libraries

A Mixed Method Framework for Evaluating Video Digital Libraries Qualitative and Methods in Libraries (QQML) 1:245 252, 2014 A Mixed Method Framework for Evaluating Video Digital Libraries Dan Albertson School of Library and Information Studies, University of Alabama,

More information

Taming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island

Taming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island Taming Text How to Find, Organize, and Manipulate It GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS 11 MANNING Shelter Island contents foreword xiii preface xiv acknowledgments xvii about this book

More information

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable. 5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table

More information

NVIvo 11 workshop. Hui Bian Office for Faculty Excellence

NVIvo 11 workshop. Hui Bian Office for Faculty Excellence 1 NVIvo 11 workshop Hui Bian Office for Faculty Excellence Contact information 2 Email: bianh@ecu.edu Phone: 328-5428 1001 Joyner Library, room 1006 Website: http://core.ecu.edu/ofe/st atisticsresearch/

More information

CADIAL Search Engine at INEX

CADIAL Search Engine at INEX CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr

More information

Profiles Research Networking Software User Guide

Profiles Research Networking Software User Guide Profiles Research Networking Software User Guide Accessing Profiles: Profiles is accessed through the Translational Research Institute s (TRI) portal, which can be found at http://tri.uams.edu/. There

More information

Enhancing applications with Cognitive APIs IBM Corporation

Enhancing applications with Cognitive APIs IBM Corporation Enhancing applications with Cognitive APIs After you complete this section, you should understand: The Watson Developer Cloud offerings and APIs The benefits of commonly used Cognitive services 2 Watson

More information

Assignment 1~ ICONS Grading Assessment

Assignment 1~ ICONS Grading Assessment Assignment 1~ ICONS W18 Section: 10013 9:00 am-11:25 am MW 24 23 22 21 20 19 18 16 15 14 13 12 11 09 08 07 06 05 04 03 02 01 0 17 10 Ability to Design Icon Likeness Images in a Theme as a Stylistic Series

More information

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Katsuya Masuda *, Makoto Tanji **, and Hideki Mima *** Abstract This study proposes a framework to access to the

More information

Visualdrives Forensic Tool

Visualdrives Forensic Tool Visualdrives Forensic Tool Mustafa Anil Tuncel 1, Hulya Francis 1, Mark Taylor 1, David Llewellyn Jones 1 1 School of Computing and Mathematical Sciences Liverpool John Moores University Liverpool L3 3AF,

More information

Trees & Graphs. Nathalie Henry Riche, Microsoft Research

Trees & Graphs. Nathalie Henry Riche, Microsoft Research Trees & Graphs Nathalie Henry Riche, Microsoft Research About Nathalie Henry Riche nath@microsoft.com Researcher @ Microsoft Research since 2009 Today: - Overview of techniques to visualize trees & graphs

More information

ATLAS.ti 8 WINDOWS & ATLAS.ti MAC THE NEXT LEVEL

ATLAS.ti 8 WINDOWS & ATLAS.ti MAC THE NEXT LEVEL ATLAS.ti 8 & ATLAS.ti THE NEXT LEVEL POWERFUL DATA ANALYSIS. EASY TO USE LIKE NEVER BEFORE. www.atlasti.com UNIVERSAL EXPORT. LIFE LONG DATA ACCESS. ATLAS.ti 8 AND ATLAS.ti DATA ANALYSIS WITH ATLAS.ti

More information

THE MARKETING WRITER S GUIDE FOR EFFECTIVE SEO IMPLEMENTATION

THE MARKETING WRITER S GUIDE FOR EFFECTIVE SEO IMPLEMENTATION THE MARKETING WRITER S GUIDE FOR EFFECTIVE SEO IMPLEMENTATION Table of Contents What Makes for Strong SEO? 3 Identifying Your Audience & the Tone of Your Article 4 The Role of the Keyword 5 Tips for Writing

More information

Information Extraction Techniques in Terrorism Surveillance

Information Extraction Techniques in Terrorism Surveillance Information Extraction Techniques in Terrorism Surveillance Roman Tekhov Abstract. The article gives a brief overview of what information extraction is and how it might be used for the purposes of counter-terrorism

More information

User Manual. Global Mobile Radar. May 2016

User Manual. Global Mobile Radar. May 2016 User Manual Global Mobile Radar May 2016 Table of Contents INTRODUCTION... 3 LOGIN... 3 RADAR... 8 EXPLORER... 9 WORLD MAP... 10 TAG CLOUD... 11 CREATE... 12 Creating a trend... 12 Creating Inspirations...

More information

Guidelines to Write on history website

Guidelines to Write on history website Guidelines to Write on history website Table of Contents 1. Use Chrome browser to write on blog and install free version of https://www.grammarly.com/... 1 2. Make article outline from 4 sources... 1 2.1

More information

Paper SAS Taming the Rule. Charlotte Crain, Chris Upton, SAS Institute Inc.

Paper SAS Taming the Rule. Charlotte Crain, Chris Upton, SAS Institute Inc. ABSTRACT Paper SAS2620-2016 Taming the Rule Charlotte Crain, Chris Upton, SAS Institute Inc. When business rules are deployed and executed--whether a rule is fired or not if the rule-fire outcomes are

More information

TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION

TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION Ms. Nikita P.Katariya 1, Prof. M. S. Chaudhari 2 1 Dept. of Computer Science & Engg, P.B.C.E., Nagpur, India, nikitakatariya@yahoo.com 2 Dept.

More information

Graph Layout. Last Time: Conveying Structure. Framework for conveying structure. Photographs and illustrations. What is a good view?

Graph Layout. Last Time: Conveying Structure. Framework for conveying structure. Photographs and illustrations. What is a good view? Graph Layout Maneesh Agrawala Last Time: Conveying Structure CS 294-10: Visualization Spring 2011 Photographs and illustrations Framework for conveying structure Goal: Expose important internal features

More information

An Experiment in Visual Clustering Using Star Glyph Displays

An Experiment in Visual Clustering Using Star Glyph Displays An Experiment in Visual Clustering Using Star Glyph Displays by Hanna Kazhamiaka A Research Paper presented to the University of Waterloo in partial fulfillment of the requirements for the degree of Master

More information

arxiv: v1 [cs.hc] 14 Nov 2017

arxiv: v1 [cs.hc] 14 Nov 2017 A visual search engine for Bangladeshi laws arxiv:1711.05233v1 [cs.hc] 14 Nov 2017 Manash Kumar Mandal Department of EEE Khulna University of Engineering & Technology Khulna, Bangladesh manashmndl@gmail.com

More information

Projected Message Design Principles

Projected Message Design Principles Projected Message Design Principles General Message Display Guidelines [G] G1. Screen display should follow the horizontal-vertical and left-right organization that is common to the culture of the intended

More information

VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA

VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA Zhao Geng, Robert S.Laramee Visual Computing Group, Computer Science Department, Swansea University, UK cszg@swansea.ac.uk, r.s.laramee@swansea.ac.uk Fernando Loizides,

More information

D DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi

D DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi Journal of Energy and Power Engineering 10 (2016) 405-410 doi: 10.17265/1934-8975/2016.07.004 D DAVID PUBLISHING Shirin Abbasi Computer Department, Islamic Azad University-Tehran Center Branch, Tehran

More information

Bar Charts and Frequency Distributions

Bar Charts and Frequency Distributions Bar Charts and Frequency Distributions Use to display the distribution of categorical (nominal or ordinal) variables. For the continuous (numeric) variables, see the page Histograms, Descriptive Stats

More information

This document was created by an unregistered ChmMagic, please go to to register. it. Thanks

This document was created by an unregistered ChmMagic, please go to  to register. it. Thanks This document was created by an unregistered ChmMagic, please go to http://www.bisenter.com to register. it. Thanks About Coolexon What is Coolexon? Coolexon is a powerful dictionary and translation software.

More information

String Vector based KNN for Text Categorization

String Vector based KNN for Text Categorization 458 String Vector based KNN for Text Categorization Taeho Jo Department of Computer and Information Communication Engineering Hongik University Sejong, South Korea tjo018@hongik.ac.kr Abstract This research

More information

A PRELIMINARY STUDY ON THE EXTRACTION OF SOCIO-TOPICAL WEB KEYWORDS

A PRELIMINARY STUDY ON THE EXTRACTION OF SOCIO-TOPICAL WEB KEYWORDS A PRELIMINARY STUDY ON THE EXTRACTION OF SOCIO-TOPICAL WEB KEYWORDS KULWADEE SOMBOONVIWAT Graduate School of Information Science and Technology, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033,

More information

An Approach To Web Content Mining

An Approach To Web Content Mining An Approach To Web Content Mining Nita Patil, Chhaya Das, Shreya Patanakar, Kshitija Pol Department of Computer Engg. Datta Meghe College of Engineering, Airoli, Navi Mumbai Abstract-With the research

More information

QDA Miner. Addendum v2.0

QDA Miner. Addendum v2.0 QDA Miner Addendum v2.0 QDA Miner is an easy-to-use qualitative analysis software for coding, annotating, retrieving and reviewing coded data and documents such as open-ended responses, customer comments,

More information

SK International Journal of Multidisciplinary Research Hub Research Article / Survey Paper / Case Study Published By: SK Publisher

SK International Journal of Multidisciplinary Research Hub Research Article / Survey Paper / Case Study Published By: SK Publisher ISSN: 2394 3122 (Online) Volume 2, Issue 1, January 2015 Research Article / Survey Paper / Case Study Published By: SK Publisher P. Elamathi 1 M.Phil. Full Time Research Scholar Vivekanandha College of

More information

Technical Communication A Practical Approach: Chapter 11: Web Pages and

Technical Communication A Practical Approach: Chapter 11: Web Pages and Publish on the Web Technical Communication A Practical Approach: Chapter 11: Web Pages and Writing for the Web William Sanborn Pfeiffer Kaye Adkins Craig Baehr Revised for ENGR 139 by x.m. spring 2010

More information

Statistical graphics in analysis Multivariable data in PCP & scatter plot matrix. Paula Ahonen-Rainio Maa Visual Analysis in GIS

Statistical graphics in analysis Multivariable data in PCP & scatter plot matrix. Paula Ahonen-Rainio Maa Visual Analysis in GIS Statistical graphics in analysis Multivariable data in PCP & scatter plot matrix Paula Ahonen-Rainio Maa-123.3530 Visual Analysis in GIS 11.11.2015 Topics today YOUR REPORTS OF A-2 Thematic maps with charts

More information

Enhancing Web Page Skimmability

Enhancing Web Page Skimmability Enhancing Web Page Skimmability Chen-Hsiang Yu MIT CSAIL 32 Vassar St Cambridge, MA 02139 chyu@mit.edu Robert C. Miller MIT CSAIL 32 Vassar St Cambridge, MA 02139 rcm@mit.edu Abstract Information overload

More information

Year 9: Long term plan

Year 9: Long term plan Year 9: Long term plan Year 9: Long term plan Unit Hours Powerful procedures 7 Round and round 4 How to become an expert equation solver 6 Why scatter? 6 The construction site 7 Thinking proportionally

More information

Chapters 1.5 and 2.5 Statistics: Collecting and Displaying Data

Chapters 1.5 and 2.5 Statistics: Collecting and Displaying Data Chapters 1.5 and 2.5 Statistics: Collecting and Displaying Data Chapter Objectives: Decide which data is relevant, know methods to collect and organize data ; Design and use a data collection sheet or

More information

Domain-specific Concept-based Information Retrieval System

Domain-specific Concept-based Information Retrieval System Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical

More information

June 15, Abstract. 2. Methodology and Considerations. 1. Introduction

June 15, Abstract. 2. Methodology and Considerations. 1. Introduction Organizing Internet Bookmarks using Latent Semantic Analysis and Intelligent Icons Note: This file is a homework produced by two students for UCR CS235, Spring 06. In order to fully appreacate it, it may

More information

Visual Exploration of Relationships between Document Clusters

Visual Exploration of Relationships between Document Clusters Visual Exploration of Relationships between Document Clusters Ilir Jusufi 1,2, Andreas Kerren 1, Jiayi Liu 1, and Björn Zimmer 1 1 Linnaeus University, Department of Computer Science, ISOVIS Group, Växjö,

More information

Introduc)on to Informa)on Visualiza)on

Introduc)on to Informa)on Visualiza)on Introduc)on to Informa)on Visualiza)on Seeing the Science with Visualiza)on Raw Data 01001101011001 11001010010101 00101010100110 11101101011011 00110010111010 Visualiza(on Applica(on Visualiza)on on

More information

Introduction to Text Mining. Hongning Wang

Introduction to Text Mining. Hongning Wang Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:

More information

Chapter 2. Architecture of a Search Engine

Chapter 2. Architecture of a Search Engine Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

Advanced Layouts in a Content-Driven Template-Based Layout System

Advanced Layouts in a Content-Driven Template-Based Layout System Advanced Layouts in a Content-Driven Template-Based Layout System ISTVÁN ALBERT, HASSAN CHARAF, LÁSZLÓ LENGYEL Department of Automation and Applied Informatics Budapest University of Technology and Economics

More information

+1 (646) (US) +44 (20) (UK) Blog. for Magento 2. ecommerce.aheadworks.com/magento-2-extensions

+1 (646) (US) +44 (20) (UK) Blog. for Magento 2. ecommerce.aheadworks.com/magento-2-extensions Blog for Magento 2 Table of contents: Table of contents:... 2 Reference table... 3 Getting started... 4 Sidebar... 5 SEO... 6 Related Products... 6 Wordpress Import... 7 Blog categories... 7 Blog posts...

More information

A SMIL Editor and Rendering Tool for Multimedia Synchronization and Integration

A SMIL Editor and Rendering Tool for Multimedia Synchronization and Integration A SMIL Editor and Rendering Tool for Multimedia Synchronization and Integration Stephen J.H. Yang 1, Norman W.Y. Shao 2, Kevin C.Y. Kuo 3 National Central University 1 National Kaohsiung First University

More information

EBSCOhost Web 6.0. User s Guide EBS 2065

EBSCOhost Web 6.0. User s Guide EBS 2065 EBSCOhost Web 6.0 User s Guide EBS 2065 6/26/2002 2 Table Of Contents Objectives:...4 What is EBSCOhost...5 System Requirements... 5 Choosing Databases to Search...5 Using the Toolbar...6 Using the Utility

More information

Introducing XAIRA. Lou Burnard Tony Dodd. An XML aware tool for corpus indexing and searching. Research Technology Services, OUCS

Introducing XAIRA. Lou Burnard Tony Dodd. An XML aware tool for corpus indexing and searching. Research Technology Services, OUCS Introducing XAIRA An XML aware tool for corpus indexing and searching Lou Burnard Tony Dodd Research Technology Services, OUCS What is XAIRA? XML Aware Indexing and Retrieval Architecture Developed from

More information

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations

More information