Table Identification and Information extraction in Spreadsheets

Size: px
Start display at page:

Download "Table Identification and Information extraction in Spreadsheets"

Transcription

1 Table Identification and Information extraction in Spreadsheets Elvis Koci 1,2, Maik Thiele 1, Oscar Romero 2, and Wolfgang Lehner 1 1 Technische Universität Dresden, Germany 2 Universitat Politècnica de Catalunya, Spain

2 Spreadsheets in the Realm of Big Data IMPORTANT SOURCE OF INFORMATION Used in various domains Content generated by experts CHALLENGES No description of the structure High degree of autonomy Implicit Information Information Extraction: Data Structure Implicit Metadata RDF XML JSON DB 2

3 Proposed Solution A PROCESSING PIPELINE KDIR-IC3K 16 CCIS Book Series CAiSE 17 Classify Cells Revise Predictions Identify Tables Cell Features Region Features Heuristics Layouts 3

4 Proposed Solution A PROCESSING PIPELINE Classify Cells Revise Predictions Identify Tables Future Work Evaluation Dataset 4

5 Cell Classification* UNDERSTANDING THE DATA Work at the cell granularity: - Omit empty cells - Omit floating objects Use 5 layout categories (labels): Header Derived Metadata Data Attributes A B C D E 1 2 Number of Items (in Units) Sold per Region 3 Item 4 Location Adapter Total Mouse Monitor 5 VGA HDMI 6 Europe 7 Spain France Asia 10 China Vietnam Item Keyboard is omitted. Check next sheet. * Koci et al.: A machine learning approach for layout inference in spreadsheets. KDIR IC3K 16 5

6 Cell Classification* UNDERSTANDING THE DATA Work at the cell granularity: - Omit empty cells - Omit floating objects Use 5 layout categories (labels): Header Derived Metadata Data Attributes A B C D E 1 2 Number of Items (in Units) Sold per Region 3 Item 4 Location Adapter Total Mouse Monitor 5 VGA HDMI 6 Europe 7 Spain France Asia 10 China Vietnam Item Keyboard is omitted. Check next sheet. * Koci et al.: A machine learning approach for layout inference in spreadsheets. KDIR IC3K 16 6

7 Cell Classification* UNDERSTANDING THE DATA Work at the cell granularity: - Omit empty cells - Omit floating objects Use 5 layout categories (labels): Header Derived Metadata Data Attributes A B C D E 1 2 Number of Items (in Units) Sold per Region 3 Item 4 Location Adapter Total Mouse Monitor 5 VGA HDMI 6 Europe 7 Spain France Asia 10 China Vietnam Item Keyboard is omitted. Check next sheet. * Koci et al.: A machine learning approach for layout inference in spreadsheets. KDIR IC3K 16 7

8 Cell Classification* UNDERSTANDING THE DATA Work at the cell granularity: - Omit empty cells - Omit floating objects Use 5 layout categories (labels): Header Derived Metadata Data Attributes A B C D E 1 2 Number of Items (in Units) Sold per Region 3 Item 4 Location Adapter Total Mouse Monitor 5 VGA HDMI 6 Europe 7 Spain France Asia 10 China Vietnam Item Keyboard is omitted. Check next sheet. * Koci et al.: A machine learning approach for layout inference in spreadsheets. KDIR IC3K 16 8

9 Cell Classification* UNDERSTANDING THE DATA Work at the cell granularity: - Omit empty cells - Omit floating objects Use 5 layout categories (labels): Header Derived Metadata Data Attributes A B C D E 1 2 Number of Items (in Units) Sold per Region 3 Item 4 Location Adapter Total Mouse Monitor 5 VGA HDMI 6 Europe 7 Spain France Asia 10 China Vietnam Item Keyboard is omitted. Check next sheet. * Koci et al.: A machine learning approach for layout inference in spreadsheets. KDIR IC3K 16 9

10 Cell Classification* UNDERSTANDING THE DATA Work at the cell granularity: - Omit empty cells - Omit floating objects Use 5 layout categories (labels): Header Derived Metadata Data Attributes A B C D E 1 2 Number of Items (in Units) Sold per Region 3 Item 4 Location Adapter Total Mouse Monitor 5 VGA HDMI 6 Europe 7 Spain France Asia 10 China Vietnam Item Keyboard is omitted. Check next sheet. * Koci et al.: A machine learning approach for layout inference in spreadsheets. KDIR IC3K 16 10

11 Supervised Classifications of Cells COLLECTING FEATURES Binary and numerical >200 Selected 43 for evaluation Random Forest CLASSIFICATION Evaluated 5 classifiers Random Forest most accurate tree T cell features Type String Is Bold Font No Underline Center Aligned Default Fill Color Features Thick Left Border Not Aggregation Not Referenced Right is Empty Top Same Style Best Results for Data and Header 11

12 A PROCESSING PIPELINE Classify Cells Revise Predictions Identify Tables Future Work Evaluation Dataset 12

13 TIRS Framework* Table identification and Reconstruction HEURISTIC PROCEDURE Identify tables from the results of cell classification A procedure of 8 heuristics tasks Table 1 Table 3 Pre- Process TIRS Worksheet of Classified Cells Table 2 * Koci et al.: Table Identification and Reconstruction in Spreadsheets. CAiSE 17 13

14 Grouping Cells NOTIONS Work with larger structures Decrease complexity LABEL REGIONS (LRS) Adjacent Cells Same Label M H H H H H H H H A A D D D D B A D D D D B A A D D D D B A D D D D B M 14

15 Grouping Cells NOTIONS Work with larger structures Decrease complexity LABEL REGIONS (LRS) Adjacent Cells Same Label M H A D B M D B 15

16 Grouping Cells NOTIONS Work with larger structures Decrease complexity LABEL REGIONS (LRS) Adjacent Cells Same Label M H H H H H H H H A A D D D D B A A! D D D B A D! D D D D B A D D D D B M 16

17 Grouping Cells NOTIONS Work with larger structures Decrease complexity LABEL REGIONS (LRS) Adjacent Cells Same Label Rectilinear polygons M H H H H H H H H A A D D D D B A A! D D D B A D! D D D D B A D D D D B M 17

18 Grouping Cells NOTIONS Work with larger structures Decrease complexity LABEL REGIONS Adjacent Cells Same Label Rectilinear polygons MINIMUM BOUNDING RECTANGLE An approximation for LRs Extensively used for spatial problems (0,0) M H H H H H H H H A A D D D D B A A! D D D B A D! D D D D B A D D D D B M 18

19 Constructing Tables - Preliminaries CONSTRAINS FOR PAIRING HEADER WITH DATA 1) A Header is on the top of Data 2) The distance between them is the smallest 3) Their projections overlap significantly: D1 H overlap(projection D, projection(f)) max(projection_length D, projection_length(f)) > θ - Overlap ratio measures this significance - For the evaluation θ (empirically) set to 0.5 D3 D4 D2 19

20 Constructing Tables - Preliminaries CONSTRAINS FOR PAIRING HEADER WITH DATA 1) A Header is on the top of Data 2) The distance between them is the smallest 3) Their projections overlap significantly: H overlap(projection D, projection(f)) max(projection_length D, projection_length(f)) > θ - Overlap ratio measures this significance - For the evaluation θ (empirically) set to 0.5 D2 20

21 First 5 Heuristic Steps TABLE PATTERNS There are 5 patterns of tables Each corresponds to one heuristic task σ i N σ j M overlap(projection Hi, projection(dj)) max σ i N projection_length(hi), σ j M projection_length(dj) > θ H H 1 H 2 H D D One to One Many to One D H H 1 H 2 H break! D 1 D 2 D 1 D 2 One to Many Many to Many Breaker 21

22 First 5 Heuristic Steps TABLE PATTERNS There are 5 patterns of tables Each corresponds to one heuristic task A D A 1 D A 1 D 1 One to One A 2 Many to One A 2 D 2 Many to Many A 1 D 1 A D A D 2 One to Many Breaker! 22

23 Heuristics Steps 6-8 HANDLE UNPAIRED & REMOVE INCONSISTENCIES Assign regions In-between tables 23

24 Heuristics Steps 6-8 HANDLE UNPAIRED & REMOVE INCONSISTENCIES Assign regions In-between tables Detect overlapping tables 24

25 Heuristics Steps 6-8 HANDLE UNPAIRED & REMOVE INCONSISTENCIES Assign regions In-between tables Detect overlapping tables Give regions to nearest table 25

26 Heuristics Steps 6-8 HANDLE UNPAIRED & REMOVE INCONSISTENCIES Assign regions In-between tables Detect overlapping tables Give regions to nearest table 26

27 Evaluation: Table Orientations One Table Vertical Horizontal Both Precision 76% 62% 23% 45% Recall 83% 58% 24% 32% 27

28 Evaluation: Number of Misclassified Cells None Few(1-3) Bunch(4-7) Many(8- ) Precision 82% 56% 60% 53% Recall 82% 53% 53% 57% 28

29 Overall Results 65% 63% Precision Recall 29

30 A PROCESSING PIPELINE Classify Cells Revise Predictions Identify Tables Future Work Evaluation Dataset 30

31 Revising Classification Prediction* Load Standardize Re-label!! Identify * Koci et al.: Cell Classification for Layout Recognition in Spreadsheets. Chapter at Communications in Computer and Information Science (CCIS) book series. (Currently In Press) 31

32 Revising Classification Prediction M H H H H H H H H A A D D D D B A A! D D D B A D! D D D D B A D D D D B M A A D D D D A A! D D D A D! D D D D A D D D D M 32

33 Revising Classification Prediction IRREGULAR REGIONS HINT MISCLASSIFICATIONS A A D D D D A A! D D D A D! D D D D A D D D D M 33

34 Revising Classification Prediction IRREGULAR REGIONS HINT MISCLASSIFICATIONS Make all strictly rectangular Small regions are suspicious A A D D D D A A! D D D A D! D D D D A D D D D M?? Standardize!?? 34

35 Revising Classification Prediction IDENTIFY TRUE NEGATIVES A supervised machine learning task Use the context from the neighborhood???? Similarity Dissimilarity 35

36 Revising Classification Prediction IDENTIFY TRUE NEGATIVES A supervised machine learning task Use the context from the neighborhood Similarity Dissimilarity Influence? 36

37 Revising Classification Prediction IDENTIFY TRUE NEGATIVES A supervised machine learning task Use the context from the neighborhood Similarity Dissimilarity Influence Absence? 37

38 Revising Classification Prediction IDENTIFY TRUE NEGATIVES A supervised machine learning task Use the context from the neighborhood Similarity Dissimilarity Influence Absence? 38

39 Revising Classification Prediction IDENTIFY TRUE NEGATIVES A supervised machine learning task Use the context from the neighborhood!! 39

40 Revising Classification Prediction PREDICT TRUE LABEL (I.E., RELABELING) A supervised machine learning task Use the context from the neighborhood!! Similarity Dissimilarity Influence Absence 40

41 Revising Classification Prediction PREDICT TRUE LABEL (I.E., RELABELING) A supervised machine learning task Use the context from the neighborhood Similarity Dissimilarity Influence Absence 41

42 Revising Classification Prediction PREDICT TRUE LABEL (I.E., RELABELING) A supervised machine learning task Use the context from the neighborhood Similarity Dissimilarity Influence Absence 42

43 Revising Classification Prediction OVERALL PROCESS Load Standardize Re-label!! Identify 43

44 Revising Classification Prediction EVALUATION RESULTS Successfully identified almost half of misclassified (total 1,237) Relabeling approach needs further improvements Identifying Relabeling 44

45 A PROCESSING PIPELINE Classify Cells Revise Predictions Identify Tables Future Work Evaluation Dataset 45

46 The Evaluation Dataset THREE CORPORA OF SPREADSHEETS 288 Worksheets Tables FUSE ENRON EUSES TOTAL FUSE ENRON EUSES TOTAL Barik, T., Lubick, K., Smith, J., Slankas, J., & Murphy-Hill, E. FUSE: A Reproducible, Extendable, Internet-Scale Corpus of Spreadsheets. MSR 2015 Hermans, F., & Murphy-Hill, E. Enron's Spreadsheets and Related s: A Dataset and Analysis. ICSE 2015 Fisher, M., & Rothermel, G. The EUSES Spreadsheet Corpus: a Shared Resource for Supporting Experimentation With Spreadsheet Dependability Mechanisms. ACM SIGSOFT Software Engineering Notes 2005 (Vol. 30, No. 4, pp. 1-5) 46

47 The Evaluation Dataset ANNOTATION TOOL 47

48 A PROCESSING PIPELINE Classify Cells Revise Predictions Identify Tables Future Work Evaluation Dataset 48

49 Future Work GRAPH REPRESENTATION M H H A D B M D B D H Right-Left 0.2 D Top-Bottom 1.0 M 49

50 Future Work GRAPH REPRESENTATION M H split H A D B M D B D H D M 50

51 Future Work GRAPH REPRESENTATION M H H A D B M D B D H D M 51

52 Future Work Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function 52

53 Future Work Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function Demo: a plugin for Excel to extract information Plugin Results Features Web Service 53

54 Future Work FUTURE WORK Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function Demo: a plugin for Excel to extract information Plugin Web Service 54

55 Future Work FUTURE WORK Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function Demo: a plugin for Excel to extract information Plugin Web Service 55

56 Future Work FUTURE WORK Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function Demo: a plugin for Excel to extract information Cell classification using Recurrent Neural Networks posts/ Understanding-LSTMs/ 56

57 Conclusions & Future Work FUTURE WORK Table 1 Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function Demo: a plugin for Excel to extract information Cell classification using Recurrent Neural Networks Schema Extraction from identified tables Table 3 Table 2 57

58 Conclusions & Future Work FUTURE WORK Table 1 Graph-based Table Identification (target ECIR) - Node clustering - Guided minimum-cut - Fitness function Demo: a plugin for Excel to extract information Cell classification using Recurrent Neural Networks Schema Extraction from identified tables Table 3 Table 2 58

59 Related Work 1. Chen, Z., & Cafarella, M. (2013, August). Automatic Web Spreadsheet Data Extraction. In Proceedings of the 3rd International Workshop on Semantic Search Over the Web (p. 1). ACM. 2. Eberius, J., Werner, C., Thiele, M., Braunschweig, K., Dannecker, L., & Lehner, W. (2013, October). DeExcelerator: A Framework for Extracting Relational Data from Partially Structured Documents. In Proceedings of the 22nd ACM International Conference on Information & Knowledge Management (pp ). ACM. 3. Adelfio, M. D. and Samet, H. (2013). Schema Extraction for Tabular Data on the Web. VLDB 13, 6(6): Cunha, J., Saraiva, J., & Visser, J. (2009, January). From Spreadsheets to Relational Databases and Back. In Proceedings of the 2009 ACM SIGPLAN Workshop on Partial Evaluation and Program Manipulation (pp ). ACM. 5. Abraham, R., & Erwig, M. (2007). UCheck: A Spreadsheet Type Checker for End Users. Journal of Visual Languages & Computing, 18(1), Shigarov, Alexey O. (2015). "Table Understanding Using a Rule Engine. Expert Systems with Applications, 42(2): Swidan, A., & Hermans, F. (2017, June). Semi-automatic Extraction of Cross-Table Data from a Set of Spreadsheets. In International Symposium on End User Development (pp ). Springer, Cham. 59

60 Thank You!

61 Region Spatial Arrangements 61

62 Constructing Tables - Preliminaries CONSTRAINS FOR PAIRING ATTRIBUTES WITH DATA 1) Attributes are on the left of Data 2) The distance between them is the smallest 3) Their projections overlap significantly: overlap(projection D, projection(f)) max(projection_length D, projection_length(f)) > θ A D - Overlap ratio measures this significance - For the evaluation θ (empirically) set to

63 Constructing Tables - Preliminaries CONSTRAINS FOR PAIRING ATTRIBUTES WITH DATA 1) Attributes are on the left of Data 2) The distance between them is the smallest 3) Their projections overlap significantly: H overlap(projection D, projection(f)) max(projection_length D, projection_length(f)) > θ A D - Overlap ratio measures this significance - For the evaluation θ (empirically) set to

SpreadsheetDoc: An Excel Add-in for Documenting Spreadsheets

SpreadsheetDoc: An Excel Add-in for Documenting Spreadsheets SpreadsheetDoc: An Excel Add-in for Documenting Spreadsheets Diogo Canteiro and Jácome Cunha Universidade Nova de Lisboa, Portugal d.canteiro@campus.fct.unl.pt jacome@fct.unl.pt Abstract. Documentation

More information

WEEK NO. 12 MICROSOFT EXCEL 2007

WEEK NO. 12 MICROSOFT EXCEL 2007 WEEK NO. 12 MICROSOFT EXCEL 2007 LESSONS OVERVIEW: GOODBYE CALCULATORS, HELLO SPREADSHEET! 1. The Excel Environment 2. Starting A Workbook 3. Modifying Columns, Rows, & Cells 4. Working with Worksheets

More information

COMPUTERIZED OFFICE SUPPORT PROGRAM

COMPUTERIZED OFFICE SUPPORT PROGRAM NH108 Excel Level 1 16 Total Hours COURSE TITLE: Excel Level 1 COURSE OVERVIEW: This course provides students with the knowledge and skills to create spreadsheets and workbooks that can be used to store,

More information

Structure in On-line Documents

Structure in On-line Documents Structure in On-line Documents Anil K. Jain and Anoop M. Namboodiri Department of Comp. Sci. and Engg. Michigan State University East Lansing, MI 4884 fjain, anoopg@cse.msu.edu Jayashree Subrahmonia IBM

More information

Microsoft Excel Chapter 1. Creating a Worksheet and an Embedded Chart

Microsoft Excel Chapter 1. Creating a Worksheet and an Embedded Chart Microsoft Excel 2010 Chapter 1 Creating a Worksheet and an Embedded Chart Objectives Describe the Excel worksheet Enter text and numbers Use the Sum button to sum a range of cells Copy the contents of

More information

Microsoft How to Series

Microsoft How to Series Microsoft How to Series Getting Started with EXCEL 2007 A B C D E F Tabs Introduction to the Excel 2007 Interface The Excel 2007 Interface is comprised of several elements, with four main parts: Office

More information

EXCEL 2003 DISCLAIMER:

EXCEL 2003 DISCLAIMER: EXCEL 2003 DISCLAIMER: This reference guide is meant for experienced Microsoft Excel users. It provides a list of quick tips and shortcuts for familiar features. This guide does NOT replace training or

More information

Microsoft Office Excel 2007: Basic. Course Overview. Course Length: 1 Day. Course Overview

Microsoft Office Excel 2007: Basic. Course Overview. Course Length: 1 Day. Course Overview Microsoft Office Excel 2007: Basic Course Length: 1 Day Course Overview This course teaches the basic functions and features of Excel 2007. After an introduction to spreadsheet terminology and Excel's

More information

I OFFICE TAB... 1 RIBBONS & GROUPS... 2 OTHER SCREEN PARTS... 4 APPLICATION SPECIFICATIONS... 5 THE BASICS...

I OFFICE TAB... 1 RIBBONS & GROUPS... 2 OTHER SCREEN PARTS... 4 APPLICATION SPECIFICATIONS... 5 THE BASICS... EXCEL 2010 BASICS Microsoft Excel I OFFICE TAB... 1 RIBBONS & GROUPS... 2 OTHER SCREEN PARTS... 4 APPLICATION SPECIFICATIONS... 5 THE BASICS... 6 The Mouse... 6 What Are Worksheets?... 6 What is a Workbook?...

More information

Introduction. Formatting cells. Google Sheets. Formatting Cells

Introduction. Formatting cells. Google Sheets. Formatting Cells Google Sheets Formatting Cells Introduction After you've added a lot of content to a spreadsheet, it can sometimes be difficult to view and read all of your information easily. Formatting allows you to

More information

Microsoft Excel Chapter 2. Formulas, Functions, and Formatting

Microsoft Excel Chapter 2. Formulas, Functions, and Formatting Microsoft Excel 2010 Chapter 2 Formulas, Functions, and Formatting Objectives Enter formulas using the keyboard Enter formulas using Point mode Apply the AVERAGE, MAX, and MIN functions Verify a formula

More information

MICROSOFT OFFICE. Courseware: Exam: Sample Only EXCEL 2016 CORE. Certification Guide

MICROSOFT OFFICE. Courseware: Exam: Sample Only EXCEL 2016 CORE. Certification Guide MICROSOFT OFFICE Courseware: 3263 2 Exam: 77 727 EXCEL 2016 CORE Certification Guide Microsoft Office Specialist 2016 Series Microsoft Excel 2016 Core Certification Guide Lesson 1: Introducing Excel Lesson

More information

Classification and Information Extraction for Complex and Nested Tabular Structures in Images

Classification and Information Extraction for Complex and Nested Tabular Structures in Images Classification and Information Extraction for Complex and Nested Tabular Structures in Images Abstract Understanding of technical documents, like manuals, is one of the most important steps in automatic

More information

Microsoft Excel 2010

Microsoft Excel 2010 Microsoft Excel 2010 omar 2013-2014 First Semester 1. Exploring and Setting Up Your Excel Environment Microsoft Excel 2010 2013-2014 The Ribbon contains multiple tabs, each with several groups of commands.

More information

Microsoft Word 2007 on Windows

Microsoft Word 2007 on Windows 1 Microsoft Word 2007 on Windows Word is a very popular text formatting and editing program. It is the standard for writing papers and other documents. This tutorial and quick start guide will help you

More information

The EUSES Spreadsheet Corpus: A Shared Resource for Supporting Experimentation with Spreadsheet Dependability Mechanisms

The EUSES Spreadsheet Corpus: A Shared Resource for Supporting Experimentation with Spreadsheet Dependability Mechanisms The EUSES Spreadsheet Corpus: A Shared Resource for Supporting Experimentation with Spreadsheet Dependability Mechanisms Marc Fisher II and Gregg Rothermel Department of Computer Science and Engineering

More information

PivotTables & Charts for Health

PivotTables & Charts for Health PivotTables & Charts for Health Data Inputs PivotTables Pivot Charts Global Strategic Information UCSF Global Health Sciences Version Malaria 1.0 1 Table of Contents 1.1. Introduction... 3 1.1.1. Software

More information

Lesson 1: Creating a Worksheet and a Chart Microsoft Excel 2016 IN THIS CHAPTER, YOU WILL LEARN HOW TO

Lesson 1: Creating a Worksheet and a Chart Microsoft Excel 2016 IN THIS CHAPTER, YOU WILL LEARN HOW TO Lesson 1: Creating a Worksheet and a Chart Microsoft Excel 2016 IN THIS CHAPTER, YOU WILL LEARN HOW TO Describe the Excel worksheet Enter text and numbers Use the Sum button to sum a range of cells Enter

More information

IV. Arranging & Viewing the Worksheet

IV. Arranging & Viewing the Worksheet IV. Arranging & Viewing the Worksheet Adding & deleting rows and columns Excel makes it easy to rearrange a worksheet. Along with dragging data ranges to move them, you can add or delete rows and columns.

More information

Basic Excel 2010 Workshop 101

Basic Excel 2010 Workshop 101 Basic Excel 2010 Workshop 101 Class Workbook Instructors: David Newbold Jennifer Tran Katie Spencer UCSD Libraries Educational Services 06/13/11 Why Use Excel? 1. It is the most effective and efficient

More information

Going digital Challenge & solutions in a newspaper archiving project. Andrey Lomov ATAPY Software Russia

Going digital Challenge & solutions in a newspaper archiving project. Andrey Lomov ATAPY Software Russia Going digital Challenge & solutions in a newspaper archiving project Andrey Lomov ATAPY Software Russia Problem Description Poor recognition results caused by low image quality: noise, white holes in characters,

More information

Fall 2016 Exam Review 3 Module Test

Fall 2016 Exam Review 3 Module Test 1. What is the block of text at the bottom of the page called? Header Footer Document Area Ribbon 2. Which word processing tool can help you find synonyms to improve your word choice? Spelling and Grammar

More information

Ms excel. The Microsoft Office Button. The Quick Access Toolbar

Ms excel. The Microsoft Office Button. The Quick Access Toolbar Ms excel MS Excel is electronic spreadsheet software. In This software we can do any type of Calculation & inserting any table, data and making chart and graphs etc. the File of excel is called workbook.

More information

Microsoft Excel 2010 Basics

Microsoft Excel 2010 Basics Microsoft Excel 2010 Basics Starting Word 2010 with XP: Click the Start Button, All Programs, Microsoft Office, Microsoft Excel 2010 Starting Word 2010 with 07: Click the Microsoft Office Button with the

More information

Budget Exercise for Intermediate Excel

Budget Exercise for Intermediate Excel Budget Exercise for Intermediate Excel Follow the directions below to create a 12 month budget exercise. Read through each individual direction before performing it, like you are following recipe instructions.

More information

Excel Main Screen. Fundamental Concepts. General Keyboard Shortcuts Open a workbook Create New Save Preview and Print Close a Workbook

Excel Main Screen. Fundamental Concepts. General Keyboard Shortcuts Open a workbook Create New Save Preview and Print Close a Workbook Excel 2016 Main Screen Fundamental Concepts General Keyboard Shortcuts Open a workbook Create New Save Preview and Print Close a Ctrl + O Ctrl + N Ctrl + S Ctrl + P Ctrl + W Help Run Spell Check Calculate

More information

Introduction to Excel 2007

Introduction to Excel 2007 Introduction to Excel 2007 Excel 2007 is a software program that creates a spreadsheet. It permits the user to enter data and formulas to perform mathematical and Boolean (comparison) calculations on the

More information

Beginning Excel. Revised 4/19/16

Beginning Excel. Revised 4/19/16 Beginning Excel Objectives: The Learner will: Become familiar with terminology used in Microsoft Excel Create a simple workbook Write a simple formula Formatting Cells Adding Columns Borders Table of Contents:

More information

Talking about apples and oranges

Talking about apples and oranges S C I E N C E P A S S I O N T E C H N O L O G Y Talking about apples and oranges Static analysis techniques Part II Birgit Hofer Institute for Software Technology 1 u www.tugraz.at Risk comes from not

More information

Microsoft Excel for Beginners

Microsoft Excel for Beginners Microsoft Excel for Beginners training@health.ufl.edu Basic Computing 4 Microsoft Excel 2.0 hours This is a basic computer workshop. Microsoft Excel is a spreadsheet program. We use it to create reports

More information

Microsoft Excel Level 1

Microsoft Excel Level 1 Microsoft Excel 2010 Level 1 Copyright 2010 KSU Department of Information Technology Services This document may be downloaded, printed, or copied for educational use without further permission of the Information

More information

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Katsuya Masuda *, Makoto Tanji **, and Hideki Mima *** Abstract This study proposes a framework to access to the

More information

Learning Path Queries on Graph Databases

Learning Path Queries on Graph Databases Learning Path Queries on Graph Databases Radu Ciucanu joint work with Angela Bonifati and Aurélien Lemay University of Lille, France INRIA Lille Nord Europe Links Project EDBT 15 March 24, 2015 Radu Ciucanu

More information

CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010

CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 Quick Summary A workbook an Excel document that stores data contains one or more pages called a worksheet. A worksheet or spreadsheet is stored in a workbook, and

More information

SUM - This says to add together cells F28 through F35. Notice that it will show your result is

SUM - This says to add together cells F28 through F35. Notice that it will show your result is COUNTA - The COUNTA function will examine a set of cells and tell you how many cells are not empty. In this example, Excel analyzed 19 cells and found that only 18 were not empty. COUNTBLANK - The COUNTBLANK

More information

4) Study the section of a worksheet in the image below. What is the cell address of the cell containing the word "Qtr3"?

4) Study the section of a worksheet in the image below. What is the cell address of the cell containing the word Qtr3? Choose The Correct Answer: 1) Study the highlighted cells in the image below and identify which of the following represents the correct cell address for these cells: a) The cell reference for the selected

More information

Spreadsheets. Dr. Abdallah Mohamed

Spreadsheets. Dr. Abdallah Mohamed Spreadsheets Dr. Abdallah Mohamed Key Points 1) Spreadsheets are programs for storing and manipulating data that is represented as a table of cells. 2) Each cell has a row number and column label which

More information

Keywords Data alignment, Data annotation, Web database, Search Result Record

Keywords Data alignment, Data annotation, Web database, Search Result Record Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web

More information

Formatting Spreadsheets in Microsoft Excel

Formatting Spreadsheets in Microsoft Excel Formatting Spreadsheets in Microsoft Excel This document provides information regarding the formatting options available in Microsoft Excel 2010. Overview of Excel Microsoft Excel 2010 is a powerful tool

More information

Automatic Initialization of the TLD Object Tracker: Milestone Update

Automatic Initialization of the TLD Object Tracker: Milestone Update Automatic Initialization of the TLD Object Tracker: Milestone Update Louis Buck May 08, 2012 1 Background TLD is a long-term, real-time tracker designed to be robust to partial and complete occlusions

More information

Exploring Microsoft Office Excel 2007

Exploring Microsoft Office Excel 2007 Exploring Microsoft Office Excel 2007 Chapter 1: Introduction to Excel What Can I Do with a Spreadsheet Objectives Define worksheets and workbooks Use spreadsheets across disciplines Plan for good workbook

More information

Microsoft Excel 2010 Tutorial

Microsoft Excel 2010 Tutorial 1 Microsoft Excel 2010 Tutorial Excel is a spreadsheet program in the Microsoft Office system. You can use Excel to create and format workbooks (a collection of spreadsheets) in order to analyze data and

More information

Microsoft Excel 2007 Beginning The information below is devoted to Microsoft Excel and the basics of the program.

Microsoft Excel 2007 Beginning The information below is devoted to Microsoft Excel and the basics of the program. Microsoft Excel 2007 Beginning The information below is devoted to Microsoft Excel and the basics of the program. Starting Excel Option 1: Click the Start button on the taskbar, then Programs>Microsoft

More information

Open Office Calc (Spreadsheet) Tutorial

Open Office Calc (Spreadsheet) Tutorial Open Office Calc (Spreadsheet) Tutorial Table of Contents Introduction...3 What is a Spreadsheet?...3 Starting OpenOffice Calc...3 OpenOffice Calc (Spreadsheet) Basics...4 Creating a New Document...5 Entering

More information

MERGING BUSINESS VOCABULARIES AND RULES

MERGING BUSINESS VOCABULARIES AND RULES MERGING BUSINESS VOCABULARIES AND RULES Edvinas Sinkevicius Departament of Information Systems Centre of Information System Design Technologies, Kaunas University of Lina Nemuraite Departament of Information

More information

Overview. At Course Completion After completing this course, students will be learn about and be able to:

Overview. At Course Completion After completing this course, students will be learn about and be able to: Overview Organizations the world over rely on information to make sound decisions regarding all manner of affairs. But with the amount of available data growing on a daily basis, the ability to make sense

More information

-Using Excel- *The columns are marked by letters, the rows by numbers. For example, A1 designates row A, column 1.

-Using Excel- *The columns are marked by letters, the rows by numbers. For example, A1 designates row A, column 1. -Using Excel- Note: The version of Excel that you are using might vary slightly from this handout. This is for Office 2004 (Mac). If you are using a different version, while things may look slightly different,

More information

Flowlogic. User Manual Version GraphLogic: Developed by scientists, for scientists. Graphing and Statistical Analysis.

Flowlogic. User Manual Version GraphLogic: Developed by scientists, for scientists. Graphing and Statistical Analysis. Flowlogic Flow Cytometry Analysis Software Developed by scientists, for scientists User Manual Version 7.2.1 GraphLogic: Graphing and Statistical Analysis www.inivai.com TABLE OF CONTENTS GraphLogic Graphing

More information

Microsoft Excel 2010 Handout

Microsoft Excel 2010 Handout Microsoft Excel 2010 Handout Excel is an electronic spreadsheet program you can use to enter and organize data, and perform a wide variety of number crunching tasks. Excel helps you organize and track

More information

Day : Date : Objects : Open MS Excel program * Open Excel application. Select : start. Choose: programs. Choose : Microsoft Office.

Day : Date : Objects : Open MS Excel program * Open Excel application. Select : start. Choose: programs. Choose : Microsoft Office. Day : Date : Objects : Open MS Excel program * Open Excel application. Select : start Choose: programs Choose : Microsoft Office Select: Excel *The interface of Excel program - Menu bar. - Standard bar.

More information

FUNDAMENTAL SPREADSHEET APPLICATIONS (230)

FUNDAMENTAL SPREADSHEET APPLICATIONS (230) FUNDAMENTAL SPREADSHEET APPLICATIONS - REGIONAL 2017 PAGE 1 OF 7 Contestant Number: Time: Rank: FUNDAMENTAL SPREADSHEET APPLICATIONS (230) REGIONAL 2017 Job 1: Create Spreadsheet with Formulas (110 points)

More information

Microsoft Office. Microsoft Office

Microsoft Office. Microsoft Office is an office suite of interrelated desktop applications, servers and services for the Microsoft Windows. It is a horizontal market software that is used in a wide range of industries. was introduced by

More information

Contents. 1. Managing Seed Plan Spreadsheet

Contents. 1. Managing Seed Plan Spreadsheet By Peter K. Mulwa Contents 1. Managing Seed Plan Spreadsheet Seed Enterprise Management Institute (SEMIs) Managing Seed Plan Spreadsheet Using Microsoft Excel 2010 3 Definition of Terms Spreadsheet: A

More information

THE EXCEL ENVIRONMENT... 1 EDITING...

THE EXCEL ENVIRONMENT... 1 EDITING... Excel Essentials TABLE OF CONTENTS THE EXCEL ENVIRONMENT... 1 EDITING... 1 INSERTING A COLUMN... 1 DELETING A COLUMN... 1 INSERTING A ROW... DELETING A ROW... MOUSE POINTER SHAPES... USING AUTO-FILL...

More information

1) Merge the cells that contain the title and center the title

1) Merge the cells that contain the title and center the title Supplies: You will need a storage location to save your spreadsheet for use in Session 2. You will need the 2 handouts pertaining to Session 1 Instructions: Follow the directions below to create a budget

More information

Application of Skills: Microsoft Excel 2013 Tutorial

Application of Skills: Microsoft Excel 2013 Tutorial Application of Skills: Microsoft Excel 2013 Tutorial Throughout this module, you will progress through a series of steps to create a spreadsheet for sales of a club or organization. You will continue to

More information

Personal Computing EN1301

Personal Computing EN1301 Personal Computing EN1301 (12 hrs.) Instructional Objectives Explain word processing Distinguish different word processing applications Discover the different features of a word processing application

More information

Excel. module. Lesson 1 Create a Worksheet Lesson 2 Create and Revise. Lesson 3 Edit and Format

Excel. module. Lesson 1 Create a Worksheet Lesson 2 Create and Revise. Lesson 3 Edit and Format module 2 Excel Lesson 1 Create a Worksheet Lesson 2 Create and Revise Formulas Lesson 3 Edit and Format Worksheets Lesson 4 Print Worksheets Lesson 5 Modify Workbooks Lesson 6 Create and Modify Charts

More information

The Menu and Toolbar in Excel (see below) look much like the Word tools and most of the tools behave as you would expect.

The Menu and Toolbar in Excel (see below) look much like the Word tools and most of the tools behave as you would expect. Launch the Microsoft Excel Program Click on the program icon in Launcher or the Microsoft Office Shortcut Bar. A worksheet is a grid, made up of columns, which are lettered and rows, and are numbered.

More information

A cell is highlighted when a thick black border appears around it. Use TAB to move to the next cell to the LEFT. Use SHIFT-TAB to move to the RIGHT.

A cell is highlighted when a thick black border appears around it. Use TAB to move to the next cell to the LEFT. Use SHIFT-TAB to move to the RIGHT. Instructional Center for Educational Technologies EXCEL 2010 BASICS Things to Know Before You Start The cursor in Excel looks like a plus sign. When you click in a cell, the column and row headings will

More information

Rich Text Editor Quick Reference

Rich Text Editor Quick Reference Rich Text Editor Quick Reference Introduction Using the rich text editor is similar to using a word processing application such as Microsoft Word. After data is typed into the editing area it can be formatted

More information

Excel Select a template category in the Office.com Templates section. 5. Click the Download button.

Excel Select a template category in the Office.com Templates section. 5. Click the Download button. Microsoft QUICK Excel 2010 Source Getting Started The Excel Window u v w z Creating a New Blank Workbook 2. Select New in the left pane. 3. Select the Blank workbook template in the Available Templates

More information

lab MS Excel 2010 active cell

lab MS Excel 2010 active cell MS Excel is an example of a spreadsheet, a branch of software meant for performing different kinds of calculations, numeric data analysis and presentation, statistical operations and forecasts. The main

More information

Excel 2010-Part. Two

Excel 2010-Part. Two Jefferson Parish Library Computer Training Team Excel 2010-Part Two August 2011 Symbols Used in Formulas Add Subtract Divide Multiply + - / * When working with formulas in Excel you will use basic keyboard

More information

Microsoft Excel Chapter 1. Creating a Worksheet and a Chart

Microsoft Excel Chapter 1. Creating a Worksheet and a Chart Microsoft Excel 2013 Chapter 1 Creating a Worksheet and a Chart Objectives Describe the Excel worksheet Enter text and numbers Use the Sum button to sum a range of cells Enter a simple function Copy the

More information

Creating a Basic Chart in Excel 2007

Creating a Basic Chart in Excel 2007 Creating a Basic Chart in Excel 2007 A chart is a pictorial representation of the data you enter in a worksheet. Often, a chart can be a more descriptive way of representing your data. As a result, those

More information

XML Data in (Object-) Relational Databases

XML Data in (Object-) Relational Databases XML Data in (Object-) Relational Databases RNDr. Irena Mlýnková irena.mlynkova@mff.cuni.cz Charles University Faculty of Mathematics and Physics Department of Software Engineering Prague, Czech Republic

More information

Excel 2013 Workshop. Prepared by

Excel 2013 Workshop. Prepared by Excel 2013 Workshop Prepared by Joan Weeks Computer Labs Manager & Madeline Davis Computer Labs Assistant Department of Library and Information Science June 2014 Excel 2013: Fundamentals Course Description

More information

Starting Excel application

Starting Excel application MICROSOFT EXCEL 1 2 Microsoft Excel: is a special office program used to apply mathematical operations according to reading a cell automatically, just click on it. It is called electronic tables Starting

More information

Lesson 19 Organizing and Enhancing Worksheets

Lesson 19 Organizing and Enhancing Worksheets Organizing and Enhancing Worksheets Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition 1 Objectives Hide, show, and freeze columns and rows. Create, rename, and delete worksheets. Change

More information

1 THE PNP BASIC COMPUTER ESSENTIALS e-learning (MS Excel 2007)

1 THE PNP BASIC COMPUTER ESSENTIALS e-learning (MS Excel 2007) 1 THE PNP BASIC COMPUTER ESSENTIALS e-learning (MS Excel 2007) 2 THE PNP BASIC COMPUTER ESSENTIALS e-learning (MS Excel 2007) TABLE OF CONTENTS CHAPTER 1: GETTING STARTED... 5 THE EXCEL ENVIRONMENT...

More information

Unit 9: Excel Page( )

Unit 9: Excel Page( ) Unit 9: Excel Page( 496-499) Lab: A. Font B. Fill color C. Font color D. View buttons E. Numeric entry F. Row G. Cell H. Column I. Workbook window J. Active sheet K. Status bar L. Range M. Column labels

More information

Static Spreadsheet Analysis

Static Spreadsheet Analysis Debugging of Spreadsheets (DEOS) Project Funded by the Austrian Science Fund (FWF, contract I2144) and the Deutsche Forschungsgemeinschaft (DFG,contract JA 2095/4-1) 1 2 the number of genomics papers packaged

More information

In this section you will learn some simple data entry, editing, formatting techniques and some simple formulae. Contents

In this section you will learn some simple data entry, editing, formatting techniques and some simple formulae. Contents In this section you will learn some simple data entry, editing, formatting techniques and some simple formulae. Contents Section Topic Sub-topic Pages Section 2 Spreadsheets Layout and Design S2: 2 3 Formulae

More information

Microsoft Excel 2013: Excel Basics June 2014

Microsoft Excel 2013: Excel Basics June 2014 Microsoft Excel 2013: Excel Basics June 2014 Description Excel is a powerful spreadsheet program. Please note that in this class we will use Excel 2010 or 2013. Learn how to create spreadsheets, enter

More information

Table of Contents. Word. Using the mouse wheel 39 Moving the insertion point using the keyboard 40 Resume reading 41

Table of Contents. Word. Using the mouse wheel 39 Moving the insertion point using the keyboard 40 Resume reading 41 Table of Contents iii Table of Contents Word Starting Word What is word processing? 2 Starting Word 2 Exploring the Start screen 4 Creating a blank document 4 Exploring the Word document window 5 Exploring

More information

Excel 2016 Basics for Windows

Excel 2016 Basics for Windows Excel 2016 Basics for Windows Excel 2016 Basics for Windows Training Objective To learn the tools and features to get started using Excel 2016 more efficiently and effectively. What you can expect to learn

More information

Microsoft Office Excel 2010: Basic. Course Overview. Course Length: 1 Day. Course Overview

Microsoft Office Excel 2010: Basic. Course Overview. Course Length: 1 Day. Course Overview Microsoft Office Excel 2010: Basic Course Length: 1 Day Course Overview This course teaches the basic functions and features of Excel 2010. After an introduction to spreadsheet terminology and Excel's

More information

The toolbars at the top are the standard toolbar and the formatting toolbar.

The toolbars at the top are the standard toolbar and the formatting toolbar. Lecture 8 EXCEL Excel is a spreadsheet (all originally developed for bookkeeping and accounting). It is very useful for any mathematical or tabular operations. It allows you to make quick changes in input

More information

IITS Workshop Creating a Gradebook in Microsoft Office Excel 2007

IITS Workshop Creating a Gradebook in Microsoft Office Excel 2007 IITS Workshop Creating a Gradebook in Microsoft Office Excel 2007 Table of Contents Our Gradebook Example:... 3 Set Up a New Gradebook... 3 Adding Formulas (Averages and Final Grades)... 4 Formatting data...

More information

Microsoft Excel 2000 Charts

Microsoft Excel 2000 Charts You see graphs everywhere, in textbooks, in newspapers, magazines, and on television. The ability to create, read, and analyze graphs are essential parts of a student s education. Creating graphs by hand

More information

Introduction to Excel 2013

Introduction to Excel 2013 Introduction to Excel 2013 Instructions The is composed of separate of parts which test your knowledge and understanding of some of the core concepts addressed in each lesson. Complete each part in the

More information

Microsoft Excel 2013: Part 3 More on Formatting Cells And Worksheet Basics. To apply number formatting:

Microsoft Excel 2013: Part 3 More on Formatting Cells And Worksheet Basics. To apply number formatting: Microsoft Excel 2013: Part 3 More on Formatting Cells And Worksheet Basics Formatting text and numbers In Excel, you can apply specific formatting for text and numbers instead of displaying all cell content

More information

Agilent MassHunter Workstation Software Report Designer Add-in

Agilent MassHunter Workstation Software Report Designer Add-in Agilent MassHunter Workstation Software Report Designer Add-in Quick Start Guide What is the Agilent MassHunter Workstation Software Report Designer Add-in? 2 Report Designer UI elements 3 Getting Started

More information

Microsoft Excel 2016 / 2013 Basic & Intermediate

Microsoft Excel 2016 / 2013 Basic & Intermediate Microsoft Excel 2016 / 2013 Basic & Intermediate Duration: 2 Days Introduction Basic Level This course covers the very basics of the Excel spreadsheet. It is suitable for complete beginners without prior

More information

LABORATORY 1 Data Analysis & Graphing in Excel

LABORATORY 1 Data Analysis & Graphing in Excel LABORATORY 1 Data Analysis & Graphing in Excel Goal: In this lab, you will learn how to enter and manipulate data in Excel and you will learn how to make the graphs you will need for your lab write-ups.

More information

BaSICS OF excel By: Steven 10.1

BaSICS OF excel By: Steven 10.1 BaSICS OF excel By: Steven 10.1 Workbook 1 workbook is made out of spreadsheet files. You can add it by going to (File > New Workbook). Cell Each & every rectangular box in a spreadsheet is referred as

More information

ADVANCED SPREADSHEET APPLICATIONS (235)

ADVANCED SPREADSHEET APPLICATIONS (235) Page 1 of 7 Contestant Number: Time: Rank: ADVANCED SPREADSHEET APPLICATIONS (235) REGIONAL 2017 Job: Family Budget TOTAL POINTS (375 points) (375 points) Failure to adhere to any of the following rules

More information

Office Administrator

Office Administrator DURATION: 40 hours Office Administrator COURSE FEES: 3300 (Inclusive All) OBJECTIVE: To give knowledge of basic computer operations. CONTENTS: The course covers the introduction to the evolution of computers,

More information

Excel 2016: Part 1. Updated January 2017 Copy cost: $1.50

Excel 2016: Part 1. Updated January 2017 Copy cost: $1.50 Excel 2016: Part 1 Updated January 2017 Copy cost: $1.50 Getting Started Please note that you are required to have some basic computer skills for this class. Also, any experience with Microsoft Word is

More information

ICT IGCSE Practical Revision Presentation Word Processing

ICT IGCSE Practical Revision Presentation Word Processing Page Layout Header & Footer Font Styles Image wrapping List Styles Indentation & Spacing Find & Replace Create/Format Table Common Mistakes Orphan & Widows Completed Example Mail Merge Page Layout (Size

More information

Technology for Merchandise Planning and Control

Technology for Merchandise Planning and Control Technology for Merchandise Planning and Control Contents: Module Three: Formatting Worksheets Working with Charts UREFERENCE/PAGES Formatting Worksheets... Unit C Formatting Values... Excel 52 Excel 57

More information

GRAPHING IN EXCEL EXCEL LAB #2

GRAPHING IN EXCEL EXCEL LAB #2 GRAPHING IN EXCEL EXCEL LAB #2 ECON/BUSN 180: Quantitative Methods for Economics and Business Department of Economics and Business Lake Forest College Lake Forest, IL 60045 Copyright, 2011 Overview This

More information

Excel Tutorial 1

Excel Tutorial 1 IT٢.we Excel 2003 - Tutorial 1 Spreadsheet Basics Screen Layout Title bar Menu bar Standard Toolbar Other Tools Task Pane Adding and Renaming Worksheets Modifying Worksheets Moving Through Cells Adding

More information

Excel 2016 Basics for Mac

Excel 2016 Basics for Mac Excel 2016 Basics for Mac Excel 2016 Basics for Mac Training Objective To learn the tools and features to get started using Excel 2016 more efficiently and effectively. What you can expect to learn from

More information

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks SOMSN: An Effective Self Organizing Map for Clustering of Social Networks Fatemeh Ghaemmaghami Research Scholar, CSE and IT Dept. Shiraz University, Shiraz, Iran Reza Manouchehri Sarhadi Research Scholar,

More information

Guide to WB Annotations

Guide to WB Annotations Guide to WB Annotations 04 May 2016 Annotations are a powerful new feature added to Workbench v1.2.0 (Released May 2016) for placing text and symbols within wb_view tabs and windows. They enable generation

More information

Word processing software

Word processing software Unit 351 Word processing software UAN: Level: 3 Credit value: 6 GLH: 45 Y/502/4629 Assessment type: Portfolio of Evidence or assignment (7574 ITQ Users) Relationship to NOS: Assessment requirements specified

More information

Excel 2010: Getting Started with Excel

Excel 2010: Getting Started with Excel Excel 2010: Getting Started with Excel Excel 2010 Getting Started with Excel Introduction Page 1 Excel is a spreadsheet program that allows you to store, organize, and analyze information. In this lesson,

More information

MICROSOFT EXCEL BIS 202. Lesson 1. Prepared By: Amna Alshurooqi Hajar Alshurooqi

MICROSOFT EXCEL BIS 202. Lesson 1. Prepared By: Amna Alshurooqi Hajar Alshurooqi MICROSOFT EXCEL Prepared By: Amna Alshurooqi Hajar Alshurooqi Lesson 1 BIS 202 1. INTRODUCTION Microsoft Excel is a spreadsheet application used to perform financial calculations, statistical analysis,

More information