Apache Mahout. Scaling Machine Learning. Presented by: Isabel Drost

Size: px
Start display at page:

Download "Apache Mahout. Scaling Machine Learning. Presented by: Isabel Drost"

Transcription

1 Apache Mahout Scaling Machine Learning Presented by: Isabel Drost

2 Agenda Motivation. Machine learning? Introducing Mahout. How can you help?

3 Some motivation.

4 January 3, 2006 by Matt Callow

5 Follow news stories September 10, 2008 by Alex Barth Search through papers. Automatic topic tracker.

6 March 7, 2008 by extranoise

7 Movie recommendation March 22, 2008 by Crystian Cruz IMDB + movie reviews. Aggregate reviews from IMDB, twitter,...

8 Lots and lots of data. Structured and unstructured.

9 Mission Provide scalable data mining algorithms.

10 Machine Learning?

11

12 Archimedes generates model: Density of Object =. Density of Fluid Weight Weight Apparent immersed weight

13 June 25, 2008 by chase-me

14 March 28, 2007 by dullhunk

15 Machine learning generates model

16 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

17 January 8, 2008 by Pink Sherbet Photography

18 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

19 E-Bay Auction status? Phishing Spam? Different topic Requested password? password

20 Apache One of your mails:... Hadoop London Lucene London

21 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

22

23 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

24 Parameter tuning Penalty for mistakes. Kernel type for data transformation. Tune kernel parameters.

25 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

26 Training Build model from data.

27 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

28 Nature changes? Spammers adapt to spam filters. Users write mails in different styles. Expand to new languages....

29 Machine learning pipeline Gather data. (and meta data). Identify characteristics. Chose right algorithm. Keep model in sync when nature changes. Train on the gathered data. Tune parameters of your algorithm.

30 Introducing Mahout

31 Classification Categorize data. Examples: Identify spam mails. Classify movies as Action, Comedy...

32 Classification Naive bayes. Complementary naive bayes. Winnow/Perceptron Others upcoming.

33 Discovering groups of data Group data by similarity. Examples: News articles by topic. Developers by favorite modules.

34 Discovering groups of data Canopy. PLSI. K-Means. Others upcoming. Dirichlet based.

35 Recommendation mining Recommend items. Examples: Find books a user my like. Identify movies a user likes.

36 Upcoming More algorithms. More examples.

37 What Mahout can do for you Why should I participate?

38 Jumpstart your project with proven code. January 8, 2008 by dreizehn28

39 Discuss with researchers and engineers. November 16, 2005 [phil h]

40 Become a community member.

41 s Thank you to all those making this possible. October 22, 2008 by e_calamar

42 July 9, 2006 by trackrecord We need You: Enthusiasm. Mathematical knowledge. Proficiency in Hadoop. Interest in understanding data.

43 Some advertising Berlin - June* at 5p.m. newthinking store Berlin Tucholskystr. 48 Hadoop** User/Developer Meeting Germany * Exact date is set by speaker that is you! ** Lucene, Tika, Solr, UIMA, Mahout, katta,... people welcome.

Open Source development for students.

Open Source development for students. http://www.flickr.com/photos/inaz/454059437 By Inaz Open Source development for students. Why should I work on free software? Isabel Drost Nighttime: Co-Founder Apache Mahout. Organizer of Berlin Hadoop

More information

Mahout in Action MANNING ROBIN ANIL SEAN OWEN TED DUNNING ELLEN FRIEDMAN. Shelter Island

Mahout in Action MANNING ROBIN ANIL SEAN OWEN TED DUNNING ELLEN FRIEDMAN. Shelter Island Mahout in Action SEAN OWEN ROBIN ANIL TED DUNNING ELLEN FRIEDMAN II MANNING Shelter Island contents preface xvii acknowledgments about this book xx xix about multimedia extras xxiii about the cover illustration

More information

Technology Drives Business. CUSTOM SOLR TOKENIZER FLEXIBLE TOKENIZER WITH JFLEX 2014 BerlinBuzzword

Technology Drives Business. CUSTOM SOLR TOKENIZER FLEXIBLE TOKENIZER WITH JFLEX 2014 BerlinBuzzword Technology Drives Business CUSTOM SOLR TOKENIZER FLEXIBLE TOKENIZER WITH JFLEX 2014 BerlinBuzzword Agenda ME & SHI JFLEX Tokenizer Motivation JFlex?! Solr implementation Demo Q & A ME & SHI Markus Klose

More information

Taming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island

Taming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island Taming Text How to Find, Organize, and Manipulate It GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS 11 MANNING Shelter Island contents foreword xiii preface xiv acknowledgments xvii about this book

More information

Un-moderated real-time news trends extraction from World Wide Web using Apache Mahout

Un-moderated real-time news trends extraction from World Wide Web using Apache Mahout Un-moderated real-time news trends extraction from World Wide Web using Apache Mahout A Project Report Presented to Professor Rakesh Ranjan San Jose State University Spring 2011 By Kalaivanan Durairaj

More information

Text Classification Using Mahout

Text Classification Using Mahout International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume. 1, Issue 5, September 2014, PP 1-5 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Text

More information

Coursework Completion

Coursework Completion Half Term 1 5 th September 12 th September 19 th September 26 th September 3 rd October 10 th October 17 th October Coursework Completion This first half term will be dedicated to ensuring that all students

More information

Family Tree Maker Articles

Family Tree Maker Articles Family Tree Maker Articles Year Month Title 1998 March First Column, Tips 1998 April Merging 1998 May Sources 1998 June Scrapbooks 1998 July Shortcuts and the INI File 1998 August Family Tree Maker, Version

More information

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou The Hadoop Ecosystem EECS 4415 Big Data Systems Tilemachos Pechlivanoglou tipech@eecs.yorku.ca A lot of tools designed to work with Hadoop 2 HDFS, MapReduce Hadoop Distributed File System Core Hadoop component

More information

Introduction to Text Mining. Hongning Wang

Introduction to Text Mining. Hongning Wang Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:

More information

Machine Learning using MapReduce

Machine Learning using MapReduce Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous

More information

Data Intensive Computing SUBTITLE WITH TWO LINES OF TEXT IF NECESSARY PASIG June, 2009

Data Intensive Computing SUBTITLE WITH TWO LINES OF TEXT IF NECESSARY PASIG June, 2009 Data Intensive Computing SUBTITLE WITH TWO LINES OF TEXT IF NECESSARY PASIG June, 2009 Presenter s Name Simon CW See Title & and Division HPC Cloud Computing Sun Microsystems Technology Center Sun Microsystems,

More information

Study and Analysis of Recommendation Systems for Location Based Social Network (LBSN)

Study and Analysis of Recommendation Systems for Location Based Social Network (LBSN) , pp.421-426 http://dx.doi.org/10.14257/astl.2017.147.60 Study and Analysis of Recommendation Systems for Location Based Social Network (LBSN) N. Ganesh 1, K. SaiShirini 1, Ch. AlekhyaSri 1 and Venkata

More information

I'm Charlie Hull, co-founder and Managing Director of Flax. We've been building open source search applications since 2001

I'm Charlie Hull, co-founder and Managing Director of Flax. We've been building open source search applications since 2001 Open Source Search I'm Charlie Hull, co-founder and Managing Director of Flax We've been building open source search applications since 2001 I'm going to tell you why and how you should use open source

More information

Edizioni C&C Srl. The Italian magazine for Classic cars and motorcycles; stylish but inexpensive, slender but rich in content. PRICE 2.

Edizioni C&C Srl. The Italian magazine for Classic cars and motorcycles; stylish but inexpensive, slender but rich in content. PRICE 2. The Italian magazine for Classic cars and motorcycles; stylish but inexpensive, slender but rich in content. epocauto aims to improve understanding and appreciation of the vast and varied world of classic

More information

Click to edit Master title style Click to edit Master title style

Click to edit Master title style Click to edit Master title style Click to edit Master title style Click to edit Master title style Denver Regional Aerial Photography Project 2018: Update Presented by: Ashley Summers August 2018 Agenda Click Click to to edit edit Master

More information

Collective Intelligence in Action

Collective Intelligence in Action Collective Intelligence in Action SATNAM ALAG II MANNING Greenwich (74 w. long.) contents foreword xv preface xvii acknowledgments xix about this book xxi PART 1 GATHERING DATA FOR INTELLIGENCE 1 "1 Understanding

More information

Distributed Itembased Collaborative Filtering with Apache Mahout. Sebastian Schelter twitter.com/sscdotopen. 7.

Distributed Itembased Collaborative Filtering with Apache Mahout. Sebastian Schelter twitter.com/sscdotopen. 7. Distributed Itembased Collaborative Filtering with Apache Mahout Sebastian Schelter ssc@apache.org twitter.com/sscdotopen 7. October 2010 Overview 1. What is Apache Mahout? 2. Introduction to Collaborative

More information

Question Answering Systems

Question Answering Systems Question Answering Systems An Introduction Potsdam, Germany, 14 July 2011 Saeedeh Momtazi Information Systems Group Outline 2 1 Introduction Outline 2 1 Introduction 2 History Outline 2 1 Introduction

More information

Workshops. 1. SIGMM Workshop on Social Media. 2. ACM Workshop on Multimedia and Security

Workshops. 1. SIGMM Workshop on Social Media. 2. ACM Workshop on Multimedia and Security 1. SIGMM Workshop on Social Media SIGMM Workshop on Social Media is a workshop in conjunction with ACM Multimedia 2009. With the growing of user-centric multimedia applications in the recent years, this

More information

Featured Archive. Saturday, February 28, :50:18 PM RSS. Home Interviews Reports Essays Upcoming Transcripts About Black and White Contact

Featured Archive. Saturday, February 28, :50:18 PM RSS. Home Interviews Reports Essays Upcoming Transcripts About Black and White Contact Saturday, February 28, 2009 03:50:18 PM To search, type and hit ente SEARCH RSS Home Interviews Reports Essays Upcoming Transcripts About Black and White Contact SUBSCRIBE TO OUR MAILING LIST First Name:

More information

IMPROVING Sepsis SURVIVAL. Data Portal User Manual version 2.0

IMPROVING Sepsis SURVIVAL. Data Portal User Manual version 2.0 IMPROVING Sepsis SURVIVAL Data Portal User Manual version 2.0 1 Table of Contents Data Portal User Accounts... 3 Logging into the Data Portal... 4 Outcome Data Entry... 5 Outcome Data Due Dates... 5 View

More information

Detecting Malicious URLs. Justin Ma, Lawrence Saul, Stefan Savage, Geoff Voelker. Presented by Gaspar Modelo-Howard September 29, 2010.

Detecting Malicious URLs. Justin Ma, Lawrence Saul, Stefan Savage, Geoff Voelker. Presented by Gaspar Modelo-Howard September 29, 2010. Detecting Malicious URLs Justin Ma, Lawrence Saul, Stefan Savage, Geoff Voelker Presented by Gaspar Modelo-Howard September 29, 2010 Publications Justin Ma, Lawrence K. Saul, Stefan Savage, and Geoffrey

More information

DCBench: a Data Center Benchmark Suite

DCBench: a Data Center Benchmark Suite DCBench: a Data Center Benchmark Suite Zhen Jia ( 贾禛 ) http://prof.ict.ac.cn/zhenjia/ Institute of Computing Technology, Chinese Academy of Sciences workshop in conjunction with CCF October 31,2013,Guilin

More information

SOLUTION TRACK Finding the Needle in a Big Data Innovator & Problem Solver Cloudera

SOLUTION TRACK Finding the Needle in a Big Data Innovator & Problem Solver Cloudera SOLUTION TRACK Finding the Needle in a Big Data Haystack @EvaAndreasson, Innovator & Problem Solver Cloudera Agenda Problem (Solving) Apache Solr + Apache Hadoop et al Real-world examples Q&A Problem Solving

More information

MAP OF OUR REGION. About

MAP OF OUR REGION. About About ABOUT THE GEORGIA BULLETIN The Georgia Bulletin is the Catholic newspaper for the Archdiocese of Atlanta. We cover the northern half of the state of Georgia with the majority of our circulation being

More information

SOFTWARE QUALITY. MADE IN GERMANY.

SOFTWARE QUALITY. MADE IN GERMANY. WHAT IS BEST PRACTICE FOR ACHIEVING ISO26262 COMPLIANCE? MGIGroup, 17.10.2017 SOFTWARE QUALITY. MADE IN GERMANY. SOLUTIONS FOR INTEGRATED QUALITY ASSURANCE OF EMBEDDED SOFTWARE MOTIVATION Simulink/Stateflow

More information

By submitting your content to Pregame magazine, you agree to the Contributor Terms as outlined at

By submitting your content to Pregame magazine, you agree to the Contributor Terms as outlined at Pregame is a magazine-meets-portfolio; a content community of aspirational individuals who are dedicated to maximizing our potential in life and work. Our content centers on real-world success: how we

More information

Knee Surgery Sports Traumatology Arthroscopy

Knee Surgery Sports Traumatology Arthroscopy Advertising Rates 2016 effective October 1st, 2015 Knee Surgery Sports Traumatology Arthroscopy Official Clinical Journal of the European Society of Sports Traumatology, Knee Surgery and Arthroscopy (ESSKA)

More information

Chee Kiam. to sieve through. and the next one. relevant. The advances in Big. (NLB) of Singapore.

Chee Kiam. to sieve through. and the next one. relevant. The advances in Big. (NLB) of Singapore. Submitted on: May 31, 2013 Connecting library content using data mining and text analytics on structured and unstructured dataa Chee Kiam Lim Technology and Innovation, National Library Board, Singapore.

More information

Example. Section: PS 709 Examples of Calculations of Reduced Hours of Work Last Revised: February 2017 Last Reviewed: February 2017 Next Review:

Example. Section: PS 709 Examples of Calculations of Reduced Hours of Work Last Revised: February 2017 Last Reviewed: February 2017 Next Review: Following are three examples of calculations for MCP employees (undefined hours of work) and three examples for MCP office employees. Examples use the data from the table below. For your calculations use

More information

MAP OF OUR REGION. About

MAP OF OUR REGION. About About ABOUT THE GEORGIA BULLETIN The Georgia Bulletin is the Catholic newspaper for the Archdiocese of Atlanta. We cover the northern half of the state of Georgia with the majority of our circulation being

More information

Getting Started with LearnWorlds at NPCT

Getting Started with LearnWorlds at NPCT Getting Started with LearnWorlds at NPCT Elevating Traditional Approaches to Refugee Wellness Welcome! We're so glad you're joining us. This guide is a brief introduction to the LearnWorlds platform, which

More information

If you are reading this then the Jenkins labels and the ways they are applied to the ASF nodes are a mystery to you!

If you are reading this then the Jenkins labels and the ways they are applied to the ASF nodes are a mystery to you! Jenkins node labels If you are reading this then the Jenkins labels and the ways they are applied to the ASF nodes are a mystery to you! This is an attempt to list all nodes and what labels are applied

More information

An Efficient Informal Data Processing Method by Removing Duplicated Data

An Efficient Informal Data Processing Method by Removing Duplicated Data An Efficient Informal Data Processing Method by Removing Duplicated Data Jaejeong Lee 1, Hyeongrak Park and Byoungchul Ahn * Dept. of Computer Engineering, Yeungnam University, Gyeongsan, Korea. *Corresponding

More information

AARNet Network Operations. Network Operations. Customer Alerts & Maintenance NOC. Questnet 2009 Mike Groeneweg. Copyright AARNet Pty Ltd

AARNet Network Operations. Network Operations. Customer Alerts & Maintenance NOC. Questnet 2009 Mike Groeneweg. Copyright AARNet Pty Ltd AARNet Network Operations Customer Alerts & Maintenance Network Operations Questnet Mike Groeneweg NOC 2 1 Network Operations NOC 24 hour phone service (1300 APL NOC or +61 2 9963 3538) E-mail is monitored

More information

Predict the box office of US movies

Predict the box office of US movies Predict the box office of US movies Group members: Hanqing Ma, Jin Sun, Zeyu Zhang 1. Introduction Our task is to predict the box office of the upcoming movies using the properties of the movies, such

More information

Classification. I don t like spam. Spam, Spam, Spam. Information Retrieval

Classification. I don t like spam. Spam, Spam, Spam. Information Retrieval Information Retrieval INFO 4300 / CS 4300! Classification applications in IR Classification! Classification is the task of automatically applying labels to items! Useful for many search-related tasks I

More information

Technical Specifications

Technical Specifications Technical Specifications Online, E-Newsletter & Webinars CONTACT Alex Shikany Vice President - AIA 900 Victors Way, Suite 140 Ann Arbor, Michigan 48108 Tel: 734.994.6088 Fax: 734.994.3338 E-mail: ashikany@robotics.org

More information

Computer Grade 5. Unit: 1, 2 & 3 Total Periods 38 Lab 10 Months: April and May

Computer Grade 5. Unit: 1, 2 & 3 Total Periods 38 Lab 10 Months: April and May Computer Grade 5 1 st Term Unit: 1, 2 & 3 Total Periods 38 Lab 10 Months: April and May Summer Vacation: June, July and August 1 st & 2 nd week Day 1 Day 2 Day 3 Day 4 Day 5 Day 6 First term (April) Week

More information

Competitive Intelligence and Web Mining:

Competitive Intelligence and Web Mining: Competitive Intelligence and Web Mining: Domain Specific Web Spiders American University in Cairo (AUC) CSCE 590: Seminar1 Report Dr. Ahmed Rafea 2 P age Khalid Magdy Salama 3 P age Table of Contents Introduction

More information

Needs Driven Workflow Design

Needs Driven Workflow Design Needs Driven Workflow Design Validation Interpretation DEPLOY Stakeholders Types and levels of analysis determine data, algorithms & parameters, and deployment Visually encode data Overlay data Data Select

More information

Information Retrieval CS6200. Jesse Anderton College of Computer and Information Science Northeastern University

Information Retrieval CS6200. Jesse Anderton College of Computer and Information Science Northeastern University Information Retrieval CS6200 Jesse Anderton College of Computer and Information Science Northeastern University What is Information Retrieval? You have a collection of documents Books, web pages, journal

More information

The Gartner Security Information and Event Management Magic Quadrant 2010: Dealing with Targeted Attacks

The Gartner Security Information and Event Management Magic Quadrant 2010: Dealing with Targeted Attacks The Gartner Security Information and Event Management Magic Quadrant 2010: Dealing with Targeted Attacks Mark Nicolett Notes accompany this presentation. Please select Notes Page view. These materials

More information

Marketing Opportunities

Marketing Opportunities Email Marketing Opportunities Write the important dates and special events for your organization in the spaces below. You can use these entries to plan out your email marketing for the year. January February

More information

Introduction to Automated Text Analysis. bit.ly/poir599

Introduction to Automated Text Analysis. bit.ly/poir599 Introduction to Automated Text Analysis Pablo Barberá School of International Relations University of Southern California pablobarbera.com Lecture materials: bit.ly/poir599 Today 1. Solutions for last

More information

How enterprises can use cyber threat information effectively? Shimon Modi,

How enterprises can use cyber threat information effectively? Shimon Modi, How enterprises can use cyber threat information effectively? Shimon Modi, Ph.D. smodi@trustar.co @shimonmodi About Me 10+ years of Applied R&D experience in Information Security Currently @ TruSTAR Technology

More information

College Algebra. Cartesian Coordinates and Graphs. Dr. Nguyen August 22, Department of Mathematics UK

College Algebra. Cartesian Coordinates and Graphs. Dr. Nguyen August 22, Department of Mathematics UK College Algebra Cartesian Coordinates and Graphs Dr. Nguyen nicholas.nguyen@uky.edu Department of Mathematics UK August 22, 2018 Agenda Welcome x and y-coordinates in the Cartesian plane Graphs and solutions

More information

Phishing Activity Trends Report August, 2006

Phishing Activity Trends Report August, 2006 Phishing Activity Trends Report, 26 Phishing is a form of online identity theft that employs both social engineering and technical subterfuge to steal consumers' personal identity data and financial account

More information

Perceptron Introduction to Machine Learning. Matt Gormley Lecture 5 Jan. 31, 2018

Perceptron Introduction to Machine Learning. Matt Gormley Lecture 5 Jan. 31, 2018 10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Perceptron Matt Gormley Lecture 5 Jan. 31, 2018 1 Q&A Q: We pick the best hyperparameters

More information

Part I: Data Mining Foundations

Part I: Data Mining Foundations Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?

More information

SAP Jam Communities What's New 1808 THE BEST RUN. PUBLIC Document Version: August

SAP Jam Communities What's New 1808 THE BEST RUN. PUBLIC Document Version: August PUBLIC Document Version: August 2018 2018-10-26 2018 SAP SE or an SAP affiliate company. All rights reserved. THE BEST RUN Content 1 Release Highlights....3 1.1 Anonymous access to public communities....4

More information

Web browsing support for cross-community activities

Web browsing support for cross-community activities Web browsing support for cross-community activities Tomohiro Oda Agenda cross-community activity cross-community activity and DynC difficulties in supporting cross-community activities csuite: web browsing

More information

Supporting FRBRization of Web Product Descriptions

Supporting FRBRization of Web Product Descriptions Supporting FRBRization of Web Product Descriptions Naimdjon Takhirov, Fabien Duchateau, Trond Aalberg Department of Computer and Information Science Norwegian University of Science and Technology Theory

More information

Introduction to Data Mining and Data Analytics

Introduction to Data Mining and Data Analytics 1/28/2016 MIST.7060 Data Analytics 1 Introduction to Data Mining and Data Analytics What Are Data Mining and Data Analytics? Data mining is the process of discovering hidden patterns in data, where Patterns

More information

SUN CITY GRAND COMPUTERS APPLE SIG. GraComputers

SUN CITY GRAND COMPUTERS APPLE SIG. GraComputers SUN CITY GRAND COMPUTERS APPLE SIG GraComputers www.grandcomputers.org APPLE SIG THE APPLE SIG MISSION IS TO SERVE MEMBERS OF GRAND COMPUTERS CLUB INTERESTED IN MAC COMPUTERS AND APPLE DEVICES. THE TARGET

More information

Annotating Spatio-Temporal Information in Documents

Annotating Spatio-Temporal Information in Documents Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de

More information

Module Four: Charts and Media Clips

Module Four: Charts and Media Clips Module Four: Charts and Media Clips Charts, sometimes called graphs, are a way to present detailed data to an audience in an easy to understand visual format. Media clips can turn your presentation into

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Brand USA's Next Generation of Digital and Content Strategy

Brand USA's Next Generation of Digital and Content Strategy Brand USA's Next Generation of Digital and Content Strategy Our Speakers Tracy Lanza Vice President Integrated Marketing Mark Lapidus Director Digital Development Talia Salem Manager Web and Content Karyn

More information

Characterization and Modeling of Deleted Questions on Stack Overflow

Characterization and Modeling of Deleted Questions on Stack Overflow Characterization and Modeling of Deleted Questions on Stack Overflow Denzil Correa, Ashish Sureka http://correa.in/ February 16, 2014 Denzil Correa, Ashish Sureka (http://correa.in/) ACM WWW-2014 February

More information

Jumpstarting the Semantic Web

Jumpstarting the Semantic Web Jumpstarting the Semantic Web Mark Watson. Copyright 2003, 2004 Version 0.3 January 14, 2005 This work is licensed under the Creative Commons Attribution-NoDerivs-NonCommercial License. To view a copy

More information

Speaker Packet Workshops & Breakouts

Speaker Packet Workshops & Breakouts 2018 Speaker Packet Workshops & Breakouts JW Marriott San Antonio Hill Country Dear Conference Speaker: Thank you for agreeing to serve as a speaker for the upcoming Innovations in Testing Conference to

More information

This tutorial is designed for all Java enthusiasts who want to learn document type detection and content extraction using Apache Tika.

This tutorial is designed for all Java enthusiasts who want to learn document type detection and content extraction using Apache Tika. About the Tutorial This tutorial provides a basic understanding of Apache Tika library, the file formats it supports, as well as content and metadata extraction using Apache Tika. Audience This tutorial

More information

Japan s Measures against Spam

Japan s Measures against Spam June 22, 2, 2006 Japan s Measures against Spam Yoshichika Imaizumi Telecommunications Bureau, Ministry of Internal Affairs and Communications (MIC), Japan Characteristics of spam in Japan 1.. Media 2004

More information

CHIROPRACTIC MARKETING CENTER

CHIROPRACTIC MARKETING CENTER Marketing Plan Sample Marketing Calendar Here is a sample yearly marketing plan. You should use something similar, but of course add or remove strategies as appropriate for your practice. Letter and advertisement

More information

Prof. Dr. Christian Bizer

Prof. Dr. Christian Bizer STI Summit July 6 th, 2011, Riga, Latvia Global Data Integration and Global Data Mining Prof. Dr. Christian Bizer Freie Universität ität Berlin Germany Outline 1. Topology of the Web of Data What data

More information

Form Identifying. Figure 1 A typical HTML form

Form Identifying. Figure 1 A typical HTML form Table of Contents Form Identifying... 2 1. Introduction... 2 2. Related work... 2 3. Basic elements in an HTML from... 3 4. Logic structure of an HTML form... 4 5. Implementation of Form Identifying...

More information

Tour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers

Tour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers Tour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers James P. Biagioni Piotr M. Szczurek Peter C. Nelson, Ph.D. Abolfazl Mohammadian, Ph.D. Agenda Background

More information

Big Data Analytics CSCI 4030

Big Data Analytics CSCI 4030 High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering data streams SVM Recommen der systems Clustering Community Detection Queries on streams

More information

ECS289: Scalable Machine Learning

ECS289: Scalable Machine Learning ECS289: Scalable Machine Learning Cho-Jui Hsieh UC Davis Sept 24, 2015 Course Information Website: www.stat.ucdavis.edu/~chohsieh/ecs289g_scalableml.html My office: Mathematical Sciences Building (MSB)

More information

Market Trials Review Group. May 2, 2012

Market Trials Review Group. May 2, 2012 Market Trials Review Group May 2, 2012 Section 1 WELCOME & INTRODUCTIONS 4 Agenda 08:00 08:30 Welcome and Introductions Review of Today s Agenda 08:30 09:00 MTRG Objectives and Process 09:00 10:00 High

More information

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How

More information

Published: December 15, 2016 Revised: December 15, 2016

Published: December 15, 2016 Revised: December 15, 2016 Market Participant Guide: SPP 2017 Congestion Hedging Published: December 15, 2016 Revised: December 15, 2016 Revision History Chart Version Revised By Description of Modifications Revision Date 1.0 Congestion

More information

News Article Categorization Team Members: Himay Jesal Desai, Bharat Thatavarti, Aditi Satish Mhapsekar

News Article Categorization Team Members: Himay Jesal Desai, Bharat Thatavarti, Aditi Satish Mhapsekar CS 410 PROJECT REPORT News Article Categorization Team Members: Himay Jesal Desai, Bharat Thatavarti, Aditi Satish Mhapsekar Overview: Our project, News Explorer, is a system that categorizes news articles

More information

December 2017 Marketing & Communications Report

December 2017 Marketing & Communications Report DoorCounty.com - Web Site Visits (Sessions) 2015 84,622 75,713 94,730 120,683 119,876 185,326 212,189 184,422 149,937 108,034 46,080 44,448 1,426,060 2016 63,405 60,289 80,863 101,543 131,388 173,247 201,583

More information

CONTINUING PROFESSIONAL DEVELOPMENT RULES

CONTINUING PROFESSIONAL DEVELOPMENT RULES Independent Objective Authoritative The home for property professionals in Australia Australian Property Institute Limited CONTINUING PROFESSIONAL DEVELOPMENT RULES Reference Continuing Professional Development

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

Mission At Home in the Modern World.

Mission At Home in the Modern World. 2018 Media Kit Mission At Home in the Modern World. 2018 Media Kit Dwell is the guide for living with good design. Dwell s engaged community of over six million consumers trust Dwell to provide the tools

More information

ECS289: Scalable Machine Learning

ECS289: Scalable Machine Learning ECS289: Scalable Machine Learning Cho-Jui Hsieh UC Davis Sept 22, 2016 Course Information Website: http://www.stat.ucdavis.edu/~chohsieh/teaching/ ECS289G_Fall2016/main.html My office: Mathematical Sciences

More information

A Quickie Guide to Using Databases. There s lots of cool stuff you can do with databases, but this will get you started!

A Quickie Guide to Using Databases. There s lots of cool stuff you can do with databases, but this will get you started! A Quickie Guide to Using Databases There s lots of cool stuff you can do with databases, but this will get you started! Revised: March 2012 1. Go to the College home page, click the drop down menu Our

More information

Suicide Prevention: Putting Techniques into Practice and Case Conceptualization Half Day Workshops via Adobe Connect

Suicide Prevention: Putting Techniques into Practice and Case Conceptualization Half Day Workshops via Adobe Connect Suicide Prevention: Putting Techniques into Practice and Case Conceptualization Half Day Workshops via Adobe Connect Presented by the Center for Deployment Psychology for military/dod/gs providers only.

More information

Published: December 15, 2017 Revised: December 15, 2017

Published: December 15, 2017 Revised: December 15, 2017 Market Participant Guide: SPP 2018 Congestion Hedging Published: December 15, 2017 Revised: December 15, 2017 Revision History Chart Version Revised By Description of Modifications Revision Date 1.0 Congestion

More information

Jisc Research Data Discovery Service Project Workshop Christopher Brown

Jisc Research Data Discovery Service Project Workshop Christopher Brown 18 Feb 2016 Jisc Research Data Discovery Service Project Workshop Christopher Brown Agenda» 10:30 10:40 Welcome and Introduction - Catherine Grout» 10:40 10:45 Project status and introduction to workshop/exercise

More information

UNIVERSITY REFERENCING IN GOOGLE DOCS WITH PAPERPILE

UNIVERSITY REFERENCING IN GOOGLE DOCS WITH PAPERPILE Oct 15 UNIVERSITY REFERENCING IN GOOGLE DOCS WITH PAPERPILE By Unknown On Wednesday, October 14, 2015 In Google, Google Docs, Useful Apps With No Comments Many universities and colleges require the use

More information

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG The #1 Challenge in Successfully Deploying a Data Catalog The data cataloging space is relatively new. As a result, many organizations don

More information

Part 1: How Can I Make Next Year s Event More Successful? November 15, 2010 Presenters: Amy Braiterman, Blackbaud Kim Romaszewski, Blackbaud

Part 1: How Can I Make Next Year s Event More Successful? November 15, 2010 Presenters: Amy Braiterman, Blackbaud Kim Romaszewski, Blackbaud Part 1: How Can I Make Next Year s Event More Successful? November 15, 2010 Presenters: Amy Braiterman, Blackbaud Kim Romaszewski, Blackbaud Events Boot Camp Series Events Boot Camp, Part 1: How Can I

More information

Programming Logic and Design Sixth Edition

Programming Logic and Design Sixth Edition Objectives Programming Logic and Design Sixth Edition Chapter 6 Arrays In this chapter, you will learn about: Arrays and how they occupy computer memory Manipulating an array to replace nested decisions

More information

Frequently Asked Questions (FAQ)

Frequently Asked Questions (FAQ) What if this list did not answer my questions? 2017 SmartHealth Wellness Program Frequently Asked Questions (FAQ) 1. Call toll free at 1-855-750-8866 2. Email support@limeade.com 3. For questions about

More information

2016 SmartHealth Wellness Program Frequently Asked Questions (FAQ)

2016 SmartHealth Wellness Program Frequently Asked Questions (FAQ) What if this list did not answer my questions? 2016 SmartHealth Wellness Program Frequently Asked Questions (FAQ) 1. Call toll free at 1-855-750-8866 2. Email support@limeade.com 3. For questions about

More information

Product Versioning and Back Support Policy

Product Versioning and Back Support Policy Effective March 18, 2016 to Feb 1, 2017 Product Versioning and Back Support Policy Definitions Semantic Versioning Date Based Versioning Standard Support Extended Support End of Life Support Critical Security

More information

CERN openlab Communications

CERN openlab Communications CERN openlab Communications CERN openlab III Board of Sponsors 02 April 2009 Mélissa Le Jeune Major New Actions Implemented Mechanisms to: constantly measure the impact of all communications report sponsor

More information

Constructing Triangles Given Sides

Constructing Triangles Given Sides Consider Every Side Constructing Triangles Given Sides 3 WARM UP Use the coordinate plane to determine each distance. Show your work. A y C B E D 0 5 5 1. What is the distance from point F to point D?

More information

New TriCounty Community Portal

New TriCounty Community Portal Welcome to the new Front Page of your Community Portal. The Community portal has been divided into two separate sections: 1) Local Community News 2) Local Schools within the Community LOCAL COMMUNITY NEWS

More information

Data Analytics with HPC. Data Streaming

Data Analytics with HPC. Data Streaming Data Analytics with HPC Data Streaming Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License. http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en_us

More information

Natural Floodplain Functions Alliance Webinar

Natural Floodplain Functions Alliance Webinar Natural Floodplain Functions Alliance Webinar Informing Flood Mitigation with Ecosystem Service Valuation: An Introduction to the Ecosystem Valuation Toolkit Hosted by the Association of State Wetland

More information

We will also have a "what's in" and "what's out" section highlighting changes to those listed in the Favorites page since the previous newsletter.

We will also have a what's in and what's out section highlighting changes to those listed in the Favorites page since the previous newsletter. UK-OSINT www.uk-osint.net & www.ktrs.info Newsletter January 2015 So Why Are We Doing A Newsletter? I thought we should start putting out a regular newsletter for those who have attended any of our multi-day

More information

High Visibility Enforcement TN Grants Tip Sheets

High Visibility Enforcement TN Grants Tip Sheets High Visibility Enforcement TN Grants Tip Sheets Tennessee Highway Safety Office Updated October 26, 2017 High Visibility Enforcement Grant Tip Sheets 1 Table of Contents Claim without Expenses (Zero Quarter

More information

RESOURCE WORLD S # 1 FOR STAMP COLLECTORS

RESOURCE WORLD S # 1 FOR STAMP COLLECTORS 2018 MEDIA KIT WORLD S # 1 RESOURCE FOR STAMP COLLECTORS Linn s Stamp News is the market leader in news and insights for the stamp collecting hobby. Collectors and investors turn to our magazine regularly

More information

Phishing Activity Trends Report October, 2004

Phishing Activity Trends Report October, 2004 Phishing Activity Trends Report October, 2004 Phishing is a form of online identity theft that uses spoofed emails designed to lure recipients to fraudulent websites which attempt to trick them into divulging

More information