Applications of HPCC Systems at Clemson University. Amy Apon, PhD Linh Ngo, PhD Michael Payne Big Data Systems Laboratory Clemson University

1 Applications of HPCC Systems at Clemson University Amy Apon, PhD Linh Ngo, PhD Michael Payne Big Data Systems Laboratory Clemson University

2 Clemson Strengths and Opportunities
People: PhD-level faculty & research staff; talented students; significant industry collaborators
Facilities: Palmetto, Top 5 in US Academic Supercomputers (~2000 nodes, 20K cores, 600 GPUs); 100Gb Internet connectivity

3 Big Data Systems Lab Overview
Vision: Perform world-class research on the systems and enabling information technology for advanced data analytics
Research areas: Systems and Architectures; Tools and Operations; Data Analytics and Applications

4 Effect of High Performance Computing on Academic Research Productivity
Motivation: There is a lot of pressure on federal funding.
We propose efficiency as a measure from which to gain insights on return on investment.
We show that locally available HPC has a positive effect on the ability of a university to do research.

5 Text mining of news reports and social media for business intelligence Motivation: Government and business need information about public sentiment. Research: We develop and apply methods to analyze large amounts of textual data to enable inquiry of social and business problems.

6 Shared Computing Resources among Researchers Shared Execution Environment Temporary Local Storage User Privileges Only

7 Linh Ngo, PhD HPCC Systems in a Shared Research Computing Environment

8 Shared Computing Resources among Researchers
Shared Execution Environment; Temporary Local Storage; User Privileges Only
How to provision and configure an HPCC cluster dynamically for research purposes?
Step 1: Configure, install, and deploy HPCC as a non-root user
Step 2: Dynamically provision HPCC cluster in a shared research environment

9 Installation and Configuration of Dependencies
[Diagram: directory tree for dependency installation. Dependencies shown: binutils, ICU, XALAN, APR. Paths shown: /usr, /home, /lib64, /lib, $USER, /opt, /local_scratch, /parallel_scratch]

10 Resolving Non-default Installation Path Conflicts
[Diagram: directory layout contrasting locations that need administrative privileges (etc, init.d, opt, var) with non-administrative locations (home, $USER, local_scratch, parallel_scratch). Directory labels shown: HPCCSystems, lib, log, lock, pid, configmgr, mydafilesrv, hpcc]

11 Non-root Deployment
Remove or relax root-level settings, e.g. is_root
Reduce default configuration settings for resource requirements, depending on resource allocation requests

12 Dynamic Provisioning
[Diagram: numbered provisioning flow (steps 1 to 5) involving user.palmetto.clemson.edu, PBS_NODEFILE, environment.xml, and the components mydafilesrv, mydafile, myeclc, mythor, and myroxie]
Deploy to /local_scratch or /parallel_scratch?
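The provisioning flow above starts from the node list that PBS hands to a job. A minimal sketch of that first step follows, assuming only that PBS_NODEFILE points to a file with one hostname per line, repeated once per allocated core. The function names and the role layout (support components on the first node, Thor workers on the rest) are hypothetical illustrations, not the configuration used on Palmetto, and the sketch stops short of generating environment.xml.

```python
import os


def unique_nodes(nodefile_text):
    """Return hostnames in first-seen order; PBS repeats a host once per core."""
    seen = []
    for line in nodefile_text.splitlines():
        host = line.strip()
        if host and host not in seen:
            seen.append(host)
    return seen


def plan_cluster(nodes):
    """Hypothetical layout: first node hosts support components, the rest run Thor workers."""
    if not nodes:
        raise ValueError("no nodes allocated")
    head, workers = nodes[0], nodes[1:]
    return {
        "support": head,                    # assumed home for mydafilesrv, myeclc, myroxie
        "thor_master": head,
        "thor_workers": workers or [head],  # single-node allocation falls back to the head
    }


def read_nodefile(path=None):
    """Read the node list from an explicit path or from $PBS_NODEFILE."""
    path = path or os.environ.get("PBS_NODEFILE")
    if path is None:
        raise RuntimeError("PBS_NODEFILE is not set; run inside a PBS job")
    with open(path) as handle:
        return handle.read()


# Made-up node names standing in for the contents of $PBS_NODEFILE.
sample = "node001\nnode001\nnode002\nnode003\nnode003\n"
plan = plan_cluster(unique_nodes(sample))
print(plan)
```

Inside a real job, `plan_cluster(unique_nodes(read_nodefile()))` would give the same kind of mapping, which a separate step could then translate into an environment.xml.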

13 Michael Payne Using HPCC Systems to Manage Academic Data LexisNexis Summer 2014 Internship

14 Using HPCC Systems to Manage Academic Data
Research in scholarly data requires academic data from many different sources, which store data under various formats.
Aggregating these sources into a useful and cohesive structure requires a data-intensive approach to preprocessing, integration, and analysis.
HPCC Systems is a platform to streamline this process.

15 Categories of Scholarly Data Research Higher Education Institutions Funding Support High Performance Computing Capability

16 Scholarly Data Description Enrollment (multi-sheet Excel) Financial (multi-sheet Excel) Faculty (multi-sheet Excel) Detailed Award (XML) Federal Funding (tab-delimited) Expenditures (tab-delimited) Institution Patent (Excel) Degrees Conferred (Excel) Funding Support Research Higher Education Institutions Detailed list of articles by discipline with abstracts, references, and disambiguated authors (XML) High Performance Computing Capability Institutional Information with Carnegie's Research Classifications (multi-sheet Excel) Institutions from States with EPSCoR status (tab-delimited) NIH Award Data (CSV/Excel) Top500 Supercomputer affiliated with academic institutions (XML)

17 Examples of Scholarly Data Links PI/Author name Institution name Name similarity Address Name similarity Match name with WoS's Organization-Enhanced name Institution name/ Acknowledgment attributes (2008 onward, automatic)
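The slide names "name similarity" as a linking criterion without saying how it is measured. As a sketch only, the following uses Python's standard-library difflib.SequenceMatcher as a stand-in measure; the normalization rules, the 0.85 threshold, and the function names are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lower-case, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower())
    return " ".join(cleaned.split())


def similarity(a, b):
    """Similarity ratio in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_match(name, candidates, threshold=0.85):
    """Return the most similar candidate, or None if nothing reaches the threshold."""
    if not candidates:
        return None
    score, candidate = max((similarity(name, c), c) for c in candidates)
    return candidate if score >= threshold else None


institutions = ["Clemson University", "Emory University"]
print(best_match("CLEMSON UNIVERSITY.", institutions))
```

A threshold keeps weak matches from being linked automatically; borderline pairs would still need manual review or additional attributes such as address.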

18 Ongoing Work Porting data analytic processes to ECL Applying Machine Learning techniques for article abstract classification

19 Summer 2014 Internship - Logistic Regression for Dense Matrices
LexisNexis Internship, Machine Learning
Manager: Timothy Humphrey
Mentor: Arjuna Chala

20 Logistic Regression
Prediction using continuous and discrete values
No distributional assumptions on the predictors: they may not be normally distributed or linearly related
Relationship between the discrete variable and the predictor is non-linear
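The non-linear relationship mentioned above is usually modeled with the logistic (sigmoid) function, which maps a linear combination of predictors to a probability. The sketch below fits such a model with plain batch gradient descent in pure Python. It is an illustration of the general technique on made-up data, not the ECL-ML implementation described in these slides; the learning rate and epoch count are arbitrary assumptions.

```python
import math


def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(weights, bias, x):
    """Probability that the binary outcome is 1 for predictor vector x."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


def fit(X, y, lr=0.1, epochs=2000):
    """Batch gradient descent on the mean log-loss (illustrative settings)."""
    n, n_features = len(X), len(X[0])
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(weights, bias, xi) - yi
            for j in range(n_features):
                grad_w[j] += err * xi[j]
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


# Toy data: one continuous predictor, binary outcome switching between 2 and 3.
X = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
y = [0, 0, 0, 1, 1, 1]
weights, bias = fit(X, y)
print(predict_proba(weights, bias, [0.0]), predict_proba(weights, bias, [5.0]))
```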

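21 Parallel Block Basic Linear Algebra Subprograms (PB-BLAS)
Matrices can be partitioned
Schemes must be compatible
There are multiple choices!
[Figure: block partitioning of a matrix-vector product. Dimension labels shown for the full product: 4 x 4, 4 x 1, 4 x 1. Block labels shown: 3 x 1, 2 x 3, 2 x 1, 1 x 1, 2 x 1]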
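To make "schemes must be compatible" concrete, here is a small pure-Python sketch of a blocked matrix-vector product: the column partition of the matrix has to match the partition of the vector, while the row partition can be chosen freely. This illustrates the general idea only and does not reproduce the PB-BLAS API; the function names and example values are hypothetical.

```python
def matvec(A, x):
    """Plain (unpartitioned) matrix-vector product."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


def offsets(sizes):
    """Turn block sizes such as [3, 1] into (start, stop) index pairs."""
    pairs, start = [], 0
    for size in sizes:
        pairs.append((start, start + size))
        start += size
    return pairs


def blocked_matvec(A, x, row_sizes, col_sizes):
    """Multiply block by block; col_sizes partitions both A's columns and x."""
    if sum(row_sizes) != len(A) or sum(col_sizes) != len(x):
        raise ValueError("partition sizes must cover the matrix and the vector")
    if any(len(row) != len(x) for row in A):
        raise ValueError("matrix width must equal vector length")
    result = []
    for r0, r1 in offsets(row_sizes):
        acc = [0.0] * (r1 - r0)
        for c0, c1 in offsets(col_sizes):
            block = [row[c0:c1] for row in A[r0:r1]]
            partial = matvec(block, x[c0:c1])  # each block product is independent work
            acc = [a + p for a, p in zip(acc, partial)]
        result.extend(acc)
    return result


A = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
x = [1, 0, 2, 1]
# Rows split 2 + 2, columns split 3 + 1, so x is split 3 + 1 as well.
print(blocked_matvec(A, x, [2, 2], [3, 1]))
```

Different compatible partitions give the same result, which is why there are multiple choices; in a distributed setting the choice mainly affects how work and data movement are spread across nodes.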

22 Machine Learning in ECL
[Chart: Logistic Runtimes, Hard Coded Mapping. Y-axis: time in minutes (top tick 35). Categories: Higgs 1,000; Higgs 10,000; Higgs 100,000. Series: PB-BLAS vs. non PB-BLAS. Slide note: full Higgs dataset is 11,000,000 x 28]

23 Machine Learning in ECL
[Chart: Logistic Runtimes, Auto Mapping. Y-axis: time in minutes (top tick 25). Category: Elsevier 100. Series: PB-BLAS vs. non PB-BLAS. Slide note: full Elsevier dataset is 100,000 x 3,291]

24 Machine Learning in ECL
[Chart: Logistic Runtimes, Auto Mapping. Y-axis: time in minutes (top tick 450). Category: Elsevier 1,000. Series: PB-BLAS vs. non PB-BLAS. Slide note: full Elsevier dataset is 100,000 x 3,291]

25 Project Summary
Logistic Regression code and supporting functions have been documented and merged to the ECL-ML GitHub repository
Auto block vector mapping function for any user who wants to use PB-BLAS
Ready-to-use element-wise multiplication in PB-BLAS
Updated debugging statements that give a clear understanding of errors
Test functions for both block vector mapping functions
Sample code for using logistic regression
Currently working on a K-means implementation that utilizes PB-BLAS

26 Linh Ngo, PhD Alex Herzog, PhD Michael Payne Amy Apon, PhD {lngo, aherzog, mpayne3, Big Data Systems Laboratory Clemson University

HPCC / Spark Integration. Boca Raton Documentation Team

HPCC / Spark Integration. Boca Raton Documentation Team Boca Raton Documentation Team HPCC / Spark Integration Boca Raton Documentation Team Copyright 2018 HPCC Systems. All rights reserved We welcome your comments and feedback about this document via email

More information

EDA Toolkit for Data Scientists

EDA Toolkit for Data Scientists EDA Toolkit for Data Scientists Srini Sivasubramanian, Senior Architect, Cognizant Joe Chambers, Senior Software Engineer, LexisNexis Presented at Big Data Week, Atlanta May 6, 2014 Data Analytics is 90%

More information

Science 2.0 VU Big Science, e-science and E- Infrastructures + Bibliometric Network Analysis

Science 2.0 VU Big Science, e-science and E- Infrastructures + Bibliometric Network Analysis W I S S E N n T E C H N I K n L E I D E N S C H A F T Science 2.0 VU Big Science, e-science and E- Infrastructures + Bibliometric Network Analysis Elisabeth Lex KTI, TU Graz WS 2015/16 u www.tugraz.at

More information

BEST BIG DATA CERTIFICATIONS

BEST BIG DATA CERTIFICATIONS VALIANCE INSIGHTS BIG DATA BEST BIG DATA CERTIFICATIONS email : info@valiancesolutions.com website : www.valiancesolutions.com VALIANCE SOLUTIONS Analytics: Optimizing Certificate Engineer Engineering

More information

Clemson HPC and Cloud Computing

Clemson HPC and Cloud Computing Clemson HPC and Cloud Computing Jill Gemmill, Ph.D. Executive Director Cyberinfrastructure Technology Integration Computing & Information Technology CLEMSON UNIVERSITY 2 About Clemson University South

More information

CRITERIA FOR ACCREDITING COMPUTING PROGRAMS

CRITERIA FOR ACCREDITING COMPUTING PROGRAMS CRITERIA FOR ACCREDITING COMPUTING PROGRAMS Effective for Reviews During the 2014-2015 Accreditation Cycle Incorporates all changes approved by the ABET Board of Directors as of October 26, 2013 Computing

More information

How to Guide. For Personal Users

How to Guide. For Personal Users How to Guide For Personal Users February 2016 Contents Introduction... 2 Features and functions:... 2 Accessing UICollaboratory... 3 Home Page... 3 Homepage Key Features... 3 Collaboration Map... 4 Search

More information

Data Processing at Scale (CSE 511)

Data Processing at Scale (CSE 511) Data Processing at Scale (CSE 511) Note: Below outline is subject to modifications and updates. About this Course Database systems are used to provide convenient access to disk-resident data through efficient

More information

Semi-Structured Data Management (CSE 511)

Semi-Structured Data Management (CSE 511) Semi-Structured Data Management (CSE 511) Note: Below outline is subject to modifications and updates. About this Course Database systems are used to provide convenient access to disk-resident data through

More information

How to Guide. For Personal Users

How to Guide. For Personal Users How to Guide For Personal Users March 2016 Contents Introduction... 2 Features and Functions:... 2 Accessing UICollaboratory... 3 Home Page... 3 Homepage Key Features... 3 Collaboration Map... 4 Search

More information

Pre-Requisites: CS2510. NU Core Designations: AD

Pre-Requisites: CS2510. NU Core Designations: AD DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification

More information

Data Mining: STATISTICA

Data Mining: STATISTICA Outline Data Mining: STATISTICA Prepare the data Classification and regression (C & R, ANN) Clustering Association rules Graphic user interface Prepare the Data Statistica can read from Excel,.txt and

More information

Introducing Microsoft SQL Server 2016 R Services. Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone

Introducing Microsoft SQL Server 2016 R Services. Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone Introducing Microsoft SQL Server 2016 R Services Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone SQL Server 2016: Everything built-in built-in built-in built-in built-in built-in $2,230

More information

2017 Resource Allocations Competition Results

2017 Resource Allocations Competition Results 2017 Resource Allocations Competition Results Table of Contents Executive Summary...3 Computational Resources...5 CPU Allocations...5 GPU Allocations...6 Cloud Allocations...6 Storage Resources...6 Acceptance

More information

Radiation Center Strategic Plan Mission, Vision, Goals and Strategies for Activities and Operations

Radiation Center Strategic Plan Mission, Vision, Goals and Strategies for Activities and Operations Radiation Center Strategic Plan 2012 Mission, Vision, Goals and Strategies for Activities and Operations July 2012 Radiation Center Strategic Plan 2012 Mission, Vision, Goals and Strategies for Activities

More information

IBM Spectrum Scale IO performance

IBM Spectrum Scale IO performance IBM Spectrum Scale 5.0.0 IO performance Silverton Consulting, Inc. StorInt Briefing 2 Introduction High-performance computing (HPC) and scientific computing are in a constant state of transition. Artificial

More information

Outline. Prepare the data Classification and regression Clustering Association rules Graphic user interface

Outline. Prepare the data Classification and regression Clustering Association rules Graphic user interface Data Mining: i STATISTICA Outline Prepare the data Classification and regression Clustering Association rules Graphic user interface 1 Prepare the Data Statistica can read from Excel,.txt and many other

More information

GRADUATE PROGRAMS IN ENTERPRISE AND CLOUD COMPUTING

GRADUATE PROGRAMS IN ENTERPRISE AND CLOUD COMPUTING GRADUATE PROGRAMS IN ENTERPRISE AND CLOUD COMPUTING MASTER OF SCIENCE DOCTORAL DEGREE GRADUATE CERTIFICATES STEVENS.EDU/GRAD-ECC MASTER OF SCIENCE IN Enterprise and Cloud Computing Enterprise and cloud

More information

The IRIS Data Management Center maintains the world s largest system for collecting, archiving and distributing freely available seismological data.

The IRIS Data Management Center maintains the world s largest system for collecting, archiving and distributing freely available seismological data. The IRIS Data Management Center maintains the world s largest system for collecting, archiving and distributing freely available seismological data. DATA Data are open and freely available via the internet

More information

Funded Project Final Survey Report

Funded Project Final Survey Report Funded Project Final Survey Report Principal Investigator: Prof Andrea Goldsmith Project Title: Wireless Sensor Networks Technology for Smart Buildings 1. Project Description: This project sets forth a

More information

SOFTWARE ENGINEERING. Curriculum in Software Engineering. Program Educational Objectives

SOFTWARE ENGINEERING. Curriculum in Software Engineering. Program Educational Objectives Software Engineering 1 SOFTWARE ENGINEERING For the undergraduate curriculum in Software Engineering (http:// www.se.iastate.edu) leading to the degree Bachelor of Science. This curriculum is accredited

More information

Computer Information Systems

Computer Information Systems Computer Information Systems Network Intranet, Local Area Networks (LANs), Wide Area Networks (WANs), Network Segments, Hardware, Software: Development Development Installation Testing Monitoring Maintenance

More information

Scholarly Big Data: Leverage for Science

Scholarly Big Data: Leverage for Science Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA giles@ist.psu.edu http://clgiles.ist.psu.edu Funded in part by NSF, Allen Institute for

More information

Federated XDMoD Requirements

Federated XDMoD Requirements Federated XDMoD Requirements Date Version Person Change 2016-04-08 1.0 draft XMS Team Initial version Summary Definitions Assumptions Data Collection Local XDMoD Installation Module Support Data Federation

More information

JAKUB KOPERWAS, HENRYK RYBINSKI, ŁUKASZ SKONIECZNY Institute of Computer Science, Warsaw University of Technology

JAKUB KOPERWAS, HENRYK RYBINSKI, ŁUKASZ SKONIECZNY Institute of Computer Science, Warsaw University of Technology JAKUB KOPERWAS, HENRYK RYBINSKI, ŁUKASZ SKONIECZNY Institute of Computer Science, Warsaw University of Technology Motivation, goals and key assumptions Main features and functionalities of Omega- Psir

More information

Presented by: Cathy Payne, Applications Analyst Piedmont Technical College

Presented by: Cathy Payne, Applications Analyst Piedmont Technical College Presented by: Cathy Payne, Applications Analyst Piedmont Technical College payne.c@ptc.edu Introduction Using Banner s Financial Aid Self-Service, we can build Snapshots to provide accurate and consistent

More information

Higher Education in Texas: Serving Texas Through Transformational Education, Research, Discovery & Impact

Higher Education in Texas: Serving Texas Through Transformational Education, Research, Discovery & Impact Higher Education in Texas: Serving Texas Through Transformational Education, Research, Discovery & Impact M. Dee Childs, Vice President for Information Technology & Chief Information Officer v Texas A&M

More information

Big Data Specialized Studies

Big Data Specialized Studies Information Technologies Programs Big Data Specialized Studies Accelerate Your Career extension.uci.edu/bigdata Offered in partnership with University of California, Irvine Extension s professional certificate

More information

Using Existing Numerical Libraries on Spark

Using Existing Numerical Libraries on Spark Using Existing Numerical Libraries on Spark Brian Spector Chicago Spark Users Meetup June 24 th, 2015 Experts in numerical algorithms and HPC services How to use existing libraries on Spark Call algorithm

More information

MSc(IT) Program. MSc(IT) Program Educational Objectives (PEO):

MSc(IT) Program. MSc(IT) Program Educational Objectives (PEO): MSc(IT) Program Master of Science (Information Technology) is an intensive program designed for students who wish to pursue a professional career in Information Technology. The courses have been carefully

More information

EGI federated e-infrastructure, a building block for the Open Science Commons

EGI federated e-infrastructure, a building block for the Open Science Commons EGI federated e-infrastructure, a building block for the Open Science Commons Yannick LEGRÉ Director, EGI.eu www.egi.eu EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union

More information

DDN Annual High Performance Computing Trends Survey Reveals Rising Deployment of Flash Tiers & Private/Hybrid Clouds vs.

DDN Annual High Performance Computing Trends Survey Reveals Rising Deployment of Flash Tiers & Private/Hybrid Clouds vs. DDN Annual High Performance Computing Trends Survey Reveals Rising Deployment of Flash Tiers & Private/Hybrid Clouds vs. Public for HPC HPC End Users Cite Mixed I/O as the Most Difficult Performance Challenge

More information

Operations Orchestration 10.x Flow Authoring (OO220)

Operations Orchestration 10.x Flow Authoring (OO220) Operations Orchestration 10.x Flow Authoring (OO220) Education Services course product number H4S75S Course length 4 days Delivery mode Instructor Led Training (ILT) virtual Instructor Led Training (ILT)

More information

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security Bringing OpenStack to the Enterprise An enterprise-class solution ensures you get the required performance, reliability, and security INTRODUCTION Organizations today frequently need to quickly get systems

More information

HPCC Systems ECL and Distributed Machine Learning with the HPCC Systems Platform.

HPCC Systems ECL and Distributed Machine Learning with the HPCC Systems Platform. RED/082311 HPCC Systems ECL and Distributed Machine Learning with the HPCC Systems Platform Big Data and Machine Learning Extracting information from Big Data can be hard! Even understanding the structure

More information

Gauging the User: TESTING THE UX IN AN INSTITUTIONAL REPOSITORY AFTER AN ACADEMIC LIBRARY AND A PUBLISHER COLLABORATE

Gauging the User: TESTING THE UX IN AN INSTITUTIONAL REPOSITORY AFTER AN ACADEMIC LIBRARY AND A PUBLISHER COLLABORATE Gauging the User: TESTING THE UX IN AN INSTITUTIONAL REPOSITORY AFTER AN ACADEMIC LIBRARY AND A PUBLISHER COLLABORATE Laura Spears, PhD Assessment Librarian laura.spears@ufl.edu Chelsea Dinsmore, MLIS

More information

RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE

RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE Luigi Grimaudo (luigi.grimaudo@polito.it) DataBase And Data Mining Research Group (DBDMG) Summary RapidMiner project Strengths

More information

Summary. RapidMiner Project 12/13/2011 RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE

Summary. RapidMiner Project 12/13/2011 RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE Luigi Grimaudo (luigi.grimaudo@polito.it) DataBase And Data Mining Research Group (DBDMG) Summary RapidMiner project Strengths

More information

Integrating Identity Management Aspirations and Issues

Integrating Identity Management Aspirations and Issues Integrating Identity Management Aspirations and Issues James Dalziel Professor of Learning Technology, MAMS CI and Director, Macquarie E-Learning Centre Of Excellence (MELCOE) Macquarie University james@melcoe.mq.edu.au

More information

Stream Processing on IoT Devices using Calvin Framework

Stream Processing on IoT Devices using Calvin Framework Stream Processing on IoT Devices using Calvin Framework by Ameya Nayak A Project Report Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Science Supervised

More information

OPPORTUNITY FOR PLACEMENT AHRC Knowledge Exchange Fellow FuseBox/Wired Sussex

OPPORTUNITY FOR PLACEMENT AHRC Knowledge Exchange Fellow FuseBox/Wired Sussex OPPORTUNITY FOR PLACEMENT AHRC Knowledge Exchange Fellow FuseBox/Wired Sussex The AHRC in partnership with Wired Sussex is looking to recruit a Knowledge Exchange Fellow to be based at the FuseBox, an

More information

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF Conference 2017 The Data Challenges of the LHC Reda Tafirout, TRIUMF Outline LHC Science goals, tools and data Worldwide LHC Computing Grid Collaboration & Scale Key challenges Networking ATLAS experiment

More information

Brown University Libraries Technology Plan

Brown University Libraries Technology Plan Brown University Libraries Technology Plan 2009-2011 Technology Vision Brown University Library creates, develops, promotes, and uses technology to further the Library s mission and strategic directions

More information

SPARC 2 Consultations January-February 2016

SPARC 2 Consultations January-February 2016 SPARC 2 Consultations January-February 2016 1 Outline Introduction to Compute Canada SPARC 2 Consultation Context Capital Deployment Plan Services Plan Access and Allocation Policies (RAC, etc.) Discussion

More information

Predicting Service Outage Using Machine Learning Techniques. HPE Innovation Center

Predicting Service Outage Using Machine Learning Techniques. HPE Innovation Center Predicting Service Outage Using Machine Learning Techniques HPE Innovation Center HPE Innovation Center - Our AI Expertise Sense Learn Comprehend Act Computer Vision Machine Learning Natural Language Processing

More information

Strategic Energy Institute Energy Policy Innovation Center EPICenter

Strategic Energy Institute Energy Policy Innovation Center EPICenter Strategic Energy Institute Energy Policy Innovation Center EPICenter Introduction & Overview Richard A. Simmons, PhD, PE November 28, 2016 Introduce the context for the GT-led energy policy center Key

More information

Demystifying Scopus APIs

Demystifying Scopus APIs 0 Demystifying Scopus APIs Massimiliano Bearzot Customer Consultant South Europe April 17, 2018 1 What You Will Learn Today about Scopus APIs Simplistically, how do Scopus APIs work & why do they matter?

More information

Now, Data Mining Is Within Your Reach

Now, Data Mining Is Within Your Reach Clementine Desktop Specifications Now, Data Mining Is Within Your Reach Data mining delivers significant, measurable value. By uncovering previously unknown patterns and connections in data, data mining

More information

BUCKNELL S SCIENCE DMZ

BUCKNELL S SCIENCE DMZ BUCKNELL S SCIENCE #Bisonet Param Bedi VP for Library and Information Technology Principal Investigator Initial Science Design Process Involving Bucknell faculty researchers Library and Information Technology

More information

New Faculty Orientation Technology Session Research Resources August 2017

New Faculty Orientation Technology Session Research Resources August 2017 New Faculty Orientation Technology Session Research Resources August 2017 MSU Information Technology Support and Design Resources Web Accessibility The MSU community is expected to comply with the university's

More information

Oracle Machine Learning Notebook

Oracle Machine Learning Notebook Oracle Machine Learning Notebook Included in Autonomous Data Warehouse Cloud Charlie Berger, MS Engineering, MBA Sr. Director Product Management, Machine Learning, AI and Cognitive Analytics charlie.berger@oracle.com

More information

INSTITUTE OF INFORMATION TECHNOLOGY UNIVERSITY OF DHAKA

INSTITUTE OF INFORMATION TECHNOLOGY UNIVERSITY OF DHAKA INSTITUTE OF INFORMATION TECHNOLOGY UNIVERSITY OF DHAKA http://www.iit.du.ac.bd/ BACHELOR OF SCIENCE IN SOFTWARE ENGINEERING (BSSE) 1. Institute of Information Technology (IIT) Institute of Information

More information

Data Science Bootcamp Curriculum. NYC Data Science Academy

Data Science Bootcamp Curriculum. NYC Data Science Academy Data Science Bootcamp Curriculum NYC Data Science Academy 100+ hours free, self-paced online course. Access to part-time in-person courses hosted at NYC campus Machine Learning with R and Python Foundations

More information

BOARD OF REGENTS ACADEMIC AFFAIRS COMMITTEE 4 STATE OF IOWA SEPTEMBER 12-13, 2018

BOARD OF REGENTS ACADEMIC AFFAIRS COMMITTEE 4 STATE OF IOWA SEPTEMBER 12-13, 2018 STATE OF IOWA SEPTEMBER 12-13, 2018 REQUEST FOR NEW PROGRAM AT IOWA STATE UNIVERSITY: BACHELOR OF SCIENCE IN CYBER SECURITY ENGINEERING Contact: Rachel Boon Action Requested: Consider approval of the request

More information

Does Research ICT KALRO? Transforming education using ICT

Does Research ICT KALRO? Transforming education using ICT Does Research ICT Matter @ KALRO? What is Our Agenda The Status of Research Productivity and Collaboration of KE Research Institutions Is the research productivity of KARLO visible to the world? Discovery

More information

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research Dr Paul Calleja Director of Research Computing University of Cambridge Global leader in science & technology

More information

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Microsoft SharePoint Server 2013 Plan, Configure & Manage Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that

More information

Identity Management: Setting Context

Identity Management: Setting Context Identity Management: Setting Context Joseph Pato Trusted Systems Lab Hewlett-Packard Laboratories One Cambridge Center Cambridge, MA 02412, USA joe.pato@hp.com Identity Management is the set of processes,

More information

OneUConn IT Service Delivery Vision

OneUConn IT Service Delivery Vision OneUConn IT Service Delivery Vision The University s Academic Vision establishes a foundation and high expectations for excellence in research, teaching, learning, and outreach for all of UConn s campuses.

More information

Gain Greater Productivity in Enterprise Data Mining

Gain Greater Productivity in Enterprise Data Mining Clementine 9.0 Specifications Gain Greater Productivity in Enterprise Data Mining Discover patterns and associations in your organization s data and make decisions that lead to significant, measurable

More information

UCLA RESEARCH INFORMATICS STRATEGIC PLAN Taking Action June, 2013

UCLA RESEARCH INFORMATICS STRATEGIC PLAN Taking Action June, 2013 UCLA RESEARCH INFORMATICS STRATEGIC PLAN Taking Action June, 2013 1 Project Motivation Addressing Research Informatics is among the greatest strategic requirements for UCLA s future research competitiveness

More information

Scientific databases

Scientific databases SCID 305 : Generic Skills in Science Research Scientific databases Suang Udomvaraphunt Academic IT Stang Monkolsuk library and Information Division Faculty of Science Stang Mongkolsuk Library http://stang.sc.mahidol.ac.th

More information

Information Systems and Tech (IST)

Information Systems and Tech (IST) Information Systems and Tech (IST) 1 Information Systems and Tech (IST) Courses IST 101. Introduction to Information Technology. 4 Introduction to information technology concepts and skills. Survey of

More information

Student Handbook Master of Information Systems Management (MISM)

Student Handbook Master of Information Systems Management (MISM) Student Handbook 2018-2019 Master of Information Systems Management (MISM) Table of Contents Contents 1 Masters of Information Systems Management (MISM) Curriculum... 3 1.1 Required Courses... 3 1.2 Analytic

More information

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect BEOP.CTO.TP4 Owner: OCTO Revision: 0001 Approved by: JAT Effective: 08/30/2018 Buchanan & Edwards Proprietary: Printed copies of

More information

Program Proposal for a Direct Converted Program. BS in COMPUTER SCIENCE

Program Proposal for a Direct Converted Program. BS in COMPUTER SCIENCE Program Proposal for a Direct Converted Program BS in COMPUTER SCIENCE Document Page number Curriculum Sheet p. 2 p. -year Roadmap p. p. 5 Two Year Course Schedule p. 6 (2018 2019 AY and 2019 2020 AY)

More information

The Shogun Machine Learning Toolbox

The Shogun Machine Learning Toolbox The Shogun Machine Learning Toolbox heiko.strathmann@gmail.com Europython 2014 July 24, 2014 Outline Overview Machine Learning Features Technical Features Community A bit about me Heiko Strathmann - http://herrstrathmann.de/

More information

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING CS 7265 BIG DATA ANALYTICS INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, PhD Computer Science,

More information

High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research

High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research High Performance Computing Management Philippe Trautmann BDM High Performance Computing Global Education @ Research HPC Market and Trends High Performance Computing: Availability/Sharing is key European

More information

QUALCOMM: Company Overview and Opportunities for Students and Collaboration. Junyi Li Vice President of Technology QUALCOMM

QUALCOMM: Company Overview and Opportunities for Students and Collaboration. Junyi Li Vice President of Technology QUALCOMM QUALCOMM: Company Overview and Opportunities for Students and Collaboration Junyi Li Vice President of Technology QUALCOMM April 14, 2008 Outline 1. General background on Qualcomm 2. Research and Development

More information

AT&T Labs Research Bell Labs/Lucent Technologies Princeton University Rensselaer Polytechnic Institute Rutgers, the State University of New Jersey

AT&T Labs Research Bell Labs/Lucent Technologies Princeton University Rensselaer Polytechnic Institute Rutgers, the State University of New Jersey AT&T Labs Research Bell Labs/Lucent Technologies Princeton University Rensselaer Polytechnic Institute Rutgers, the State University of New Jersey Texas Southern University Texas State University, San

More information
