Konstantinos Krommydas, Ph.D.


Phone: (+1) Web: w LinkedIn:

EDUCATION

Virginia Polytechnic Institute and State University (Virginia Tech), Department of Computer Science (August 2010 - May 2017): Ph.D. Degree
Thesis: Towards Enhancing Performance, Programmability, and Portability in Heterogeneous Computing
GPA: 3.8/4
Advisor: Dr. Wu-chun Feng

Virginia Polytechnic Institute and State University (Virginia Tech), Department of Computer Science (August 2010 - December 2013): Master's Degree (non-thesis)
GPA: 3.8/4
Advisor: Dr. Wu-chun Feng

University of Thessaly (Greece), Department of Computer and Communications Engineering (September June 2010): B.S. Diploma (5-year program, Thesis)
GPA: 8.24/10 (Top 3% of graduating class)
Thesis: Implementation of the AVS Video Decoder in Reconfigurable Logic
Co-advisors: Dr. Nikolaos Bellas, Dr. Christos D. Antonopoulos

RESEARCH INTERESTS

High performance computing, Systems, Parallel computer architectures, Programming languages, Bioinformatics, Image and video processing

TECHNICAL STRENGTHS

Programming: C, C++, FORTRAN, Java, Perl, Shell scripting, SQL, HTML, PHP, JavaScript, Swift, MIPS assembly
Parallel Programming: OpenMP, MPI, CUDA, OpenCL, Pthreads, MMX/SSE/AVX SIMD extensions
Profiling & Debugging Tools: Intel VTune, NVIDIA Visual Profiler, AMD CodeXL, gprof, Intel Advisor, gdb
Other Software & Tools: MS Visual Studio, LaTeX, SVN, Git, Flex/Bison, MS Office

WORK EXPERIENCE

Intel: Software Engineer, February 2017 - now
o Optimizing high-performance computing (HPC) parallel applications (quantum chemistry, computational cosmology) on Intel platforms (CPU, network fabric, I/O). Research on architectural innovations for future Intel processors based on HPC domain algorithm insights.

Intel: Software Engineering Graduate Intern, August - December 2015
o Extended prior research prototype of GLAF, a visual-programming environment, for domain scientists to automate (parallel) OpenCL code generation. Enabled seamless portability of existing programs to GPU and FPGA platforms and enhanced programmability and functional/performance portability.

Intel: Software Engineering Graduate Intern, May - December 2014
o Designed and implemented aspects of GLAF, an all-encompassing visual programming environment. Key elements are its intuitive programming interface for scientists, researchers and engineers, and its serial and parallel code generation (C, FORTRAN) and optimization back-ends.

Virginia Tech: Graduate Teaching Assistant (CS 4504: Computer Architecture - Spring 2014, Spring 2016 and CS 4234: Parallel Computing - Fall 2016)
o Assisted with teaching (multiple guest lectures), and material creation (lecture presentations, homework problems, programming assignments, semester-long projects, exams). Led review sessions, and did grading.

Virginia Tech: Graduate Research Assistant (Synergy Lab), August 2010 - December 2016 (except periods noted above)
o Extended AVS video decoder optimizations and explored performance trade-offs in large-scale multiprocessors (experimental Intel Many-core Testing Lab). Addressed scalability issues and devised algorithmic remedies to overcome architectural bottlenecks and load imbalance.
o Investigated the suitability and performance of fine-grained, hybrid parallelization approaches for protein sequence alignment (BLAST algorithm) on the GPU. Proposed and prototyped compressive genomics approaches for storage reduction and performance improvements.
o Examined performance and portability aspects of the OpenCL Dwarfs benchmark suite on architecturally diverse, heterogeneous parallel platforms (CPU, APU, GPU, FPGA and Intel MIC). Revamped OpenCL Dwarfs by extending robustness, benchmarks coverage and benchmarking capabilities, and enabling uniformity of cross-platform optimization levels.
o Mapped, optimized, and characterized molecular modeling (electrostatic surface potential calculation) algorithms across modern multi- and many-core architectures. Conducted a quantitative and qualitative study on the performance, programmability and portability trade-offs.
o Proposed and implemented a novel motif-based benchmarking methodology for rapid evaluation of composite/fused heterogeneous architectures. Applied the proposed methodology to a diverse set of candidate platforms and provided projected performance insights.

o Enhanced robustness of our GLAF visual-programming framework to support automatic parallelization and optimization of large-scale FORTRAN legacy applications. Applied on NASA weather modeling and computational fluid dynamics algorithms (NASA-funded project).

University of Thessaly: Undergraduate Researcher, Spring 2009 - Spring 2010
o Algorithmic mapping and performance optimization of the AVS Video decoder on multicore CPUs. Exploited x86 SIMD capabilities and parallelized using macroblock-level dynamic multithreading schemes. Achieved real-time high-definition video decoding.
o Implemented and evaluated AVS video decoder kernels on a Xilinx ML405 FPGA with embedded PowerPC CPU. Created soft core IP for custom motion compensation peripheral and enabled integration within an embedded Xilinx PowerPC-based Linux distribution.

PEER-REVIEWED PUBLICATIONS

Krommydas, K.; Feng, W-C.; Telescoping Architectures: Evaluating Next Generation Heterogeneous Computing, 2016 IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), December 2016

Krommydas, K.; Sasanka, R.; Feng, W-C.; Bridging the FPGA Performance-Programmability Gap via Automatic OpenCL Code Generation and Tuning, 2016 IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), 6-8 July 2016

Krommydas, K.; Feng, W-C.; Antonopoulos, C.D.; Bellas, N.; OpenDwarfs: Characterization of Dwarf-based Benchmarks on Fixed and Reconfigurable Architectures, Journal of Signal Processing Systems, Springer, October 2015

Krommydas, K.; Sasanka, R.; Feng, W-C.; GLAF: A Visual Programming and Auto-tuning Framework for High-performance Computing, 2015 ACM International Conference on Supercomputing (ICPP), 1-4 September 2015

Kalidas, R.; Daga, M.; Krommydas, K.; Feng, W-C.; On the Performance, Energy, and Power of Data-Access Methods in Heterogeneous Computing Systems, 2015 IEEE International Parallel and Distributed Processing Symposium Workshops - High-Performance, Power-Aware Computing (HPPAC), May 2015

Feng, W-C.; Krommydas, K.; Zhang L.; CentroidBLAST: Accelerating Sequence Search via Clustering, 7th International Conference on Bioinformatics and Computational Biology (BICOB), 9-11 March 2015

Krommydas, K.; Feng, W-C.; Owaida, M.; Antonopoulos, C.D.; Bellas, N.; On the Characterization of OpenCL Dwarfs on Fixed and Reconfigurable Platforms, 2014 IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), June 2014 (Best paper finalist - Top 3)

Krommydas, K.; Scogland, T.; Feng W-C.; On the Programmability and Performance of Heterogeneous Architectures, 2013 IEEE International Conference on Parallel and Distributed Systems (ICPADS), December

Krommydas, K.; Antonopoulos, C.D.; Bellas, N.; Feng, W-C.; AVS Video Decoder on Multicore Systems: Optimizations and Tradeoffs, 2011 IEEE International Conference on Multimedia and Expo (ICME), July 2011

Krommydas, K.; Tsoublekas, G.; Antonopoulos, C.D.; Bellas, N.; Mapping and Optimization of the AVS Video Decoder on a High Performance Chip Multiprocessor, 2010 IEEE International Conference on Multimedia and Expo (ICME), July 2010

INVITED TALKS/PRESENTATIONS

GLAF: Parallel Programming for the Masses, NASA Langley Research Center, Hampton, VA, May
Tutorial: Accessing and Using the GLAF Prototype, NASA Langley Research Center, Hampton, VA, September
Performance, Programmability, and Portability in Heterogeneous Computing: A Multi-Faceted Perspective and Case Study, Computer Science Department Seminar - Old Dominion University (ODU), Norfolk, VA, March

HONORS AND AWARDS

ICTAS Doctoral Scholars Program Award, Virginia Tech ( )
Gerondelis Foundation Scholarship, USA (2011)
Technical Chamber of Greece Top Student Award (awarded yearly to the top graduating student of each Polytechnic department), Greece (2010)

PROFESSIONAL SERVICE AND ACTIVITIES

Referee of Publications:
o ACM Transactions on Reconfigurable Technology and Systems (TRETS)
o IEEE Transactions on Very Large Scale Integration Systems (TVLSI)
o ACM Transactions on Multimedia Computing Communications and Applications (TOMCCAP)
o Design Automation Conference (DAC)
o IEEE Access Journal
o International Journal of High Performance Computing Applications (IJHPCA) - Sage
o Journal on Concurrency and Computation: Practice and Experience - Wiley
o IETE Technical Review Journal - Taylor & Francis
o International Workshop on Accelerators and Hybrid Exascale Systems (AsHES) - IEEE IPDPS Workshop (IPDPSW)

o Morgan-Kaufmann Book Proposals Reviewer ("High-performance Parallelism Pearls II - Multicore and Manycore Approaches", "Intel Xeon Phi Processor (Knights Landing) High Performance Programming")

Technical Program Committee/Conference Organization Participation:
o Vice-Chair of Poster Session, 47th International Conference on Parallel Processing (ICPP 2018)
o Technical Program Committee Member, 7th Workshop on Parallel Programming Models (IPDPS 2018 Workshop)

Professional Membership:
o IEEE

Administrative Service:
o President for the Virginia Tech Hellenic Student Association ( )
o Public Relations Officer for the Virginia Tech Hellenic Student Association ( )
o Graduate Student Committee for the Virginia Tech College of Engineering ( )
o Local Organization Committee for the 3rd National Pan-Hellenic Electrical & Computer Engineering Students Conference - University Of Thessaly branch (2009)
o Student Representative at the General Board of the Computer and Communication Engineering Dept. ( )

GLAF: A Visual Programming and Auto- Tuning Framework for Parallel Computing

GLAF: A Visual Programming and Auto- Tuning Framework for Parallel Computing GLAF: A Visual Programming and Auto- Tuning Framework for Parallel Computing Student: Konstantinos Krommydas Collaborator: Dr. Ruchira Sasanka (Intel) Advisor: Dr. Wu-chun Feng Motivation High-performance

More information

Stavros Nikolaou. 413 Gates Hall URL: Ithaca, NY, 14853, USA Date of Birth: December, 1987

Stavros Nikolaou. 413 Gates Hall URL:  Ithaca, NY, 14853, USA Date of Birth: December, 1987 413 Gates Hall +1 6073795409 Department of Computer Science Cornell University email: snikolaou@cs.cornell.edu URL: www.cs.cornell.edu/~snikolaou Ithaca, NY, 14853, USA Date of Birth: December, 1987 Education

More information

Brian F. Cooper. Distributed systems, digital libraries, and database systems

Brian F. Cooper. Distributed systems, digital libraries, and database systems Brian F. Cooper Home Office Internet 2240 Homestead Ct. #206 Stanford University cooperb@stanford.edu Los Altos, CA 94024 Gates 424 http://www.stanford.edu/~cooperb/app/ (408) 730-5543 Stanford, CA 94305

More information

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING MAJOR: DEGREE: COMPUTER SCIENCE MASTER OF SCIENCE (M.S.) CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING The Department of Computer Science offers a Master of Science

More information

HPC code modernization with Intel development tools

HPC code modernization with Intel development tools HPC code modernization with Intel development tools Bayncore, Ltd. Intel HPC Software Workshop Series 2016 HPC Code Modernization for Intel Xeon and Xeon Phi February 17 th 2016, Barcelona Microprocessor

More information

University of Texas Administration Building Room Guadalupe Street Austin, TX Phone: (479)

University of Texas Administration Building Room Guadalupe Street Austin, TX Phone: (479) Islam Akef Ebeid University of Texas Administration Building Room 5.556 http://www.i3akef.com School of Information https://www.linkedin.com/in/i3akef University of Texas at Austin islam.akef@gmail.com

More information

First, the need for parallel processing and the limitations of uniprocessors are introduced.

First, the need for parallel processing and the limitations of uniprocessors are introduced. ECE568: Introduction to Parallel Processing Spring Semester 2015 Professor Ahmed Louri A-Introduction: The need to solve ever more complex problems continues to outpace the ability of today's most powerful

More information

Call for Papers for Communication QoS, Reliability and Modeling Symposium

Call for Papers for Communication QoS, Reliability and Modeling Symposium Call for Papers for Communication QoS, Reliability and Modeling Symposium Scope and Motivation: In modern communication networks, different technologies need to cooperate with each other for end-to-end

More information

EDUCATION RESEARCH EXPERIENCE

EDUCATION RESEARCH EXPERIENCE PERSONAL Name: Mais Nijim Gender: Female Address: 901 walkway, apartment A1 Socorro, NM 87801 Email: mais@cs.nmt.edu Phone: (505)517-0150 (505)650-0400 RESEARCH INTEREST Computer Architecture Storage Systems

More information

Dr. Spencer Sevilla Postdoctoral Researcher, University of Washington

Dr. Spencer Sevilla Postdoctoral Researcher, University of Washington Dr. Spencer Sevilla Postdoctoral Researcher, University of Washington email: sevilla@cs.washington.edu Research Interests Computer networks, Host mobility, Web technology, Content caching Education University

More information

AYAN MONDAL ayan.mondal/

AYAN MONDAL ayan.mondal/ AYAN MONDAL +91-7872773669 mondalayan12@gmail.com, ayanmondal@iitkgp.ac.in http://cse.iitkgp.ac.in/ ayan.mondal/ Present Address Kharagpur - 721302 India Permanent Address Ashrampara (Teachers Colony)

More information

Early Experiences Writing Performance Portable OpenMP 4 Codes

Early Experiences Writing Performance Portable OpenMP 4 Codes Early Experiences Writing Performance Portable OpenMP 4 Codes Verónica G. Vergara Larrea Wayne Joubert M. Graham Lopez Oscar Hernandez Oak Ridge National Laboratory Problem statement APU FPGA neuromorphic

More information

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016 AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING BILL.BRANTLEY@AMD.COM, FELLOW 3 OCTOBER 2016 AMD S VISION FOR EXASCALE COMPUTING EMBRACING HETEROGENEITY CHAMPIONING OPEN SOLUTIONS ENABLING LEADERSHIP

More information

Degree Branch / Specialization College University CSE SONA COLLEGE OF TECHNOLOGY : ASSISTANT PROFESSOR (SENIOR GRADE) ASSISTANT PROFESSOR

Degree Branch / Specialization College University CSE SONA COLLEGE OF TECHNOLOGY : ASSISTANT PROFESSOR (SENIOR GRADE) ASSISTANT PROFESSOR FACULTY PROFILE Name Designation Email ID : NISHA SOMS : ASSISTANT PROFESSOR (SENIOR GRADE) : nishasoms.cse@srit.org Area of Specialization : Wireless Ad Hoc Networks, Mobile Computing Cryptography and

More information

Big Data Systems on Future Hardware. Bingsheng He NUS Computing

Big Data Systems on Future Hardware. Bingsheng He NUS Computing Big Data Systems on Future Hardware Bingsheng He NUS Computing http://www.comp.nus.edu.sg/~hebs/ 1 Outline Challenges for Big Data Systems Why Hardware Matters? Open Challenges Summary 2 3 ANYs in Big

More information

Amy Babay April 2018

Amy Babay April 2018 Amy Babay www.dsn.jhu.edu/~babay 814-528-4205 babay@cs.jhu.edu Education PhD in Computer Science April 2018 PhD Research: Structured overlay networks for a new generation of Internet services, dependable

More information

Expressing Heterogeneous Parallelism in C++ with Intel Threading Building Blocks A full-day tutorial proposal for SC17

Expressing Heterogeneous Parallelism in C++ with Intel Threading Building Blocks A full-day tutorial proposal for SC17 Expressing Heterogeneous Parallelism in C++ with Intel Threading Building Blocks A full-day tutorial proposal for SC17 Tutorial Instructors [James Reinders, Michael J. Voss, Pablo Reble, Rafael Asenjo]

More information

Thesis: An Extensible, Self-Tuning, Overlay-Based Infrastructure for Large-Scale Stream Processing and Dissemination Advisor: Ugur Cetintemel

Thesis: An Extensible, Self-Tuning, Overlay-Based Infrastructure for Large-Scale Stream Processing and Dissemination Advisor: Ugur Cetintemel Olga Papaemmanouil Phone: +1 (401) 588-0230 Department of Computer Science Fax: +1 (401) 863-7657 Box 1910, 115 Waterman St, Floor 4 Email: olga@cs.brown.edu Providence, RI, 02912, USA Web: http://www.cs.brown.edu/

More information

Trends and Challenges in Multicore Programming

Trends and Challenges in Multicore Programming Trends and Challenges in Multicore Programming Eva Burrows Bergen Language Design Laboratory (BLDL) Department of Informatics, University of Bergen Bergen, March 17, 2010 Outline The Roadmap of Multicores

More information

Amir Aminzadeh Gohari

Amir Aminzadeh Gohari Amir Aminzadeh Gohari Contact Information Computer Eng. Phone: (707) 953-2647 E-mail: amirazg@ece.ucsb.edu Santa Barbara, CA 93106 USA Web: http://ece.ucsb.edu/ amirazg Education Sept. 2007 June 2011 Ph.D.,

More information

CS 553: Algorithmic Language Compilers (PLDI) Graduate Students and Super Undergraduates... Logistics. Plan for Today

CS 553: Algorithmic Language Compilers (PLDI) Graduate Students and Super Undergraduates... Logistics. Plan for Today Graduate Students and Super Undergraduates... CS 553: Algorithmic Language Compilers (PLDI) look for other sources of information make decisions, because all research problems are under-specified evaluate

More information

Amy Babay November Doctor of Philosophy in Computer Science September 2018

Amy Babay November Doctor of Philosophy in Computer Science September 2018 Amy Babay www.dsn.jhu.edu/~babay 814-528-4205 babay@cs.jhu.edu Education November 2018 Doctor of Philosophy in Computer Science September 2018 Thesis: Timely, Reliable, and Cost-Effective Internet Transport

More information

THE COMPARISON OF PARALLEL SORTING ALGORITHMS IMPLEMENTED ON DIFFERENT HARDWARE PLATFORMS

THE COMPARISON OF PARALLEL SORTING ALGORITHMS IMPLEMENTED ON DIFFERENT HARDWARE PLATFORMS Computer Science 14 (4) 2013 http://dx.doi.org/10.7494/csci.2013.14.4.679 Dominik Żurek Marcin Pietroń Maciej Wielgosz Kazimierz Wiatr THE COMPARISON OF PARALLEL SORTING ALGORITHMS IMPLEMENTED ON DIFFERENT

More information

Dr. Yassine Hariri CMC Microsystems

Dr. Yassine Hariri CMC Microsystems Dr. Yassine Hariri Hariri@cmc.ca CMC Microsystems 03-26-2013 Agenda MCES Workshop Agenda and Topics Canada s National Design Network and CMC Microsystems Processor Eras: Background and History Single core

More information

Architecture, Programming and Performance of MIC Phi Coprocessor

Architecture, Programming and Performance of MIC Phi Coprocessor Architecture, Programming and Performance of MIC Phi Coprocessor JanuszKowalik, Piotr Arłukowicz Professor (ret), The Boeing Company, Washington, USA Assistant professor, Faculty of Mathematics, Physics

More information

AREAS OF SPECIALIZATION. Digital Image Processing Computer Vision Pattern Recognition Image Retrieval Image Reconstruction Face Recognition

AREAS OF SPECIALIZATION. Digital Image Processing Computer Vision Pattern Recognition Image Retrieval Image Reconstruction Face Recognition Dr. Pooja Sharma (Gold Medalist) UGC (NET and JRF) Assistant Professor Department of Computer Science and Engineering IKGPTU, Main Campus, Kapurthala, Punjab, India. E-mail: dr.poojasharma@ptu.ac.in Mobile:

More information

Trends in HPC (hardware complexity and software challenges)

Trends in HPC (hardware complexity and software challenges) Trends in HPC (hardware complexity and software challenges) Mike Giles Oxford e-research Centre Mathematical Institute MIT seminar March 13th, 2013 Mike Giles (Oxford) HPC Trends March 13th, 2013 1 / 18

More information

Introduction CPS343. Spring Parallel and High Performance Computing. CPS343 (Parallel and HPC) Introduction Spring / 29

Introduction CPS343. Spring Parallel and High Performance Computing. CPS343 (Parallel and HPC) Introduction Spring / 29 Introduction CPS343 Parallel and High Performance Computing Spring 2018 CPS343 (Parallel and HPC) Introduction Spring 2018 1 / 29 Outline 1 Preface Course Details Course Requirements 2 Background Definitions

More information

HPC future trends from a science perspective

HPC future trends from a science perspective HPC future trends from a science perspective Simon McIntosh-Smith University of Bristol HPC Research Group simonm@cs.bris.ac.uk 1 Business as usual? We've all got used to new machines being relatively

More information

ADAM CHLIPALA. Research interests. Education. Refereed conference papers

ADAM CHLIPALA. Research interests. Education. Refereed conference papers 579 Soda Hall Berkeley, CA 94720-1776 USA adamc@cs.berkeley.edu http://www.cs.berkeley.edu/ adamc/ Research interests ADAM CHLIPALA Dependent type systems; interactive theorem proving; type-based programming

More information

International Conference Russian Supercomputing Days. September 25-26, 2017, Moscow

International Conference Russian Supercomputing Days. September 25-26, 2017, Moscow International Conference Russian Supercomputing Days September 25-26, 2017, Moscow International Conference Russian Supercomputing Days Supported by the Russian Foundation for Basic Research Platinum Sponsor

More information

30 Eleanor Dr., # 908, Ottawa, ON, Canada, K2E 7E5 +1 (613)

30 Eleanor Dr., # 908, Ottawa, ON, Canada, K2E 7E5 +1 (613) 30 Eleanor Dr., # 908, Ottawa, ON, Canada, K2E 7E5 +1 (613) 907-6109 http://www.sce.carleton.ca/~ufarooq ufarooq@sce.carleton.ca Umar FAROOQ Research Interests Distributed Systems, Large-Scale Multi-Organizational

More information

FACULTY PROFILE. Total Experience : Academic : 7 Years and 3 Months. Degree Branch / Specialization College University

FACULTY PROFILE. Total Experience : Academic : 7 Years and 3 Months. Degree Branch / Specialization College University FACULTY PROFILE Name Designation Email ID : A.K.Kavitha : Assistant Professor : kavitha.ece@srit.org Area of Specialization : Wireless Communication Total Experience : Academic : 7 Years and 3 Industry

More information

FPGA-based Supercomputing: New Opportunities and Challenges

FPGA-based Supercomputing: New Opportunities and Challenges FPGA-based Supercomputing: New Opportunities and Challenges Naoya Maruyama (RIKEN AICS)* 5 th ADAC Workshop Feb 15, 2018 * Current Main affiliation is Lawrence Livermore National Laboratory SIAM PP18:

More information

CURRICULUM VITÆ. Naama Kraus B.Sc. in Computer Science and Mathematics, Bar-Ilan University, Cum Laude GPA: 90.

CURRICULUM VITÆ. Naama Kraus B.Sc. in Computer Science and Mathematics, Bar-Ilan University, Cum Laude GPA: 90. CURRICULUM VITÆ Naama Kraus naamakraus@gmail.com Personal Information Home Address: 6 Trumpeldor Ave., Haifa, 32582, Israel Phone (Home): +972 4 8328216 Phone (Mobile): +972 55 6644563 Date of Birth: 29-APR-1974

More information

Productive Performance on the Cray XK System Using OpenACC Compilers and Tools

Productive Performance on the Cray XK System Using OpenACC Compilers and Tools Productive Performance on the Cray XK System Using OpenACC Compilers and Tools Luiz DeRose Sr. Principal Engineer Programming Environments Director Cray Inc. 1 The New Generation of Supercomputers Hybrid

More information

Introduction II. Overview

Introduction II. Overview Introduction II Overview Today we will introduce multicore hardware (we will introduce many-core hardware prior to learning OpenCL) We will also consider the relationship between computer hardware and

More information

CURRICULUM VITAE. June, 2013

CURRICULUM VITAE. June, 2013 CURRICULUM VITAE ד"ר אבי סופר Dr. Avi Soffer June, 2013 ORT Braude College, Department of Software Engineering, P.O. Box 78, Karmiel 2161002, Israel Telephone: +972-4-990-1720 Email: asoffer@braude.ac.il

More information

Hybrid KAUST Many Cores and OpenACC. Alain Clo - KAUST Research Computing Saber Feki KAUST Supercomputing Lab Florent Lebeau - CAPS

Hybrid KAUST Many Cores and OpenACC. Alain Clo - KAUST Research Computing Saber Feki KAUST Supercomputing Lab Florent Lebeau - CAPS + Hybrid Computing @ KAUST Many Cores and OpenACC Alain Clo - KAUST Research Computing Saber Feki KAUST Supercomputing Lab Florent Lebeau - CAPS + Agenda Hybrid Computing n Hybrid Computing n From Multi-Physics

More information

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE (NON-THESIS OPTION)

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE (NON-THESIS OPTION) Master of Science (M.S.) Major in Computer Science (Non-thesis Option) 1 MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE (NON-THESIS OPTION) Major Program The Master of Science (M.S.) degree with a

More information

Chris Vegter University of Northern Colorado Monfort College of Business (970)

Chris Vegter University of Northern Colorado Monfort College of Business (970) Chris Vegter University of Northern Colorado Monfort College of Business (970) 351-1244 Email: chris.vegter@unco.edu Education MBA, Colorado State University, 2006. Area of Study: Business Administration

More information

Overview. CS 472 Concurrent & Parallel Programming University of Evansville

Overview. CS 472 Concurrent & Parallel Programming University of Evansville Overview CS 472 Concurrent & Parallel Programming University of Evansville Selection of slides from CIS 410/510 Introduction to Parallel Computing Department of Computer and Information Science, University

More information

Programming Models for Multi- Threading. Brian Marshall, Advanced Research Computing

Programming Models for Multi- Threading. Brian Marshall, Advanced Research Computing Programming Models for Multi- Threading Brian Marshall, Advanced Research Computing Why Do Parallel Computing? Limits of single CPU computing performance available memory I/O rates Parallel computing allows

More information

Module Catalog M.Sc. Computational Science CS-M-F

Module Catalog M.Sc. Computational Science CS-M-F Module Catalog M.Sc. Computational Science CS-M-F 1. Module title: CS-M-F: Specialization 2. Field / responsibility of: Physics / department, Dean of Studies 3. Module contents: Investigating the current

More information

Nomair A. Naeem. Personal Data. Education. Teaching Experience. Course Instructor/Sessional

Nomair A. Naeem. Personal Data. Education. Teaching Experience. Course Instructor/Sessional Nomair A. Naeem Personal Data Address: E-mail: Web Page: David R. Cheriton School of Computer Science University of Waterloo 200 University Avenue West Waterloo, ON N2L 3G1, Canada nanaeem@uwaterloo.ca

More information

General introduction: GPUs and the realm of parallel architectures

General introduction: GPUs and the realm of parallel architectures General introduction: GPUs and the realm of parallel architectures GPU Computing Training August 17-19 th 2015 Jan Lemeire (jan.lemeire@vub.ac.be) Graduated as Engineer in 1994 at VUB Worked for 4 years

More information

Parallel Programming. Michael Gerndt Technische Universität München

Parallel Programming. Michael Gerndt Technische Universität München Parallel Programming Michael Gerndt Technische Universität München gerndt@in.tum.de Contents 1. Introduction 2. Parallel architectures 3. Parallel applications 4. Parallelization approach 5. OpenMP 6.

More information

Discover Viterbi: Computer Science, Cyber Security & Informatics Programs. Viterbi School of Engineering University of Southern California Fall 2017

Discover Viterbi: Computer Science, Cyber Security & Informatics Programs. Viterbi School of Engineering University of Southern California Fall 2017 Discover Viterbi: Computer Science, Cyber Security & Informatics Programs Viterbi School of Engineering University of Southern California Fall 2017 WebEx Quick Facts Will I be able to get a copy of the

More information

The Heterogeneous Programming Jungle. Service d Expérimentation et de développement Centre Inria Bordeaux Sud-Ouest

The Heterogeneous Programming Jungle. Service d Expérimentation et de développement Centre Inria Bordeaux Sud-Ouest The Heterogeneous Programming Jungle Service d Expérimentation et de développement Centre Inria Bordeaux Sud-Ouest June 19, 2012 Outline 1. Introduction 2. Heterogeneous System Zoo 3. Similarities 4. Programming

More information

Hana' Ahmad Al-Theiabat

Hana' Ahmad Al-Theiabat Hana' Ahmad Al-Theiabat Citizenship : Jordanian Date of birth : 23 December 1990 Contact Tel: +962-788057098 e-mail: hanatheabat@gmail.com Address Ramtha, Irbid, Jordan Education Bachelor's degree in:

More information

PORTING CP2K TO THE INTEL XEON PHI. ARCHER Technical Forum, Wed 30 th July Iain Bethune

PORTING CP2K TO THE INTEL XEON PHI. ARCHER Technical Forum, Wed 30 th July Iain Bethune PORTING CP2K TO THE INTEL XEON PHI ARCHER Technical Forum, Wed 30 th July Iain Bethune (ibethune@epcc.ed.ac.uk) Outline Xeon Phi Overview Porting CP2K to Xeon Phi Performance Results Lessons Learned Further

More information

HIGH PERFORMANCE COMPUTING (PLATFORMS) SECURITY AND OPERATIONS

HIGH PERFORMANCE COMPUTING (PLATFORMS) SECURITY AND OPERATIONS HIGH PERFORMANCE COMPUTING (PLATFORMS) SECURITY AND OPERATIONS AT PITT Kim F. Wong Center for Research Computing SAC-PA, June 22, 2017 Our service The mission of the Center for Research Computing is to

More information

How to Write Fast Code , spring st Lecture, Jan. 14 th

How to Write Fast Code , spring st Lecture, Jan. 14 th How to Write Fast Code 18-645, spring 2008 1 st Lecture, Jan. 14 th Instructor: Markus Püschel TAs: Srinivas Chellappa (Vas) and Frédéric de Mesmay (Fred) Today Motivation and idea behind this course Technicalities

More information

PHYSICS. Chair: David Baker. Faculty: Andra Petrean, David Whelan Visiting Faculty: Vince Gregoric Emeriti: Larry Robinson, Donald Salisbury

PHYSICS. Chair: David Baker. Faculty: Andra Petrean, David Whelan Visiting Faculty: Vince Gregoric Emeriti: Larry Robinson, Donald Salisbury PHYSICS Chair: David Baker Faculty: Andra Petrean, David Whelan Visiting Faculty: Vince Gregoric Emeriti: Larry Robinson, Donald Salisbury The mission of the Physics Department at Austin College is to

More information
