Research Assistant Professor Texas Tech University Engctr Building Room Broadway, Lubbock, TX 79409

Similar documents
Jun Li, Ph.D. School of Computing and Information Sciences Phone:

OpenSFS Test Cluster Donation. Dr. Nihat Altiparmak, Assistant Professor Computer Engineering & Computer Science Department University of Louisville

Software verification and testing, software evolution, program analysis, and formal methods.

PROFESSIONAL EXPERIENCE

Call for Papers for Communication QoS, Reliability and Modeling Symposium

Collaborative Next Generation Networking

Data Provenance in Distributed Propagator Networks

GraphTrek: Asynchronous Graph Traversal for Property Graph-Based Metadata Management

The CISM Education Plan (updated August 2006)

EDUCATION RESEARCH EXPERIENCE

Xinyu Feng. January Ph.D. in Computer Science, Yale University, expected Advisor: Professor Zhong Shao.

Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments

CURRICULUM VITAE - YANG HU

High Performance Computing on MapReduce Programming Framework

PROFILING BASED REDUCE MEMORY PROVISIONING FOR IMPROVING THE PERFORMANCE IN HADOOP

Dr. Spencer Sevilla Postdoctoral Researcher, University of Washington

Master Course in Computer Science Orientation day

Dissertation: Analysis, Indexing and Visualization of Presentation Videos

Mohamed Mahmoud Mahmoud Azab. Education: Ongoing research:

Graduate Student Orientation

Distributed NoSQL Storage for Extreme-scale System Services

Discover Viterbi: Computer Science, Cyber Security & Informatics Programs. Viterbi School of Engineering University of Southern California Fall 2017

CHAIR Jr, r7~. cou EGE FACULTY CHAIR (if \G,Jl ~ ~k amica~ pc{ 2.-0 I ; <-0 l "?J Approved,G. 'S u,\? Approved. Approved

HAI ZHOU. Evanston, IL Glenview, IL (847) (o) (847) (h)

Amir Aminzadeh Gohari

Dr. Zheng Chang. Personal. Education and Qualifications. Professional Experience

JOYCE JIYOUNG WHANG EDUCATION

ADAPTIVE HANDLING OF 3V S OF BIG DATA TO IMPROVE EFFICIENCY USING HETEROGENEOUS CLUSTERS

Master & Doctor of Philosophy Programs in Computer Science

Imani Palmer ipalmer2.web.engr.illinois.edu

JOYCE JIYOUNG WHANG. June 2008 May 2010: Undergraduate Research Assistant, Department of Computer Science and Engineering, Ewha Womans University.

MAGNO QUEIROZ Curriculum Vitae

Konstantinos Krommydas, Ph.D.

Department of Computer Science

BEST BIG DATA CERTIFICATIONS

Lingxi Xie. Post-doctoral Researcher (supervised by Prof. Alan Yuille) Center for Imaging Science, the Johns Hopkins University

AYAN MONDAL ayan.mondal/

Computer Science and Engineering Ira A. Fulton Schools of Engineering

Ph.D. in Computer Science & Technology, Tsinghua University, Beijing, China, 2007

Dynamic Data Placement Strategy in MapReduce-styled Data Processing Platform Hua-Ci WANG 1,a,*, Cai CHEN 2,b,*, Yi LIANG 3,c

Data Processing at Scale (CSE 511)

Thesis: An Extensible, Self-Tuning, Overlay-Based Infrastructure for Large-Scale Stream Processing and Dissemination Advisor: Ugur Cetintemel

RESUME WEI LI EDUCATION EMPLOYMENT RESEARCH INTERESTS HONORS AND AWARDS

Brian F. Cooper. Distributed systems, digital libraries, and database systems

John Clements Department of Computer Science Cal Poly State University 1 Grand Street San Luis Obispo, CA (805)

Semi-Structured Data Management (CSE 511)

MAPREDUCE FOR BIG DATA PROCESSING BASED ON NETWORK TRAFFIC PERFORMANCE Rajeshwari Adrakatti

QADR with Energy Consumption for DIA in Cloud

Twitter data Analytics using Distributed Computing

Commercial Data Intensive Cloud Computing Architecture: A Decision Support Framework

Degree Branch / Specialization College University CSE SONA COLLEGE OF TECHNOLOGY : ASSISTANT PROFESSOR (SENIOR GRADE) ASSISTANT PROFESSOR

B.S. in Electronic Science & Technology, Qingdao University, Qingdao, China.

DEPARTMENT OF COMPUTER SCIENCE

TEXAS STATE VITA. A. Name: David L. Gibbs Title: Assistant Professor

Yi Qiao yqiao

CURRICULUM VITAE. June, 2013

Global Journal of Engineering Science and Research Management

Big Data Meets HPC: Exploiting HPC Technologies for Accelerating Big Data Processing and Management

The 2018 (14th) International Conference on Data Science (ICDATA)

High Performance File System and I/O Middleware Design for Big Data on HPC Clusters

An Experimental Cloud Resource Broker System for Virtual Application Control with VM Allocation Scheme

Research on Load Balancing in Task Allocation Process in Heterogeneous Hadoop Cluster

Junchen Jiang. 513 N. Neville St. Apt C3, Pittsburgh, PA, URL: junchenj/

IOT ARCHITECT. Certification. IoT Architect

CURRICULUM VITÆ. Naama Kraus B.Sc. in Computer Science and Mathematics, Bar-Ilan University, Cum Laude GPA: 90.

Overview. Audience profile. At course completion. Course Outline. : 20773A: Analyzing Big Data with Microsoft R. Course Outline :: 20773A::

A Brief Introduction to CFINS

Web Security Vulnerabilities: Challenges and Solutions

Network Based Hard/Soft Information Fusion Stochastic Graph Analytics

Project Participants

Research on Heterogeneous Communication Network for Power Distribution Automation

Arbee L.P. Chen ( 陳良弼 )

Yihan Sun RESEARCH INTEREST EDUCATION PUBLICATIONS

Hong-Sheng Zhou. Research Interests. Education. Research Experience

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

D DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi

PUBLICATIONS. Journal Papers

Girija J. Narlikar Forbes Avenue girija

Dong Lu donglu

Resume. Techniques. Mail ID: Contact No.: S.No. Position held Organisation From To. AU PG Center, Vizianagaram

Survey on Incremental MapReduce for Data Mining

3715 McClintock Ave, GER 240, Los Angeles, CA 90089

J I N G H A I R A O. Institute for Software Research School of Computer Science Carnegie Mellon University 5000 Forbes Ave Pittsburgh, PA 15213

University of Illinois at Urbana-Champaign, Urbana, Illinois, U.S.

ASSIUT UNIVERSITY. Faculty of Computers and Information Department of Information Technology. on Technology. IT PH.D. Program.

Analyzing Big Data with Microsoft R

Amy Babay April 2018

CURRICULUM VITAE. Hao MA

Advanced Data Mining And Applications: 9th International Conference, ADMA 2013, Hangzhou, China, December 14-16, 2013, Proceedings, Part I

Analyzing and Improving Load Balancing Algorithm of MooseFS

IOGP. an Incremental Online Graph Partitioning algorithm for distributed graph databases. Dong Dai*, Wei Zhang, Yong Chen

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING

Syllabus Revised 01/03/2018

CSC 111 Introduction to Computer Science (Section C)

Joe Michael Kniss December 2005

MonetDB/DataCell: leveraging the column-store database technology for efficient and scalable stream processing Liarou, E.

Challenges for Data Driven Systems

The Fusion Distributed File System

FOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA *

Opportunities to Integrate Technology Into the Classroom. Presented by:

Transcription:

Dong Dai CONTACT INFORMATION Research Assistant Professor Texas Tech University Engctr Building Room 314 2500 Broadway, Lubbock, TX 79409 Curriculum Vitae Mobile: +1-806-620-6508 Work: +1-806-834-6278 E-mail: dong.dai@ttu.edu WWW: myweb.ttu.edu/ddai/ RESEARCH INTERESTS Data-intensive High Performance Computing: storage system and I/O optimization Data-aware Computing: locality-aware job and workflow scheduling High Performance Data Management: graph-based rich metadata; provenance EDUCATION University of Science and Technology of China, Hefei, China Ph.D., Department of Computer Science and Technology 2008-2013 Thesis: Research and Implementation on Cloud Software Infrastructure B.S., Department of Computer Science and Technology 2002-2006 Major in Computer Science and Technology RESEARCH EXPERIENCE Research Assistant Professor 2016 - Present Department of Computer Science, Texas Tech University Working on data-intensive high performance computing, including parallel file system, graph storage, data-aware job scheduling, and data management system Research Scientist 2015-2016 Department of Computer Science, Texas Tech University Working on graph-based metadata management and I/O performance optimization Post-doctoral Researcher (Joint) 2013-2015 Argonne National Laboratory (ANL) and Texas Tech University (TTU) Supervisors: Dr. Robert Ross (ANL) and Prof. Yong Chen (TTU) Working on HPC parallel file system I/O scheduling and metadata analysis GRANTS [Sole-PI] CRII:CSR:Partitioning Large Graphs in Deep Storage Architecture (Award Id: #1756012) Source: NSF, CSR Duration: April 1, 2018 - March 31, 2020 (Estimated) Amount: $168,000. [Co-PI] SHF:Small:Collaborative Research:Uncovering Vulnerabilities in Parallel File Systems for Reliable High Performance Computing (Award Id: #1718336) Source: NSF, SHF Duration: August 15, 2017 - July 31, 2020 (Estimated) Amount: $233,000 [PI] NSF Student Travel Grant for 2017 IEEE/ACM International Conference on Utility and Cloud Computing (UCC) (Award Id: #1743903) Source: NSF, CNS Duration: September 15, 2017 - August 31, 2018 (Estimated) Amount: $14,000 1 of 6

REPRESENTATIVE PUBLICATIONS [1] Dong Dai, Yong Chen, Philip Carns, John Jenkins, and Robert Ross. Lightweight Provenance Service for High Performance Computing. In Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques (PACT 17), 2017. (acceptance rate: 23%). [2] Dong Dai, Wei Zhang, and Yong Chen. IOGP: An Incremental Online Graph Partitioning Algorithm for Distributed Graph Databases. In Proceedings of the 26th ACM International Symposium on High Performance Parallel and Distributed Computing (HPDC 17), 2017. (acceptance rate: 19%). [3] Dong Dai, Yong Chen, Dries Kimpe, and Robert Ross. Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 14), 2014. (acceptance rate: 82/394=20.8%). [4] Dong Dai, Yong Chen, Dries Kimpe, and Robert Ross. Provenance-Based Object Storage Prediction Scheme for Scientific Big Data Applications. In Proceedings of the 2014 IEEE International Conference on Big Data (Big- Data 14), 2014. (acceptance rate: 49/264=18.6%). [5] Dong Dai, Forrest Sheng Bao, Jiang Zhou, Xuanhua Shi, and Yong Chen. Vectorizing Disk Blocks for Efficient Storage Systems via Deep Learning. International Journal of Parallel Computing (ParCo 18), 2018. [6] Dong Dai, Phil Carns, Robert Ross, John Jenkins, Nicholas Muirhead, and Yong Chen. An Asynchronous Traversal Engine for Graph-Based Rich Metadata Management. International Journal of Parallel Computing (ParCo 16). CONFERENCE/ JOURNAL PUBLICATIONS [7] Dong Dai, Yong Chen, Dries Kimpe, and Robert Ross. Trigger-based Incremental Data Processing with Unified Sync and Async Model. Accepted to appear in Transaction on Cloud Computing (TCC 18) 2018. [8] Wei Zhang, Yong Chen, and Dong Dai. AKIN: A Streaming Graph Partitioning Algorithm for Distributed Graph Storage Systems Accepted to appear in the Proceedings of The 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 18), 2018.. (acceptance rate: 20.8%) [9] Dong Dai, Wei Zhang, and Yong Chen. IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases. In Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 17), 2017. (Short Paper). [10] Jiang Zhou, Wei Xie, Dong Dai, and Yong Chen. Pattern-Directed Replication Scheme for Heterogeneous Object-based Storage. In Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 17), 2017. [11] Chao Wang, Dong Dai, Xi Li, Aili Wang, and Xuehai Zhou. SuperMIC: Analyzing Large Biological Datasets in Bioinformatics with Maximal Information Coefficient. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB 17), 2017. 2 of 6

[12] Dong Dai, Yong Chen, Phil Carns, John Jenkins, Wei Zhang, and Robert Ross. GraphMeta: A Graph-based Engine for Managing Large-Scale HPC Rich Metadata. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER 16), 2016. (acceptance rate: 39/162=24%). [13] Jinrui Cao, Simeng Wang, Dong Dai, Mai Zheng, and Yong Chen. A Generic Framework for Testing Parallel File Systems. In Proceedings of the Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems held in conjunction with SC 16 (PDSW-DISCS 16), 2016. [14] Dong Dai, Forrest Sheng Bao, Jiang Zhou, and Yong Chen. Block2Vec: A Deep Learning Strategy on Mining Block Correlations in Storage Systems. In Proceedings of the 9th International Workshop on Parallel Programming Models and Systems Software for High-End Computing held in conjunction with ICPP 16 (P2S2 16), 2016. [15] Dong Dai, Phil Carns, Robert Ross, John Jenkins, Kyle Blauer, and Yong Chen. GraphTrek: Asynchronous Graph Traversal for Property Graph Based Metadata Management. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER 15), 2015. (acceptance rate: 38/157=24.2%). [16] Dong Dai, Robert Ross, Philip Carns, Dries Kimpe, and Yong Chen. Using Property Graphs for Rich Metadata Management in HPC Systems. In Proceedings of the 9th Parallel Data Storage Workshop held in conjunction with SC 14 (PDSW 14), 2014. [17] Dong Dai, Xuehai Zhou, Dries Kimpe, Robert Ross, and Yong Chen. Domino: An Incremental Computing Framework in Cloud with Eventual Synchronization. In Proceedings of the 23rd ACM International Symposium on High- Performance Parallel and Distributed Computing (HPDC 14), 2014. (Short Paper). [18] Dong Dai, Xi Li, Chao Wang, Junneng Zhang, and Xuehai Zhou. Detecting Associations in Large Dataset on MapReduce. In Proceedings of the 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 13), 2013. [19] Dong Dai, Xi Li, Chao Wang, Mingming Sun, and Xuehai Zhou. Sedna: A Memory Based Key-Value Storage System for Realtime Processing in Cloud. In Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops (CLUSTER WORKSHOPS 12), 2012. [20] Dong Dai, Xuehai Zhou, Feng Yang, and Chao Wang. An Auto-configuration Tool for Heterogeneous Hadoop Cluster. Journal of the Graduate School of Chinese Academy of Sciences, 2012. (Chinese) CONFERENCE POSTERS [21] Neda Tavakoli, Dong Dai, John Jenkins, Philip Carns, Robert Ross, and Yong Chen. A Software-Defined Approach for QoS Control in High-Performance Computing Storage Systems. SC 16, Salt Lake City, UT, 2016. 3 of 6

[22] Dong Dai, Robert Ross, Dounia Khaldi, Yonghong Yan, Matthieu Dorier, Neda Tavakoli, and Yong Chen. Exploiting Locality in Scientific Workflow System: A Cross-Layer Solution. SC 16, Salt Lake City, UT, 2016. [23] Dong Dai, Yong Chen, Dries Kimpe, and Robert Ross. Provenance-Based Prediction Scheme for Object Storage System in HPC. CCGrid 14, Chicago, IL, 2014. [24] Kun Lu, Dong Dai, and Mingming Sun. HDFS+: Concurrent Writes Improvements for HDFS. CCGrid 13, Delft, Netherlands, 2013. [25] Dong Dai, Xi Li, Chao Wang, and Xuehai Zhou. Cloud Based Short Read Mapping Service. CLUSTER 12, Beijing, China, 2012. TEACHING EXPERIENCE Texas Tech University, Lubbock, TX Guest Lecturer 2016 CS4352: Operating Systems (undergraduate level) Prof. Yong Chen Lectures on virtual memory management and paging system Guest Lecturer 2017, 2015 CS5352: Advanced Operating Systems Design Prof. Yong Chen Design and mentor course projects and lab sessions Guest Lecturer 2014 CS5331: Big Data Infrastructure and Data Management Prof. Yong Chen Lectures on big data software including HBase, Bigtable, and Spark; design and mentor course projects University of Science and Technology of China, Hefei, China Teaching Assistant 2012 Parallel Algorithm Prof. Pradip K Srimani (Clemson Univ.) Design and mentor course projects; grade projects and final exam Teaching Assistant 2011 Practical Optimization Algorithm Design Prof. Thomas Weise Mentor course projects; provide office hours; grade final exam Teaching Assistant 2011, 2010 Principles of Computer Organization (undergraduate level) Prof. Xi Li Mentor course projects; provide office hours, grade homework and exam Teaching Assistant 2008, 2009 Distributed System Prof. Qing Ding Lectures on grid computing; lab session; grade projects and exam MENTORING EXPERIENCE Actively involved in mentoring undergraduate/graduate students and course projects. REU Undergraduate Students: Participate REU 15 and mentor one undergraduate student (Nicholas Muirheada) on HPC metadata management system. Outcomes include a poster, a technical report, and a journal publication in ParCo 16. Graduate Students: Mentor two Ph.D. students (Neda Tavakoli and Wei Zhang) on HPC job scheduling and distributed graph storage systems. Outcomes include a 4 of 6

poster (SC 16), a workshop paper (P2S2 16), conference papers (HPDC 17, CLUS- TER 16), and a journal publication (ParCo 18 under major revision). Course Projects: Design projects and mentor six groups of students (18 students in total) for course CS5352 Advanced Operating Systems Design in 2015, 2017. All mentored groups received high marks. Result of a project (with Eswarappa Vidya) was submitted to a top-tier conference (USENIX FAST). PROFESSIONAL SERVICE Grant Panel Service Panelist: National Science Foundation, Computer Systems Research Core Program (CSR): 2017, 2018 Conference Service Program Chair/Co-Chair 1. The 5th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT 18) Big Data and HPC Track 2. The 10th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 17) Poster Program 3. The 1st International Industry/University Workshop on Data-center Automation, Analytic, and Control, held in conjunction with UCC 17 (DAAC 17) Program Committee Member 1. The 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications (TrustCom 18) 2. The National Cyber Security Summit (NCS 18) 3. IEEE International Conference on Big Data (BigData Congress 18) 4. Workshop on Advances in Software and Hardware for Big Data to Knowledge Discovery held in conjunction with BigData 14 (ASH 14) Session Chair 1. The 10th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 17) 2. The 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT 17) 3. International Workshop on Parallel Programming Models and Systems Software for High-End Computing held in conjunction with ICPP 16 (P2S2 16) 4. International Workshop on Data Intensive Scalable Computing Systems held in conjunction with SC 14 (DISCS 14) Editorial Service Guest Editor, Applied Soft Computing (BigData Special Issue) Reviewing Service Reviewer for IEEE Transactions on Parallel and Distributed Systems 2015-2017 Reviewer for IEEE Transactions on Cloud Computing 2017, 2015 Reviewer for IEEE Transactions on Industrial Informatics 2017, 2016 Reviewer for IEEE/ACM International Conference on Utility and Cloud Computing (IEEE/ACM UCC) 2017 Reviewer for Applied Soft Computing 2017, 2015 Reviewer for IEEE International Conference on Parallel and Distributed Systems 2017 5 of 6

Reviewer for IEEE International Conference on Cloud Computing (CLOUD), 2017 Reviewer for IEEE International Conference on Big Data (BigData) 2017, 2016 Reviewer for IEEE COMCOM 2017 Reviewer for International Journal of Parallel Programming 2015 Reviewer for International Journal of High Performance Systems Architecture 2015 Reviewer for International Journal of Parallel Programming, 2015 Reviewer for IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 2014 Subreviewer for IEEE International Conference on Parallel and Distributed Systems 2016 Subreviewer for IEEE International Parallel Distributed Processing Symposium (IPDPS) 2015, 2014 Subreviewer for Euro-Par 17, NPC 17, BDCAT 16, PMAM 15, CLUSTER 14, IC- CCN 14, CCGRID 14, CloudCom 14, BigData 14, PDCN 14, MSST 14, PMAM 14, MTAGS 14, MTAGS 13 Subreviewer for SuperComputing/SC 2013 Poster Session PROFESSIONAL MEMBERSHIPS Institute for Electrical and Electronics Engineers (IEEE), Member, 2012 present Association for Computing Machinery (ACM), Member, 2014 present 6 of 6