Thesis: An Extensible, Self-Tuning, Overlay-Based Infrastructure for Large-Scale Stream Processing and Dissemination Advisor: Ugur Cetintemel

Similar documents
Stavros Nikolaou. 413 Gates Hall URL: Ithaca, NY, 14853, USA Date of Birth: December, 1987

Brian F. Cooper. Distributed systems, digital libraries, and database systems

Architecture and Implementation of a Content-based Data Dissemination System

Adaptive In-Network Query Deployment for Shared Stream Processing Environments

Synergy: Quality of Service Support for Distributed Stream Processing Systems

Chris Vegter University of Northern Colorado Monfort College of Business (970)

Amy Babay April 2018

Ranveer Chandra. Computer Networks, Mobile Computing, Distributed Systems, Sensor Networks, Pervasive Computing

Graduate Student Orientation

Dealing with Overload in Distributed Stream Processing Systems

Gassan Abu Jumra Alsyad

MAGNO QUEIROZ Curriculum Vitae

30 Eleanor Dr., # 908, Ottawa, ON, Canada, K2E 7E5 +1 (613)

Yanif Ahmad. Research Interests. Education. Appointments. Journal, Conference and Workshop Publications

Amy Babay November Doctor of Philosophy in Computer Science September 2018

Matthew Patrick Sopha Clinical Assistant Professor Department of Information Systems, W. P. Carey School of Business, Arizona State University

John Clements Department of Computer Science Cal Poly State University 1 Grand Street San Luis Obispo, CA (805)

MonetDB/DataCell: leveraging the column-store database technology for efficient and scalable stream processing Liarou, E.

MSc in Information Systems

Tuncay Tekle Curriculum Vitæ

Dr. Spencer Sevilla Postdoctoral Researcher, University of Washington

Curriculum vitae , Athens GREECE : Web:

FACULTY PROFILE. Total Experience : Academic : 7 Years and 3 Months. Degree Branch / Specialization College University

Jun Li, Ph.D. School of Computing and Information Sciences Phone:

Network Awareness in Internet-Scale Stream Processing

Columbia University (office) Computer Science Department (mobile) Amsterdam Avenue

Dr. Imran Khan University of Nebraska-Lincoln Marketing (402)

INFN45, Informatics: Business Intelligence, 7.5 credits Informatik: Business Intelligence, 7,5 högskolepoäng Second Cycle / Avancerad nivå

Nomair A. Naeem. Personal Data. Education. Teaching Experience. Course Instructor/Sessional

The CISM Education Plan (updated August 2006)

Jinwei Gu. Ph.D. in Computer Science Dissertation: Measurement, Modeling, and Synthesis of Time-Varying Appearance of Natural

Imani Palmer ipalmer2.web.engr.illinois.edu

Facts and Figures. CEID Department of Computer Engineering & Informatics. December 2015

Software, Systems and Services Development in the Global Environment (GS 3 D)

STUDIES IN DIGITAL SYSTEMS INVESTMENT ON KNOWLEDGE OF DIGITAL SYSTEMS

AC : TEACHING A LABORATORY-BASED IPV6 COURSE IN A DISTANCE EDUCATION ENVIRONMENT

Resume. Techniques. Mail ID: Contact No.: S.No. Position held Organisation From To. AU PG Center, Vizianagaram

Form 18 Heriot-Watt University Graduate/Postgraduate Course Structure and Course Notes Template (RAY) 2009/2010

The course can be taken as part of the MSc Programme in Information Systems, or as a separate course.

Extensible Optimization in Overlay Dissemination Trees

Greg T. Harber Faculty Vita (September 1, August 31, 2013)

Semantic Multicast for Content-based Stream Dissemination

Amir Aminzadeh Gohari

Xinyu Feng. January Ph.D. in Computer Science, Yale University, expected Advisor: Professor Zhong Shao.

West Bengal University of Technology Bachelor of Computer Science and Engineering

Iranian Scholarship Foundation Graduate Scholarship Application Form

ELLIOT ANSHELEVICH. Department of Computer Science Cornell University Upson Hall 5139 Ithaca, NY 14853

Girija J. Narlikar Forbes Avenue girija

Application Procedures: Fulbright English Teaching Assistantship IMPORTANT DATES AND DEADLINES - Application Year

University At Buffalo COURSE OUTLINE. A. Course Title: CSE 487/587 Information Structures

XFlow: Dynamic Dissemination Trees for Extensible In-Network Stream Processing

Establishment of new Department of Informatics, UNIRI and experiences in teaching SE course

Ph.D. with Distinction in Computer Science Advisor: Prof. Micha Sharir Dissertation: Geometric Arrangements: Substructures and Algorithms

School of Engineering and the Built Environment Glasgow Caledonian University

Course Specifications M. Tech. / PG Diploma Programme

Discover Viterbi: Computer Science, Cyber Security & Informatics Programs. Viterbi School of Engineering University of Southern California Fall 2017

Computer Science Technology Department

University of Texas Administration Building Room Guadalupe Street Austin, TX Phone: (479)

Enas El-Sayed Mohammed El-Sharawy

ESHI Diploma THE EUROPEAN SPECIALISATION IN TRANSPLANTATION IMMUNOLOGY, HISTOCOMPATIBILITY & IMMUNOGENETICS

NGTC Course Catalog / Student Handbook PROGRAMS OF STUDY COMPUTER INFORMATION SYSTEMS COMPUTER INFORMATION SYSTEMS

Syllabus Revised 08/21/17

Client Services Procedure Manual

Yi Qiao yqiao

BS523 (with PDP) Postgraduate Diploma in Management. January 2016 January September 2017

MSc Digital Marketing

Faculty Activity Reporting Reference Guide

TEXAS STATE VITA. A. Name: David L. Gibbs Title: Assistant Professor

Master Course in Computer Science Orientation day

Essentials of Freight Forwarding Program

Program Proposal for a Direct Converted Program. BS in COMPUTER SCIENCE

Computer Science Technology Department

Student Handbook Master of Information Systems Management (MISM)

Syllabus Revised 03/05/2015

MSc Management with Project Management. BS524 (with PDP) Postgraduate Diploma in Management. January 2016 January 2021.

Resources and Services Virtualization without Boundaries (ReSerVoir)

Energy efficiency in school networks with ipv6 GREEK IPV6 PILOT

A RESOURCE MANAGEMENT FRAMEWORK FOR INTERACTIVE GRIDS

Rutgers Master Gardener Program of Somerset County Graduating Class of 2019 POSITION DESCRIPTION

ADMINISTRATIVE REGULATION Milwaukee Area Technical College Title: PROFESSIONAL GROWTH CREDIT EVALUATION FOR CERTIFIED POSITIONS

CS 378 (Spring 2003) Linux Kernel Programming. Yongguang Zhang. Copyright 2003, Yongguang Zhang

Zoltan Safar. Education. Awards and Honors. Experience

DR. UMMA HANY. Current Status: Education: Career Summary:

Mohamed Mahmoud Mahmoud Azab. Education: Ongoing research:

2007 Doctor of Philosophy(Ph.D.) in Computer Science University of California, San Diego (UCSD)

I. General regulations

Curriculum Vitae Tim Ophelders last updated: September 12, 2018

Call for Papers for Communication QoS, Reliability and Modeling Symposium

Masters in Software Engineering

Teaching with Technology. Retooling Lecture Capture

Personal Profile. Allied Additional Subordinate Subject studied. Management Information System. Electronic Commerce. Image Processing.

CURRICULUM VITAE. June, 2013

Please consult the Department of Engineering about the Computer Engineering Emphasis.

Data Processing at Scale (CSE 511)

Semi-Structured Data Management (CSE 511)

3 - Welcome. 4 - Your Success Begins Here Microsoft IT Academy. 7 - IBM Training. 8 - VMware Training. 9 - HP and Symantec Training

Microsoft Certified Professional Transcript

Developing a Software Industry with TSP and PSP

Snapshot: Landscape Architecture Laws Continuing Education Requirements

Application-aware Overlay Networks for Data Dissemination

Transcription:

Olga Papaemmanouil Phone: +1 (401) 588-0230 Department of Computer Science Fax: +1 (401) 863-7657 Box 1910, 115 Waterman St, Floor 4 Email: olga@cs.brown.edu Providence, RI, 02912, USA Web: http://www.cs.brown.edu/ olga RESEARCH INTERESTS My main research interests are in the area of experimental systems and lie in the intersection of distributed systems, databases and application-level networking. They encompass large-scale distributed systems, data management, query processing, distributed stream processing, peer-topeer computing, publish/subscribe systems, and optimization of overlay networks. EDUCATION Ph.D., Computer Science Summer 2008 (expected) Providence, RI, USA Thesis: An Extensible, Self-Tuning, Overlay-Based Infrastructure for Large-Scale Stream Processing and Dissemination Advisor: Ugur Cetintemel M.Sc., Computer Science May 2004 Providence, RI, USA Thesis: Semantic Multicast for Content-Based Publish-Subscribe Systems Advisor: Ugur Cetintemel M.Sc., Information Systems June 2001 Athens University of Economics & Business Thesis: Personalization techniques in the context of interactive TV Advisor: George I. Doukidis B.S., Computer Engineering & Informatics June 1999 University of Patras, School of Engineering Patra, Greece 5 year program, Degree with Honors (top 1% of class) RESEARCH EXPERIENCE Graduate Research Assistant 2002 present XPORT (Extensible Profile-driven Overlay Routing Trees): Designed and implemented XPORT, a general-purpose infrastructure that provides the core functionalities of large-scale profile-driven dissemination applications. XPORT can be extended to support diverse profile management logic, stream types, and performance targets and, given these specifications, it automatically creates and optimizes an overlay-based publish-subscribe dissemination tree. Its optimization is driven by metricindependent operations, which can refine the structure of the overlay network [4, 5, 7]. XFlow (Extensible Distributed Stream Processing): Designed and implemented XFlow, an extensible, highly-scalable and adaptive framework for distributing and optimizing stream processing queries. XFlow leverages XPORT s architecture; it creates and optimizes an overlay network of multiple, dynamic, overlay stream dissemination trees. Its optimization process relies on generic, customizable performance metrics, and refines the placement and execution of the operators, the structure of the underlying overlay network, as well as the statistics collection process, to meet the applicationdefined query performance and resource utilization expectations [1].

SemCast (Semantic Multicast): Designed SemCast, a system for content-based data filtering and dissemination over conventional multicast channels. SemCast splits input data streams into multiple pieces, using a cost-model on data contents, rates, and destinations, and spreads the pieces across multiple multicast channels for delivery. Each channel can be implemented as a standard overlay dissemination tree. This approach eliminates the need for content-based filtering and routing at interior nodes of the overlay, thus simplifying and improving the dissemination efficiency [9, 10]. Pulse (Continuous-time stream processing): I was involved in the design of the Pulse framework, which processes continuous queries over continuous-time data models. Pulse translates regular queries to work on continuous-time inputs and reduces computational overhead and latency while meeting error bounds on query results [3]. Borealis (Distributed Stream Processing): Borealis is a distributed stream processing engine that deploys a network of cooperating stream processing engines and distributes query processing across multiple machines. I was involved in the design and implementation of the statistics collection mechanism, which was used by the optimization protocols [6, 8]. Research Intern June-August 2007 Networking Group Hewlett-Packard Labs, Palo Alto Sharing-Aware In-Network Stream Processing: In distributed shared processing environments, runtime reconfigurations of the query deployment must be well-coordinated in order to guarantee strict, and potentially conflicting, QoS expectations of queries that share processing operators. I designed and implemented an adaptive optimization approach that allows nodes to collaborate and make fast operator placement decisions. These decisions resolve any existing QoS violations and conflicts, as well as they respect the resource constraints of the nodes in the system [2]. Research Assistant Fall 1999 Spring 2001 Information Systems Department Athens University of Economics and Business imedia (Intelligent Mediation Environment for Digital Interactive Advertising): The project involved the design and development of an end-to-end solution for the provision of personalized interactive advertisement in Digital TV environments. I participated in the analysis, design and implementation of a profile-based mechanism for targeting advertisements to specific customer groups. ACTIVE (Advertising and Commerce Through Internet in the context of Virtual Enterprise): The project involved the design and development of a global electronic commerce platform. I participated in the design and implementation of a mechanism that combined data mining and collaborative filtering techniques to derive dynamic online recommendations for e-commerce applications [11, 12]. INDUSTRY EXPERIENCE Software Engineer Sept 2000-June 2002 Athens Stock Exchange I participated in the upgrade and optimization of various trade settlement services provided by the Central Securities Depository, as well as in the design and implementation of the stock transactions settlement and clearing system of the Athens Derivatives Exchange. Software Engineer June 1999-September 2000 Vodafone Group, R & D Department I was member of a research team that explored data mining techniques for effective targeted marketing. My involvement included the design and implementation of customer classification tools which were based on clustering algorithms.

TEACHING & MENTORING EXPERIENCE Guest Lecturer Spring 2007 Computer Science: An Integrated Introduction (CS 018): Delivered lectures on Java programming focusing on data structures. Teaching Seminar Fall 2006-Spring 2007 Sheridan Center for Teaching & Learning Teaching Certificate I: Attended a year-long seminar that included lectures, workshops and teaching consultations regarding reflective teaching practices, establishing clear goals in a syllabus, addressing different learning styles, and learning to communicate effectively in the classroom. Advising 2005-2006 Master thesis advisor (Spring-Fall 2005): With Ugur Cetintemel, co-advised Austin Park whose master project was on the implementation of a distributed content-based publish-subscribe system. Research intern supervisor (Summer 2005 & 2006): With Ugur Cetintemel, supervised summer interns Yenel Yildirim, Mohit Gogia and Swati Goyal. Their projects included the implementation of an RSS feed dissemination application, as well as a multiplayer network game, both developed on the XPORT [4,5] extensible overlay-based infrastructure. Graduate Teaching Assistant Spring 2004 Computer Networks (CS 168) : Held office hours to assist students with the course material and weekly homeworks. Mentored two teams on the design and implementation of a TCP-like network protocol. Participated in the grading of homeworks, projects and exams. Teaching Assistant Spring 1996 Computer Engineering & Informatics Dept. University of Patras Programming Languages: Participated in the grading of weekly programming projects. PUBLICATIONS Under Submission 1. O. Papaemmanouil, U. Cetintemel, J. Jannotti. XFlow: Dynamic Dissemination Trees for Extensible In-Network Stream Processing., CS Department. November 2007. Conference & Workshop Papers 2. O. Papaemmanouil, S. Basu, S. Banerjee. Adaptive In-Network Query Deployment for Shared Stream Processing Environments. In Proceedings of 4th International Workshop on Networking Meets Databases (NetDB 08), April 2008. (to appear). (It appears also as Technical Report, HPL-2007-178, HP Labs). 3. Y. Ahmad, O. Papaemmanouil, U. Cetintemel, J. Rogers. Simultaneous Equation Systems for Query Processing on Continuous-Time Data Streams. In Proceedings of the 24th International Conference on Data Engineering (ICDE 08), April 2008. (81/671=12.1% accepted) 4. O. Papaemmanouil, Y. Ahmad, U. Cetintemel, J. Jannotti, Y. Yildirim. Extensible Optimization in Overlay Dissemination Trees. In Proceedings of the 25th ACM Special Interest Group on

Management of Data (SIGMOD 06), June 2006. (58/446=13% accepted) 5. O. Papaemmanouil, Y. Ahmad, U. Cetintemel, J. Jannotti, Y. Yildirim. XPORT: Extensible Profile-driven Overlay Routing Trees. (Demonstration). In Proceedings of the 25th ACM Special Interest Group on Management of Data (SIGMOD 06), June 2006. (24/81=29.6% accepted) 6. Y. Ahmad, B. Berg, U. Cetintemel, M. Humphrey, J. Hwang, A. Maskey, A. Jhingran, O. Papaemmanouil, A. Rasin, N. Tatbul, W. Xing, Y. Xing, S. Zdonik. The Borealis Distributed Stream Processing Engine. In Proceedings of the 2nd International Conference on GeoSensor Networks (GSN 06), Boston, October 2006. (Invited Demonstration) 7. O. Papaemmanouil, Y. Ahmad, U. Cetintemel, J. Jannotti. Application-aware Overlay Networks for Data Dissemination. In Proceedings of the International Workshop on Semantics enabled Networks and Services (SeNS 06) (In conjunction with ICDE 06), Atlanta, April 2006. (Invited Paper) 8. Y. Ahmad, B. Berg, U. Cetintemel, M. Humphrey, J. Hwang, A. Jhingran, A. Maskey, O. Papaemmanouil, A. Rasin. N. Tatbul, W. Xing, Y. Xing, S. Zdonik. Distributed Operation in the Borealis Stream Processing Engine. In Proceedings of the 25th ACM Special Interest Group on Management of Data (SIGMOD 05), June 2005. (24/71=33.8% accepted) Best Demonstration Award 9. O. Papaemmanouil, U. Cetintemel. SemCast: Semantic Multicast for Content-based Data Dissemination. In Proceedings of the 21st International Conference on Data Engineering (ICDE 05), April 2005. (67/521=12.8% accepted) 10. O. Papaemmanouil, U. Cetintemel. Semantic Multicast for Content-Based Stream Dissemination. In Proceedings of the 7th International Workshop on the Web and Databases (WebDB 04) (In conjunction with SIGMOD 04), June 2004. (16/61=26.2% accepted) 11. G. Prassas, K. C. Pramataris, O. Papaemmanouil. Dynamic Recommendations in Internet Retailing. In Proceedings of the 9th European Conference on Information Systems (ECIS 2001), June 2001. ( 30% accepted) Best Paper Award 12. G. Prassas, K. C. Pramataris, O. Papaemmanouil, G. I. Doukidis. A Recommender System for Online Shopping Based on Past Consumer Behavior. In Proceedings of the 14th Bled ecommerce Conference (BLED 01), June 2001. ( 30% accepted) PATENTS O. Papaemmanouil, S. Basu, S. Banerjee. Adaptive Overlays for Shared Stream Processing Environments. U.S. Patent application filed, November 2007. HONORS & AWARDS Best Demonstration Award, SIGMOD 2005 Conference Travel Award,, SIGMOD 2004, ICDE 2005, ICDE 2006 Kanellakis Fellowship from, Fall 2002, Spring 2003 Best Paper Award, 9th European Conference in Information Systems, 2001 Fellowship from Greek Ministry of Education, 2001 Honorary Distinction Award from Technical Chamber of Greece, 2000 Fellowship from Greek Scholarship Foundation, 1998, 1999 Fellowship from Greek Employment Organization, 1997, 1998, 1999

ACADEMIC ACTIVITIES Faculty Search Graduate Committee co-chair, Computer Science Dept.,, 2005 Faculty Search Graduate Committee Member, Computer Science Dept.,, 2004 Graduate Student Recruitment co-head, Computer Science Dept.,, 2004 External Reviewer for ICDE, SIGMOD, VLDB, ICDCS, DEBS Student volunteer: ICDCS 2003, ICDE 2004 STATUS Citizenship: Greek (Resident of USA under F1 VISA) REFERENCES Ugur Cetintemel Associate Professor of Computer Science Phone: +1-401-863-7644 Email: ugur@cs.brown.edu John Jannotti Assistant Professor of Computer Science Phone:+1-401-863-7755 Email: jj@cs.brown.edu Stan Zdonik Professor of Computer Science Phone: +1-401-863-7648 Email: sbz@cs.brown.edu Sujata Banerjee Manager, Networking Group Hewlett-Packard Labs 1501 Page Mill Road, MS 1181 Palo Alto, CA, 94306 Phone:+1-650-857-2137 Email: sujata.banerjee@hp.com Sujoy Basu Senior Researcher, Networking Group Hewlett-Packard Labs 1501 Page Mill Road, MS 1181 Palo Alto, CA, 94306 Phone:+1-650-236-2044 Email: sujoy.basu@hp.com