EUROSTAT and BIG DATA. High Level Seminar on integrating non traditional data sources in the National Statistical Systems

EUROSTAT and BIG DATA. High Level Seminar on integrating non traditional data sources in the National Statistical Systems. Santiago, Chile, October 1-2, 2018

Table of contents
- About Eurostat
- What is the European Statistical System (ESS)
- Big data
- Implications of big data for official statistics
- Big data action plan and roadmap for European official statistics & big data pilots

What is Eurostat?
- The statistical office of the European Union
- Mission: to provide high-quality statistics for Europe
- A Directorate-General (DG) of the European Commission
- Member of the ESS

Responsibilities of Eurostat
- Eurostat defines harmonised methodologies in collaboration with Member States so that data are comparable
- Eurostat receives, treats and publishes data on the EU, euro area, Member States and EFTA
- Eurostat disseminates statistics

Basic facts
- Based in Luxembourg
- Some 800 staff: officials, national experts, contractual staff
- Director-General: Mariana Kotzeva
- Created in 1953 as a service of the High Authority for Coal and Steel
- A long evolution since then

The European Statistical System (ESS)
- A partnership of all National Statistical Institutes (NSIs), other national statistical authorities (ONAs) and Eurostat
- Was built up gradually to provide comparable statistics at EU level
- The NSIs are the coordinators of the national statistical systems in Member States, including Other National Authorities

European Statistical System Committee - ESSC
- Established in 2009
- Provides "professional guidance to the ESS for developing, producing and disseminating European statistics"
- Composed of Heads of Member States' NSIs, chaired by the Commission (Director-General of Eurostat)
- Meets four times a year

European Statistics Code of Practice
- New and improved!
- 16 principles on institutional environment, production & dissemination
- Among them: professional independence; commitment to quality; impartiality and objectivity
- Peer reviews to monitor implementation of the Code

ESGAB
- Established in March 2008
- An independent overview of the ESS as regards the implementation of the Code of Practice
- Opinions
- Annual report to the European Parliament and the Council

ESAC
- Established in March 2008
- 24 members representing users, respondents and others
- Makes sure that user requirements and response burden are taken into account when drawing up statistical programmes
- Gives its view on the balance (priorities and resources) between different areas of the multiannual statistical programme

Big Data
- Can we use Big Data for official statistics?
- "Without data, you're just another person with an opinion." W. Edwards Deming

Describing big data
- Data deluge
- High Volume
- High Velocity
- High Variety (Variability)
- Veracity (Visualisation)
- Value ( )
http://www.theblindelephant.com/the_blind_elephant_fable.html

Photo sources: Associated Press; REUTERS/Kimimasa Mayama CRB/DY, http://pictures.reuters.com/archive/pope- RP6DRMRPZFAB.html

The data deluge (1/3)
1. Human-sourced information
- Social Networks (Facebook, Twitter, LinkedIn, Pinterest, ...)
- Blogs and posted comments
- Pictures (Instagram, Facebook, ...)
- Videos (Youtube, ...)
- Search engine queries
- Mobile data content (SMS, ...)
- User-generated maps
- E-Mails
https://statswiki.unece.org/display/bigdata/classification+of+types+of+big+data
The Economist, March 2010, "The Data Deluge". Source: http://www.economist.com/node/15579717

The data deluge (2/3)
2. Business systems (process-mediated data/transactions)
- Commercial transactions
- Banking/stock prices records
- E-commerce
- Telephone Call Detail Records
- Credit cards
- Medical records from Public Health
https://statswiki.unece.org/display/bigdata/classification+of+types+of+big+data

The data deluge (3/3)
3. Machine generated data (Internet of Things - IoT)
- Sensor data
  - Weather/pollution sensors
  - Traffic sensors/webcam
  - Security/surveillance videos/images
- Tracking devices
  - GPS systems
  - Mobile phone location
  - Satellite images
- Data from computer systems
  - Logs & Web logs
https://statswiki.unece.org/display/bigdata/classification+of+types+of+big+data

Implications of big data for official statistics
- Data-driven economy
- Official statistics is no longer an "almost" statistical monopoly
- Scheveningen Memorandum on big data and official statistics (2013): the Directors-General of the NSIs acknowledge that the use of big data in the context of official statistics requires new developments in methodology, quality assessment and IT-related issues. The European Statistical System should make a special effort to support these developments.

Implications of big data for official statistics
- Change of paradigm
  - From: finite population sampling methodology. To: additional statistical modelling and machine learning
  - From: designers of data collection processes. To: designers of statistical products
- Privacy
  - Use of digital footprint
  - Data subject has no control of data
  - High data detail and insight from analytics
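To make the change of paradigm concrete, here is a minimal sketch contrasting a design-based estimate with a model-based one. It is an illustration only and not taken from the presentation: the function names, the toy numbers and the choice of a simple ratio model (y proportional to an auxiliary variable x known for the whole population) are all assumptions.

```python
def horvitz_thompson_total(sample_values, inclusion_probs):
    """Design-based: weight each sampled value by 1 / inclusion probability."""
    return sum(y / pi for y, pi in zip(sample_values, inclusion_probs))


def ratio_model_total(sample_y, sample_x, population_x_total):
    """Model-based (assumed ratio model y = beta * x): estimate beta from the
    observed units, then predict the population total from the known total of x."""
    beta = sum(sample_y) / sum(sample_x)
    return beta * population_x_total


# Toy data (hypothetical): three observed units.
sample_y = [10, 20, 30]            # variable of interest
inclusion_probs = [0.5, 0.25, 0.25]  # known only under a designed sample
sample_x = [1, 2, 3]               # auxiliary variable, e.g. from a big data source
population_x_total = 25            # auxiliary total assumed known for the population

ht_total = horvitz_thompson_total(sample_y, inclusion_probs)
model_total = ratio_model_total(sample_y, sample_x, population_x_total)
print(ht_total, model_total)
```

The design-based estimator relies on known inclusion probabilities, which a big data source usually does not provide; the model-based estimator instead relies on the assumed relationship between y and x holding for the unobserved units.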

Official statistics
Image: census-taking relief ("Altar of Domitius Ahenobarbus"), Rome, Italy, ca. 100 B.C.E.
Some things have not changed:
- Mandatory census
- Census is official
- The census operation has not changed fundamentally for 2000 years
- It's all about registering and counting people
Some others have changed:
- People had to go to the censor; now the censors go to the people
- Not everyone was counted, only Roman citizens; now everyone counts regardless of social class, gender or age
- The census served the king; now it serves society and the democratic process

Scheveningen Memorandum on Big Data, 09/2013
- Examine the potential of Big Data sources for official statistics
- Official Statistics Big Data strategy as part of wider government strategy
- Address privacy and data protection
- Collaboration at European and global level
- Address need for skills
- Partnerships between different stakeholders (government, academics, private sector)
- Developments in methodology, quality assessment and IT
- Adopt action plan and roadmap for the European Statistical System
https://ec.europa.eu/eurostat/cros/news/scheveningen-memorandum-big-data-and-official-statistics-adopted-essc_en

ESS big data action plan
- Governance
- Policy
- Quality
- Skills
- Experience sharing
- Legislation
- IT Infrastructures
- Methods
- Ethics / Communication
- Big data sources
- Pilots

Challenges:
- cooperation, sharing of know-how
- development of a sound methodology ("from design-based to model-based approach")
- exploration & tentative implementation
- looking for partners
Action:
- Pilot projects, Member States (ESSnet, European Statistical System Network)
- Exploring different big data sources
- Establish partnerships with data providers, research and international organisations
- Cooperation with UN on Methodological Framework

Challenges:
- new skills for NSI staff: statisticians vs. data scientists?
- computing capacity, hardware?
- analytical tools, software?
- storage?
Action:
- Training program for European statisticians (ESTP)
- Dedicated courses on big data
- Focus on big data sources & on big data tools
- Acquiring the skills needed to assess sources and their quality, the skills to use tools and to explore big data sources

Challenges:
- integrating official statistics into big data strategies
- getting access to data & continuity of access
- data security & privacy concerns
- compensate for the burden?
Action:
- Project on the analysis of legislation and strategy (but also ethics and communication), 2015-2017
- Analysis for EU and for Member States at national level

Challenges:
- transversal challenges to all big data activities: quality and ethics & communication
- big data vs. statistics: "goodness of fit" (concepts, representativeness, ...)
- impact on the public opinion of privacy and security concerns?
Action:
- Cooperation with UN on a quality framework for big data
- Project on the analysis of ethics and communication (but also legislation and strategy)

ESSnet WP1 Webscraping / Job Vacancies (UK, DE, EL, IT, SE, SI)
Aim: To demonstrate by concrete estimates which approaches (techniques, methodology, etc.) are most suitable to produce statistical estimates in the domain of job vacancies and under which conditions these approaches can be used in the ESS.
End date: May 2018

ESSnet WP2 Webscraping / Enterprise Characteristics (IT, BG, NL, PL, SE, UK)
Aim: To investigate whether webscraping, text mining and inference techniques can be used to collect, process and improve general information about enterprises: presence of web sales facilities, profiling information (type of activity, links with other enterprises, etc.).
End date: May 2018
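As a rough illustration of what detecting the "presence of web sales facilities" from a scraped page might involve, the sketch below extracts the visible text of an HTML page with Python's standard-library parser and looks for shop-related phrases. The keyword list, the function name `has_web_sales` and the sample pages are hypothetical; the actual pilots may well have used text mining and statistical inference rather than a fixed keyword rule.

```python
from html.parser import HTMLParser

# Hypothetical keyword list; a real pilot would tune or learn these.
SALES_KEYWORDS = ("add to cart", "add to basket", "checkout", "shopping cart")


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def has_web_sales(html: str) -> bool:
    """Return True if the visible page text contains any sales keyword."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    return any(keyword in text for keyword in SALES_KEYWORDS)


# Made-up sample pages standing in for scraped enterprise websites.
shop_page = "<html><body><h1>Acme Tools</h1><button>Add to cart</button></body></html>"
info_page = "<html><body><h1>Acme Consulting</h1><script>var checkout = 1;</script></body></html>"
print(has_web_sales(shop_page), has_web_sales(info_page))
```

A keyword rule like this is easy to audit but brittle across languages and site designs, which is one reason a pilot would compare it against model-based classification.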

ESSnet WP3 Smart Meters (EE, AT, DK, SE)
Aim: To demonstrate by concrete estimates whether data from buildings equipped with smart meters can be used to produce energy statistics and can also be relevant as a supplement for other statistics, e.g. census housing statistics, household costs, impact on environment, statistics about energy production.
End date: May 2018

ESSnet WP4 Automated Identification System Data (NL, DK, EL, NO, PL)
Aim: To investigate whether real-time measurement data of ship positions (via AIS system) can be used
1. to improve the quality and internal comparability of existing statistics and
2. for new statistical products relevant for the ESS.
End date: May 2018

ESSnet: AIS data
Possible advantages:
- Port visits
- Determining ship routes
- Improve existing statistics on fuel consumption and emissions
- Reduce respondent burden for some ports
- Accelerate publishing speed for some maritime statistics
- Experimental: now-cast economic time series
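The "port visits" use case can be sketched as follows: a visit is counted each time a ship's reported position moves from outside to inside a circle around a port. This is an illustrative assumption, not the method used in the ESSnet pilot; the `Position` record, the 5 km radius and the coordinates are all hypothetical, and real AIS processing would also need to handle noise, gaps and duplicate messages.

```python
import math
from dataclasses import dataclass


@dataclass
class Position:
    ship_id: str
    timestamp: int  # seconds since an arbitrary epoch
    lat: float
    lon: float


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points in kilometres."""
    r = 6371.0  # mean Earth radius in km
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def count_port_visits(positions, port_lat, port_lon, radius_km=5.0):
    """Count, per ship, the transitions from outside to inside the port circle."""
    visits = {}
    inside = {}
    for p in sorted(positions, key=lambda p: (p.ship_id, p.timestamp)):
        now_inside = haversine_km(p.lat, p.lon, port_lat, port_lon) <= radius_km
        if now_inside and not inside.get(p.ship_id, False):
            visits[p.ship_id] = visits.get(p.ship_id, 0) + 1
        inside[p.ship_id] = now_inside
    return visits


# Made-up track: ship A enters the port area twice, ship B never does.
track = [
    Position("A", 0, 52.50, 3.00),
    Position("A", 1, 51.95, 4.14),
    Position("A", 2, 51.96, 4.15),
    Position("A", 3, 52.50, 3.00),
    Position("A", 4, 51.95, 4.13),
    Position("B", 0, 53.00, 3.00),
]
visits = count_port_visits(track, port_lat=51.95, port_lon=4.14)
print(visits)
```

Counting entries rather than positions inside the circle avoids inflating the number of visits for ships that report many messages while moored.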

Moving from Internet of Things ...
A set of sensors, actuators, smart objects, data communications and interface technologies that allow information to be collected, tracked and processed across local and global network infrastructures, enabling the future hyperconnected society

... to Smart statistics
- Data capturing, processing and analysis will be embedded in the system itself
- Intelligence along data life-cycle enhanced with cognitive processes
- Trusted smart statistics

Yes, we can use big data in official statistics if we approach it carefully.
Thank you for your attention
Information: Collaboration in Research and Methodology for Official Statistics
European Commission » Eurostat » CROS » Big data » Big Data Initiatives
https://ec.europa.eu/eurostat/cros/content/big-data_en
konstantinos.giannakouris@ec.europa.eu
This photo, "Cartoon: Big Data", is copyright (c) 2014 Thierry Gregorius and made available under an Attribution 2.0 Generic license.