Business microdata dissemination at Istat

Daniela Ichim (ichim@istat.it)
Luisa Franconi (franconi@istat.it)

Outline
- Released products
- Microdata dissemination
- Business microdata dissemination
- Documentation of microdata files
- Further work

Information dissemination
The mission of National Statistical Institutes (NSIs) is to produce and disseminate information that is reliable, impartial, transparent, accessible and pertinent.
The dissemination of this information should be performed in full compliance with the legislation pertaining to the privacy and confidentiality of respondents.

Different users, different needs
[Figure: dissemination channels plotted by information content and risk of confidentiality breach. Labels: press releases, (e-)books, TV, Internet, social networks; expert users, controlled channels.]

Microdata
The demand for microdata analysis is steadily increasing:
a) infrastructure advances (computational power, software availability)
b) more information is available (internet)
c) need to analyse more localized phenomena
Advantages of dealing with microdata:
- data processing is unlimited and unrestricted: data selection, models and methods, prioritisation of variables and/or sources
- training (and experience) on real data, complex datasets
- transparency, neutrality and impartiality
- reproducibility of research and Official Statistics

Microdata
Disadvantages:
- microdata are NOT user-friendly
- software tools are required
- knowledge (IT, statistical, methodological, subject-matter) is required
- privacy and confidentiality
- controlled access and dissemination

Microdata dissemination at Istat
[Figure: the three microdata products micro.stat, MFR and ADELE, labelled with the years 2013, 2009, 1999 and (2012), on a 0% to 100% scale contrasting statistical and IT procedures.]

Microdata dissemination at Istat
ADELE
- accredited researchers
- scientific research projects
- data analysis only in the secure rooms
- output is checked by expert staff before its transmission to the users
MFR
- accredited researchers
- scientific research projects
- no statistical or IT restriction on the analyses
micro.stat
- registered users (only a valid e-mail is necessary)
- no statistical or IT descriptions

Microdata dissemination at Istat
Integrated system (microdata files share the same structure): ADELE, MFR, micro.stat.
Methods: recoding, subsampling, top/bottom coding, microaggregation, perturbation, rounding, etc.
Multiple releases from the same survey.
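Two of the masking methods named on this slide, top/bottom coding and rounding, can be illustrated in a few lines. This is only a minimal sketch on invented turnover figures; the thresholds, the rounding base and the function names are assumptions made for illustration and are not taken from Istat's procedures.

```python
def top_bottom_code(values, lower, upper):
    """Bottom/top coding: values below `lower` or above `upper`
    are replaced by the respective threshold."""
    return [min(max(v, lower), upper) for v in values]


def round_to_base(values, base):
    """Rounding: each value is replaced by the nearest multiple of `base`."""
    return [base * round(v / base) for v in values]


# Invented turnover values (hypothetical units).
turnover = [12, 48, 95, 130, 2750]

coded = top_bottom_code(turnover, lower=40, upper=500)
rounded = round_to_base(coded, base=50)

print(coded)    # extremes pulled to the thresholds
print(rounded)  # remaining detail coarsened to multiples of 50
```

Both functions lose information on purpose: the more aggressive the thresholds and the base, the lower the disclosure risk and the lower the analytical value of the released variable.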

Microdata dissemination at Istat
Legal aspects:
- access to social, business, register and integrated microdata
- access is independent of nationality
- no consent is required, but we have to inform respondents
Access is free of charge.

Microdata dissemination at Istat
ADELE: any Istat survey
MFR:
Survey | EU
Continuous Vocational Training Survey (CVTS) | YES
Factors of Business Success (FOBS) | NO
Farm Structure Survey (FSS) | NO
Italian Innovation Survey (CIS) | YES
University Graduates Census | NO
Population Census 2001 | NO
Road Accidents Resulting in Death or Injury | NO
Structure of Earnings Survey (SES) | YES
Survey on Doctorate Holders Vocational Integration | NO
Graduates' Transition | NO
Labour Force Survey - cross-sectional quarterly | YES
Labour Force Survey - 12 months longitudinal data | NO
University Graduates' Vocational Integration | NO
More information: http://www.istat.it/it/prodotti/microdati

Dissemination strategy - Istat
[Diagram: original microdata, assessed for disclosure risk (R) and utility (U) -> SDL methods -> anonymized microdata.]
- Apply SDL to reduce risk while maintaining some utility
- Evaluate utility
Utility: analytical validity
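The "evaluate utility" step can be made concrete by comparing statistics computed on the original and on the anonymized file. The sketch below is one simple way to do this and is not the evaluation used by Istat: the data are invented, and the helper `relative_change` is a hypothetical name.

```python
import statistics


def relative_change(original, anonymized, stat):
    """Relative absolute change of a statistic after anonymization."""
    before = stat(original)
    after = stat(anonymized)
    return abs(after - before) / abs(before)


# Invented turnover values and a masked version in which the three
# smallest and the four largest values were replaced by group means.
original = [95, 100, 110, 120, 130, 2750, 3100]
anonymized = [305 / 3] * 3 + [1525.0] * 4

mean_change = relative_change(original, anonymized, statistics.mean)
spread_change = relative_change(original, anonymized, statistics.pstdev)

print(f"mean changed by {mean_change:.1%}")
print(f"standard deviation changed by {spread_change:.1%}")
```

In this toy case the mean is preserved while the standard deviation shrinks considerably, which shows why utility has to be judged against the analyses users are expected to run rather than against a single summary figure.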

Business microdata dissemination at Istat
Particular issues from an SDC point of view:
- smaller reference population
- (known) take-all strata
- large enterprises are well-known (recognizable)
- large enterprises are dominating
- outliers, (extremely) skewed distributions
- there might be some «economic» interest in identifying some businesses
- there might be some real (measurable) harm if a business is identified
- both continuous and categorical variables
- continuous variables: each record is a unique case
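The last point, that continuous variables make every record unique, is easy to check on a toy file. The sketch below counts the records that are unique on a given set of key variables; the records, the variable names and the helper `sample_uniques` are all invented for illustration.

```python
from collections import Counter


def sample_uniques(records, key_vars):
    """Return the records whose combination of key values occurs only once."""
    keys = [tuple(r[k] for k in key_vars) for r in records]
    counts = Counter(keys)
    return [r for r, key in zip(records, keys) if counts[key] == 1]


# Invented enterprise records (hypothetical variables and values).
records = [
    {"nace": "C10", "region": "North", "turnover": 1520.4},
    {"nace": "C10", "region": "North", "turnover": 1498.9},
    {"nace": "C10", "region": "South", "turnover": 310.2},
    {"nace": "G47", "region": "South", "turnover": 87.5},
    {"nace": "G47", "region": "South", "turnover": 92.1},
]

unique_on_categories = sample_uniques(records, ["nace", "region"])
unique_on_turnover = sample_uniques(records, ["turnover"])

print(len(unique_on_categories), "of", len(records), "unique on categorical keys")
print(len(unique_on_turnover), "of", len(records), "unique on turnover")
```

On the categorical keys only one enterprise stands alone, whereas on turnover every record is unique, so a frequency-based risk measure designed for categorical keys is of little help for continuous variables.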

Business microdata dissemination at Istat
Disclosure scenarios:
- categorical variables: external registers
- continuous variables: outliers (data-driven approaches)
[Figure: Turnover]
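A data-driven approach to the outlier scenario could, for example, flag enterprises whose turnover lies far from the bulk of the distribution. The sketch below uses a robust z-score (median and median absolute deviation) on the log scale; this particular rule, the 3.5 threshold and the data are assumptions chosen for illustration, since the slides do not specify which detection method is used.

```python
import math
import statistics


def flag_outliers(values, threshold=3.5):
    """Flag values whose robust z-score on the log scale exceeds `threshold`.

    Assumes strictly positive values (such as turnover). The log transform
    tames the skewness typical of business data.
    """
    logs = [math.log(v) for v in values]
    med = statistics.median(logs)
    mad = statistics.median([abs(x - med) for x in logs])
    if mad == 0:
        # No spread around the median: nothing can be ranked as extreme.
        return [False] * len(values)
    # 0.6745 rescales the MAD so the score is comparable to a normal z-score.
    return [abs(0.6745 * (x - med) / mad) > threshold for x in logs]


# Invented turnover values with one dominating enterprise.
turnover = [80, 95, 110, 120, 150, 9000]
flags = flag_outliers(turnover)

for value, at_risk in zip(turnover, flags):
    print(value, "at risk" if at_risk else "ok")
```

Only the dominating enterprise is flagged here, which is the kind of unit an intruder could recognise from its size alone.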

Business microdata dissemination at Istat
Istat approach: statistical disclosure control methods
SDC methods considered: variable suppression, rounding, individual ranking, recoding, perturbation.
Methods applied, by survey:
- CIS: all five methods
- SES: four of the five methods
- CVTS: four of the five methods
- FSS: all five methods
- FOBS: three of the five methods
Requirements for the chosen methods:
- suitable to the scenario
- perturb only the units at risk
- suitable to the data analysis (research potential), including comparability and harmonisation at EU level
- ensure coherence with already published information
- apply the same methodology to subsequent waves
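Individual ranking is a form of microaggregation in which each continuous variable is sorted on its own, split into groups of at least k consecutive values, and every value is replaced by its group mean. The sketch below shows that textbook version on invented data; the group size, the handling of the remainder and the function name are assumptions, and the exact variant applied by Istat is not described in the slides.

```python
def individual_ranking(values, k=3):
    """Microaggregate one variable by individual ranking.

    Values are ranked, cut into groups of k consecutive ranks, and replaced
    by the group mean. A remainder smaller than k is merged into the last
    group so that every group holds at least k values (when len(values) >= k).
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    n = len(order)
    masked = [0.0] * n
    start = 0
    while start < n:
        end = start + k
        if n - end < k:
            end = n  # absorb the remainder into this final group
        group = order[start:end]
        mean = sum(values[i] for i in group) / len(group)
        for i in group:
            masked[i] = mean
        start = end
    return masked


# Invented turnover values; two enterprises dominate the distribution.
turnover = [100, 2750, 130, 120, 95, 3100, 110]
masked = individual_ranking(turnover, k=3)

print(masked)
```

The total of the variable is preserved because each group keeps its own sum, while no released value belongs to fewer than k enterprises.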

Documentation of Istat microdata files
Microdata documentation is needed to facilitate its use.

Documentation of Istat business microdata files
The microdata products share the same documentation, freely downloadable from the Istat web-site:
a) Survey methodology (sampling design, data collection, data calibration, etc.)
b) SDC methodology (disclosure scenarios, disclosure limitation methods, data utility evaluations)
c) Survey questionnaire
d) Layout description (list of variables and their characteristics: labels, length, type [categorical or continuous])
e) Classifications
f) Routines to load the data in R, STATA, SPSS and SAS
g) A toy microdata file, an example of structure file
Istat microdata documentation is available also in English (EU).
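To show how a layout description (item d) can drive a loading routine (item f), here is a small Python sketch. It assumes a fixed-width text file in which each variable occupies the number of characters given by its length; that file format, the layout entries and the toy lines are all hypothetical, and the routines actually distributed by Istat are for R, STATA, SPSS and SAS.

```python
# Hypothetical layout: (variable name, length in characters, type).
LAYOUT = [
    ("id", 4, "categorical"),
    ("nace", 3, "categorical"),
    ("turnover", 8, "continuous"),
]


def parse_record(line, layout):
    """Cut one fixed-width line into variables according to the layout."""
    record = {}
    position = 0
    for name, length, kind in layout:
        raw = line[position:position + length].strip()
        # Continuous variables become numbers; categorical codes stay text
        # so that leading zeros are not lost.
        record[name] = float(raw) if kind == "continuous" else raw
        position += length
    return record


# Invented toy lines, standing in for a toy microdata file.
toy_lines = [
    "0001C10  1520.4",
    "0002G47    87.5",
]

records = [parse_record(line, LAYOUT) for line in toy_lines]
print(records)
```

Keeping the layout as data rather than hard-coding positions means the same routine can be reused for every file that shares the structure.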

Further work
a) Other microdata products may be developed, but it could be better to focus on «microdata are not user friendly» and develop instruments to process microdata:
- tools: faster computation, faster visualization, faster interpretation
- standards (SDMX or DDI): faster communication
- services: searchability, documentation, metadata
b) Coherent multiple releases from multiple surveys (integrated statistics)

THANK YOU FOR YOUR ATTENTION!