RecordMe: A Smartphone Application for Experimental Collections of Large Amount of Data Respecting Volunteer s Privacy

Similar documents
LaHC at CLEF 2015 SBS Lab

Taking Benefit from the User Density in Large Cities for Delivering SMS

Fault-Tolerant Storage Servers for the Databases of Redundant Web Servers in a Computing Grid

Setup of epiphytic assistance systems with SEPIA

FIT IoT-LAB: The Largest IoT Open Experimental Testbed

Tacked Link List - An Improved Linked List for Advance Resource Reservation

An FCA Framework for Knowledge Discovery in SPARQL Query Answers

Service Reconfiguration in the DANAH Assistive System

QuickRanking: Fast Algorithm For Sorting And Ranking Data

HySCaS: Hybrid Stereoscopic Calibration Software

Blind Browsing on Hand-Held Devices: Touching the Web... to Understand it Better

BoxPlot++ Zeina Azmeh, Fady Hamoui, Marianne Huchard. To cite this version: HAL Id: lirmm

Linked data from your pocket: The Android RDFContentProvider

DSM GENERATION FROM STEREOSCOPIC IMAGERY FOR DAMAGE MAPPING, APPLICATION ON THE TOHOKU TSUNAMI

DANCer: Dynamic Attributed Network with Community Structure Generator

Multimedia CTI Services for Telecommunication Systems

Mokka, main guidelines and future

How to simulate a volume-controlled flooding with mathematical morphology operators?

SIM-Mee - Mobilizing your social network

Very Tight Coupling between LTE and WiFi: a Practical Analysis

From medical imaging to numerical simulations

MUTE: A Peer-to-Peer Web-based Real-time Collaborative Editor

YAM++ : A multi-strategy based approach for Ontology matching task

Assisted Policy Management for SPARQL Endpoints Access Control

Change Detection System for the Maintenance of Automated Testing

lambda-min Decoding Algorithm of Regular and Irregular LDPC Codes

Regularization parameter estimation for non-negative hyperspectral image deconvolution:supplementary material

Robust IP and UDP-lite header recovery for packetized multimedia transmission

Reverse-engineering of UML 2.0 Sequence Diagrams from Execution Traces

Natural Language Based User Interface for On-Demand Service Composition

Deformetrica: a software for statistical analysis of anatomical shapes

Privacy-preserving carpooling

Malware models for network and service management

Experimental Evaluation of an IEC Station Bus Communication Reliability

Fuzzy sensor for the perception of colour

X-Kaapi C programming interface

Linux: Understanding Process-Level Power Consumption

Zigbee Wireless Sensor Network Nodes Deployment Strategy for Digital Agricultural Data Acquisition

Open Digital Forms. Hiep Le, Thomas Rebele, Fabian Suchanek. HAL Id: hal

Simulations of VANET Scenarios with OPNET and SUMO

Multi-atlas labeling with population-specific template and non-local patch-based label fusion

Scalewelis: a Scalable Query-based Faceted Search System on Top of SPARQL Endpoints

Captured Motion Data Processing for Real Time Synthesis of Sign Language

Teaching Encapsulation and Modularity in Object-Oriented Languages with Access Graphs

Comparison of radiosity and ray-tracing methods for coupled rooms

Comparator: A Tool for Quantifying Behavioural Compatibility

Formal modelling of ontologies within Event-B

The Proportional Colouring Problem: Optimizing Buffers in Radio Mesh Networks

The Connectivity Order of Links

Leveraging ambient applications interactions with their environment to improve services selection relevancy

Fueling Time Machine: Information Extraction from Retro-Digitised Address Directories

Moveability and Collision Analysis for Fully-Parallel Manipulators

XML Document Classification using SVM

A Methodology for Improving Software Design Lifecycle in Embedded Control Systems

Relabeling nodes according to the structure of the graph

Paris-Lille-3D: A Point Cloud Dataset for Urban Scene Segmentation and Classification

Traffic Grooming in Bidirectional WDM Ring Networks

The New Territory of Lightweight Security in a Cloud Computing Environment

Decentralised and Privacy-Aware Learning of Traversal Time Models

Catalogue of architectural patterns characterized by constraint components, Version 1.0

UsiXML Extension for Awareness Support

Comparison of spatial indexes

Structuring the First Steps of Requirements Elicitation

Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface

Implementing an Automatic Functional Test Pattern Generation for Mixed-Signal Boards in a Maintenance Context

Self-optimisation using runtime code generation for Wireless Sensor Networks Internet-of-Things

COM2REACT: V2V COMMUNICATION FOR COOPERATIVE LOCAL TRAFFIC MANAGEMENT

NP versus PSPACE. Frank Vega. To cite this version: HAL Id: hal

Branch-and-price algorithms for the Bi-Objective Vehicle Routing Problem with Time Windows

A N-dimensional Stochastic Control Algorithm for Electricity Asset Management on PC cluster and Blue Gene Supercomputer

ASAP.V2 and ASAP.V3: Sequential optimization of an Algorithm Selector and a Scheduler

A 64-Kbytes ITTAGE indirect branch predictor

Accelerometer and Magnetometer Based Gyroscope Emulation on Smart Sensor for a Virtual Reality Application

Lossless and Lossy Minimal Redundancy Pyramidal Decomposition for Scalable Image Compression Technique

Quality of Service Enhancement by Using an Integer Bloom Filter Based Data Deduplication Mechanism in the Cloud Storage Environment

A Practical Evaluation Method of Network Traffic Load for Capacity Planning

A million pixels, a million polygons: which is heavier?

Synthesis of fixed-point programs: the case of matrix multiplication

Every 3-connected, essentially 11-connected line graph is hamiltonian

KeyGlasses : Semi-transparent keys to optimize text input on virtual keyboard

Study on Feebly Open Set with Respect to an Ideal Topological Spaces

Continuous Control of Lagrangian Data

Real-Time and Resilient Intrusion Detection: A Flow-Based Approach

LIG and LIRIS at TRECVID 2008: High Level Feature Extraction and Collaborative Annotation

Preliminary analysis of the drive system of the CTA LST Telescope and its integration in the whole PLC architecture

IMPLEMENTATION OF MOTION ESTIMATION BASED ON HETEROGENEOUS PARALLEL COMPUTING SYSTEM WITH OPENC

Using a Medical Thesaurus to Predict Query Difficulty

Efficient Gradient Method for Locally Optimizing the Periodic/Aperiodic Ambiguity Function

Light field video dataset captured by a R8 Raytrix camera (with disparity maps)

IntroClassJava: A Benchmark of 297 Small and Buggy Java Programs

QAKiS: an Open Domain QA System based on Relational Patterns

Stream Ciphers: A Practical Solution for Efficient Homomorphic-Ciphertext Compression

YANG-Based Configuration Modeling - The SecSIP IPS Case Study

SDLS: a Matlab package for solving conic least-squares problems

Monitoring Air Quality in Korea s Metropolises on Ultra-High Resolution Wall-Sized Displays

Application-Aware Protection in DWDM Optical Networks

Modularity for Java and How OSGi Can Help

The optimal routing of augmented cubes.

Real-time tracking of multiple persons by Kalman filtering and face pursuit for multimedia applications

The Animation Loop Station: Near Real-Time Animation Production

Transcription:

RecordMe: A Smartphone Application for Experimental Collections of Large Amount of Data Respecting Volunteer s Privacy David Blachon, François Portet, Laurent Besacier, Stéphan Tassart To cite this version: David Blachon, François Portet, Laurent Besacier, Stéphan Tassart. RecordMe: A Smartphone Application for Experimental Collections of Large Amount of Data Respecting Volunteer s Privacy. R. Hervás et al. UCAmI 2014, Dec 2014, Belfast, United Kingdom. Springer International, pp.345 348, 2014, LNCS 8867. <hal-01071484> HAL Id: hal-01071484 https://hal.archives-ouvertes.fr/hal-01071484 Submitted on 10 Oct 2014 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

RecordMe: a Smartphone Application for Experimental Collections of Large Amount of Data Respecting Volunteer s Privacy David Blachon 1,2, François Portet 1, Laurent Besacier 1, Stéphan Tassart 2 1 Laboratoire d Informatique de Grenoble Univ. Grenoble Alpes, LIG, F-38000 Grenoble, France CNRS, LIG, F-38000 Grenoble, France 41 rue Mathématiques, BP 53, 38041 Grenoble cedex9, France {david.blachon,francois.portet,laurent.besacier}@imag.fr 2 STMicroelectronics, 75669 Paris, FRANCE stephan.tassart@st.com Abstract. Since the spread of smartphones, researchers now have opportunities to collect more ecological data. However, despite the many advantages of existing databases (e.g., clean data, direct comparison), they may not suit all criteria for a particular experiment, resulting in an unavoidable tradeoff between the gain they provide and the lack of some labels or data sources. In this paper, we introduce RecordMe, an Android application ready to use for the research community. RecordMe allows to continuously record many different sensors and sources and provides a basic GUI for quick and easy settings. Also, a mark up interface is embedded for experiments that need it. Because of the high sensitivity of some data, RecordMe includes features for protecting volunteers privacy and securing their data. RecordMe has already been successfully tested on different smartphones for 3 data collections. Keywords: Mobile and sensor data acquisition, Context Sensing technology, Databases for data mining, Non-intrusive smart technology 1 Introduction The recent spread of smartphones and tablets has opened many opportunities for the research community. Leveraging smartphones many resources, standard development frameworks, and the many potential volunteers, new sensing ways have emerged such as participatory sensing [1]. Another hot topic is the use of personal data collected through smartphones for tasks such activity recognition or context recognition [2]. In any of these cases, researchers must collect data to build systems and evaluate them. Early studies have performed such sensing for collecting personal data such as Reality Mining [3], a campaign for collecting data through a particular smartphone provided to around 90 participants. More recently, Kiukkonen et al [4] collected data of almost 200 volunteers over several months. Both campaigns

2 D. Blachon et al. provided useful databases (although they relied on a single brand of smartphones), containing volunteers location, social interaction clues or accelerometer readings [4]. Existing databases offer many advantages such as handling tools, annotated data, provided evaluation methodology, easy and direct benchmarking. However, such databases may not fit all the criteria, especially when it comes to use for a task a database that was acquired for another task. Hence, researchers have to trade research objectives off for the gain of time and money of using such databases. Moreover, existing databases are rarely provided with the tools that were used for their acquisition, which makes difficult to reproduce such acquisitions. In fact, to the best of our knowledge, no tool is available for the research community to collect personal data on smartphones. In this paper, we introduce RecordMe, an Android application designed for the research community so as to collect data through volunteers smartphones. RecordMe can continuously record data through around 20 different sources embedded on smartphones. A basic GUI allows an easy and quick use and can be configured. A mark up view is included in the GUI to allow volunteers to fill in personal annotations when experiments require it. Finally, because of the high sensitity of some collected data, built-in features allow to protect volunteers privacy and secure data. In Section 2, RecordMe is introduced while Section 3 reports first statistics and feedbacks of its use for data collections. 2 RecordMe: A Collecting Application for the Research Community 2.1 Design RecordMe can record almost 20 different data sources at a time, both continuous or event-based, and provide either raw data or processed ones. Table 1 summarizes recorded sensors and processings if any. This core feature is wrapped up with a basic GUI that allows volunteers a total control of the recording (see Figure 1). Indeed, she/he can select the sources to record, start and stop them at any time, and a notification informs her in real time about the status of the recording. If required, data transmission can be performed through a button on the main view that will start a wifi connection and transmit data through Secure Copy Protocol (SCP) to a predefined server. Table 1. Summary of sensed sources Type Sensors Data Accelerometer, Barometer, Ambient Continuous light, Magnetometer, Gyroscope Raw 3-axis readings Continuous Audio DFT coefficients magnitudes Proximity sensor, Headset, Battery Status transitions (e.g., on/off) Event Screen State, Communication Logs Service status (e.g., in service) (Call, SMS, Data, Bluetooth, Wifi) Comm. status (e.g., ringing) Event Hash function on character strings Communication logs, application use (ids, MAC addresses, app names) Event GPS, Wifi Translation of coordinates

RecordMe: a Smartphone Application for Experimental Collections 3 Because some experiments might require volunteers to report some notes or mark up, the GUI provides a view for such purpose. The markup template can be configured to provide predefined fields to fill in so as to make annotation quick and easy. However, volunteers can switch to text free areas for writing down their own annotations. 2.2 Privacy and Security For ensuring privacy care and security of the data, RecordMe embeds built-in features. For instance, data older than 48 hours are deleted on a regular basis. Also, audio sensing is stopped when volunteers pass a call. Specific processing are applied to sensitive data. Location coordinates from GPS or Wifi hotspots are translated then stored on the device so that it is not straightforward for non-experimenters to use them. For audio, RecordMe does not save raw signal but features computed on the fly. Coefficients magnitude of the Discrete Fourier Transform (DFT) are computed on 50% overlapping buffers of approximately 25 ms. Then, consecutive vectors are averaged on a window of approximately 200 ms. These settings guarantee that the raw signal cannot be reconstructed. Also, character strings such as phone numbers from incoming calls or SMS, MAC addresses of nearby Bluetooth or Wifi devices, or cell tower ids are processed since they are sensitive. Since only correlation between the occurrence of those elements with other flows of information and with markups are of interest, a non reversible transformation by hash function is applied to protect privacy. A hash function associates a unique identifier to every input so that the 2 distribution and correlation of the occurences are unchanged. Fig. 1. Screenshots of RecordMe : from left to right, Home view (a), Source Selection view (b), and Markup view (c).

4 D. Blachon et al. 3 Conclusion Three experiments have been run with RecordMe and are rich in feedbacks. First one is that RecordMe was stable on most tested smartphones. Only one smartphone had an original behaviour that consisted in stopping sensing when its screen turned off. We found that it was likely to be accountable for the constructor. Our workaround was the installation of a screen locker that kept the screen on then allowed a continuous recording. We also received volunteers feedbacks about their experience with the application, especially that the marking up process was not convenient. Indeed, one collection campaign required volunteers to mark up just after the end of an activity. Some volunteers reported that they sometimes forgot: either they provided a delayed mark up or they just skipped it. Workarounds could be to ask volunteers to follow predefined scenarios or to orally record annotations. Depending on experiments to perform and countries legislation, RecordMe can provide different security levels to protect volunteers data. For instance, for an experiment that studies urban volunteers motions and requires GPS coordinates collection, levels of security can be configured so that GPS coordinates are secured when stored on smartphones (by applying a translation for instance). Then, when transferred to the experimenters, coordinates are anonymized. Further processing of the coordinates is still possible by temporarily reversing the security processing (providing that volunteers agreed), yet the coordinates would remain anonymized. One of the data collections that used RecordMe performed in such a way and registered the protocol to the CNIL 3. So far, RecordMe has already been successfully installed and tested on more than 10 different smartphones from different brands (Acer, Google, Motorola, Samsung, Sony, Wiko) and Android OS versions 2.3 and 4.0. The three different data collections total almost 30 volunteers, located in different areas of France (mainly Grenoble, a South-Eastern town) and around the world (Budapest, Dublin, Singapour). 100 different recordings have been collected so far, resulting in a total of 430 hours. References 1. J. A. Burke, D. Estrin, M. Hansen, A. Parker, N. Ramanathan, S. Reddy, and M. B. Srivastava. Participatory sensing. World Sensor Web Workshop (at Sensys), 2006. 2. P. Chahuara, F. Portet, and M. Vacher. Making context aware decision from uncertain information in a smart home : a markov logic network approach. In Ambient Intelligence, Lecture Notes in Computer Science, pages 78 93, Dublin, Ireland, 2013. 3. N. Eagle and A. Pentland. Reality mining: sensing complex social systems. Personal and ubiquitous computing, 10(4):255 268, 2006. 4. N. Kiukkonen, J. Blom, O. Dousse, D. Gatica-Perez, and J. Laurila. Towards rich mobile phone datasets: Lausanne data collection campaign. Proc. ICPS, Berlin, 2010. 3 Please visit its website for more details http://www.cnil.fr/english/. Our file reference is 0750532.