STIS Statistical Information Systems Consortium INTRASOFT INTERNATIONAL S.A. and AGILIS S.A. European Commission Eurostat/B3 Framework Contract 14200.2005.007-2005.699 - Lot 1 Specific Contract 15100.2005.002-2006.231 XML-Publishing Implementation Strategy of an XML-based publishing in Eurostat Proposed architecture August 2007
Status: Company Approved Page 2/9 Table of Contents 1 Introduction... 4 1.1 Purpose and Scope... 4 1.2 Definitions & Acronyms... 4 1.3 References... 5 2 Alfresco architecture... 6 2.1 Core support frameworks... 6 2.2 System overview... 6 2.3 System architecture... 7 3 XML based publishing solution in context... 8 3.1 Custom components... 8
Status: Company Approved Page 3/9 Document Service Data Type of Document Reference: Technical report XML-Publishing proposed architecture v1.0.doc Issue: 1 Revision: 0 Status: Company Approved Created by: Victorio Bentivogli Date: 27/08/2007 Distribution: Contract Full Title: EU-Eurostat, Intrasoft International S.A. N/A Service contract number: Specific Contract 15100.2005.002-2006.231 For Internal Use Only Reviewed by: Approved by: Christian Boudot Mario Fendler Document Change Record Issue/Revision Date Change 0.1 24/08/2007 Draft version 1.0 27/08/2007 Delivery version
Status: Company Approved Page 4/9 1 Introduction 1.1 Purpose and Scope The present document provides an initial architectural view of the proposed XML based publication solution for Eurostat. 1.2 Definitions & Acronyms The following is a list of the acronyms used in this document: Acronym Definition SDMX Statistical Data and Metadata exchange ODF Open Document Format
Status: Company Approved Page 5/9 1.3 References Reference Document/Resource Name Filename [R1] D0.1: Minutes of the Project Kick-Off Meeting on 13. Nov. 2006 Kickoff Meeting 2006-11-13 v1.2.doc [R2] D1.1: Analysis of the publications programme, dissemination process and data life cycle D11 Analysis XML-Publishing v1.1.doc [R3] D1.2: Design of an XML-based production workflow for Eurostat publications and the related implementation strategy D12 Design of XML-based workflow v2.0.doc [R4] D2.1: Analysis & Evaluation of existing standards D21 Analysis of existing standards v2.0.doc [R5] D2.2: Design Document of XML schema(s) for Eurostat publications D22 Design Document of XML schema(s) v2.0.doc [R6] D0.5: Interim Report XML-Publishing D0.5 Interim Report 2007 v1.0.doc [R7] D3.1: Analysis of existing Eurostat Databases and data formats D31 Analysis of existing Eurostat Databases and data formats v1.0 [R8] D3.2: Evaluation of existing solutions to create XML-manuscripts without the need of the author to be a XML-expert D32 Evaluation of XML-manuscript creation tools v1.0.doc [R9] D1.0: Final Report XML-Publishing D1.0 Final Report 2007 v1.0.doc [R10] Alfresco web site (www.alfresco.com)
Status: Company Approved Page 6/9 2 Alfresco architecture This section introduces the core components used in Alfresco Content Management System. It also presents a System overview and an initial architectural diagram in the context of the XML based publication solution for Eurostat. The information provided in this document is preliminary and based in the findings and conclusions of the current analysis. It is subject to evolution during the specification phase of an initial Prototype. 2.1 Core support frameworks Alfresco uses state of the art core components that assembled together provide a powerful, scalable and reliable Content Management foundation. They include: Spring - Spring is an application framework that includes AOP capabilities. Hibernate - Hibernate is an object-relational mapping technology Lucene - Lucene is an enterprise-scale full-text and metadata search engine MyFaces - MyFaces is an implementation of Java Server Faces 2.2 System overview The following diagram introduces Alfresco Content Management system s services. Figure 1 - Alfresco system overview
Status: Company Approved Page 7/9 2.3 System architecture The following diagram shows how the different architectural components are integrated into Alfresco: Figure 2 - Alfresco system architecture
Status: Company Approved Page 8/9 3 XML based publishing solution in context This section introduces the proposed integration between Alfresco and the custom components to be developed for the XML based publishing solution for Eurostat. Technically speaking, both Alfresco and the custom components developed for the solution will be hosted on a JSP container (Apache Tomcat). The solution will also use: a file system to store documents and a relational database (like Oracle) in order to persist metadata and internal business related information 3.1 Custom components The following is a preliminary list (to be completed during the specification phase) of custom components that will be developed for the solution, and will extend Alfresco s own architecture: User interface components, built to integrate seamlessly with Statpub. Workflow custom components, to facilitate the integration with Statpub and support the collaborative business process (creation, authoring, proofreading, translation, publication). ODF custom generators, to facilitate the construction of ODF fragments containing tabular data and charts generated from external data sources. Digesters, to process and homogenise information coming from different data sources. Custom transformers, to produce publications compatible with the different output channels (PDF, mini Web sites, etc). Metadata extractors, to extract metadata contained in ODF documents, populating the Content Management System. Metadata assemblers, to stamp metadata on exported documents in order to facilitate tracking and control. Below we can see an initial diagram of the solution in context:
Status: Company Approved Page 9/9 XML based publishing custom developed components Backup and restore components Figure 3 XML based publishing solution diagram