DOC2MODEL. This document is a docuware for creation review and describes the doc2model project.

Similar documents
Papyrus: Advent of an Open Source IME at Eclipse (Redux)

Object Constraint Language (MDT OCL) 1.3 Galileo Simultaneous Release Review

Modeling Amalgamation Project

The Galilean Moons of Eclipse

GMF 2.0 Europa Simultaneous Release

EGF Creation Review. Benoît Langlois - Thales/TCS/EPM. April 22, 2009

Towards A Common Build Infrastructure: Designing For Reusability

Eclipse Sketch Creation Review

Dali JPA Tools Project Graduation Review

Acceleo Galileo Simultaneous Release

Java Community Process Program: State of the Community State of the Community December 2003

EMF Compare Galileo Simultaneous Release

OPEN REQUIREMENTS MANAGEMENT FRAMEWORK (ORMF) CREATION REVIEW Etish Limited, Made available under the Eclipse Public License v. 1.

IP Log for modeling.mdt.papyrus

EMFT 1.0 Release Review (OCL, Query, Transaction, and Validation)

Developing Applications With Java And UML By Paul R. Reed Jr.

EMF Compare Ganymede Simultaneous Release

Unified Modeling Language (MDT UML2) 3.0 Galileo Simultaneous Release Review. 4 June, 2009

EMF Ganymede Simultaneous Release

Eclipse Automotive Working Group

Eclipse Collaborative Development

The Next Generation of Eclipse: e4. Mike Milinkovich Executive Director Eclipse Foundation

Redefining Software- Defined Networking

Docuware for Creation Review

Eclipse SOA Tooling Platform: Project Overview. An Overview of the Eclipse STP (SOA Tooling Platform) Project

Creation Review. Eclipse IAM: Eclipse Integration for Apache Maven. Carlos Sanchez

Stand: File: gengler_java_e. Java Community Process: Not perfect, but operating very successfully

Introduction to OpenDaylight: An Open Source Community around Software-Defined Networking

FREQUENTLY ASKED QUESTIONS

Eclipse Data Tools Platform (DTP) 1.8 Release Review. Eclipse DTP PMC May 28, 2010

EMFT Mint (Incubation) 0.7 Ganymede Simultaneous Release Review

EMF Europa Simultaneous Release

CISC836: Models in Software Development: Methods, Techniques and Tools

JSR 377 Desktop Application Framework September Andres Almiray

Strategic Developer Report

Practical Model-Driven Development with the IBM Software Development Platform

TOPCASED. Current status

Sirius Your custom modeling environment made easy, at last! Stéphane Bonnet (Thales) Cédric Brun (Obeo) EclipseCon 2013

Planned Review Date: June 10, 2009 Communication Channel: eclipse.tools.emf Dave Steinberg and Ed Merks

The Eclipse Modeling Framework and MDA Status and Opportunities

Meeting Minutes of Jakarta EE Steering Committee Meeting on September 4 Attendees:

Embedded Rich Client Platform (ercp) 1.2 Release Review

3rd Lecture Languages for information modeling

Oliopäivät Modelling Now and in the Future, with Acronyms or without = RSA

SCA Tools project Creation Review. July

BPMN modeler project Creation review. Wednesday, Oct. 29 8am PT. Feedback and communication: eclipse.bpmn-modeler newsgroup

Java Community Process Executive Committee Meeting. May 14-15, 2009

A mobile campus project

Eclipse Data Tools Platform (DTP) 0.7 Release Review. Eclipse DTP PMC March 8, 2006

The JSF Tools Project

TOPCASED. Toolkit In OPen source for Critical Applications & SystEms Development

Simple to purchase, flexible to use

Practical Model-based Testing With Papyrus and RT-Tester

Tools to Develop New Linux Applications

TmL Creation Review. First Proposal

THIS IS ONLY SAMPLE RESUME - DO NOT COPY AND PASTE INTO YOUR RESUME. WE ARE NOT RESPONSIBLE Name: xxxxxx

The Eclipse Development Process for Committers

EATOP: An EAST-ADL Tool Platform for Eclipse

JSR 365 (CDI 2.0) Review

XML in the bipharmaceutical

Status by 30 th April 2017

International Journal of Advance Research in Engineering, Science & Technology. Study & Analysis of SOA based E-Learning Academic System

Modeling tools go up to the cloud... Mélanie Bats - Pierre-Charles David - Sirius Tech Lead

BPM Case Study - Managing Regulatory Controls in the Banking and Financial Sector with Activiti. Greg Harley Director - Special Projects BP3 Global

Website Design and Development Terms of Reference

Lotus Symphony. Siew Chen Way Lotus Technical Consultant

Eclipse SOA Tools Platform Project

Graphiti Release Review

Using Xml Schemas Effectively In Wsdl Design

ECLIPSE PERSISTENCE PLATFORM (ECLIPSELINK) FAQ

Data Scientists / Data Analysts (m/f) educational ackground university degree in data science, computer science, software engineering

Oracle Fusion Applications Connect Program. Release 11gRelease 2

Intelligent Recipe Publisher - Delicious

WebSphere Application Server, Version 5. What s New?

The Information Technology Program (ITS) Contents What is Information Technology?... 2

JSR 358 Progress Report. June 11, 2013

Eclipse as a Web 2.0 Application Position Paper

INTRODUCTION TO EMF. Creating Model using EMF. Our Domain model used to showcase this use case is as shown below in fig.1

Improve and Expand JavaServer Faces Technology with JBoss Seam

Eclipse and Java 8. Daniel Megert Platform and JDT Lead Eclipse PMC Member IBM Rational Zurich Research Lab

Best Practices for JSF Portlet Migration and Development

Accelerate Your Enterprise Private Cloud Initiative

IBM Rational Application Developer for WebSphere Software, Version 7.0

Expeditor Client for Desktop. Client Platform Overview

WindowBuilder Graduation & Release Review

Embedded Rich Client Platform (ercp) 1.0 Release Review

From the RCP Book To Reality. Jean-Michel Lemieux IBM Rational, Ottawa, Canada

Voting Members Present. Review of Minutes. Marketing Committee Update. Spec Committee Update. Technical Vision Update. Status of Oracle Contributions

Welcome to this IBM podcast, Realizing More. Value from Your IMS Compiler Upgrade. I'm Kimberly Gist

Caliber Data Training

JCP 2.8 Progress Report Public EC Meeting. Heather VanCura 20 November 2012

Quick tour of the Test and Performance Tools Platform (TPTP) Valentina Popescu IBM Tivoli TPTP AG Lead

REALIZE YOUR. DIGITAL VISION with Digital Private Cloud from Atos and VMware

InfoSphere Master Data Management Reference Data Management Hub Version 10 Release 0. User s Guide GI

IBM Social Rendering Templates for Digital Data Connector

Dominique Blouin Etienne Borde

Rover Use Case, Specification, design and implementation using Polarsys Tools: Capella, CDT, Gendoc, Papyrus and ReqCycle

Copyright Oracle Corporation and VMware Inc, Enterprise Modules Project (Gemini) Proposal

XF Rendering Server 2008

JD Edwards World. Service Enablement Guide Release A9.3 E

Transcription:

1 Introduction DOC2MODEL This document is a docuware for creation review and describes the doc2model project. 2 Aim The Doc2Model (Document to Model) framework is a proposed open source component under the Eclipse Modeling Framework Technology project for parsing structured documents (e.g., xlsx, docx, odt, odf...) to produce EMF models. It is in the Project Proposal Phase (as defined in the Eclipse Development Process document) and is written to declare its intent and scope. This proposal is written to solicit additional participation and input from the Eclipse community. You are invited to comment on and/or join the project. Please send all feedback to the Eclipse Modeling Framework Technology (http://www.eclipse.org/newsportal/thread.php?group=eclipse.technology.emf) newsgroup with [doc2model] as a prefix of the subject line. 3 Background The most widely used tools in many organizations continue to be text processors and spreadsheets. Often these documents describe business data that are important to manipulate in other contexts. Examples of data contained in such documents include the following: Requirements Tests CRC cards Structure definitions Documentation generation Because these kinds of tools often produce plain text documents, it's typically quite complex and time-consuming to develop a specific parser able to produce output more amenable to further manipulation. Currently some organizations are investing effort to publish specifications of open source file formats, for example Office XML (e.g., docx, xlsx...) and Open Document (e.g., odt, odf...) to facilitate widespread adoption and easier consumption. In fact, most of the business documents are organized with data defined in a common way, (top down for example for text documents) using text style, regular expressions, and column numbering. As such, it's possible to support a generic solution for parsing those documents and transforming the business data into EMF models, using XML parsing and EMF's reflective capabilities. 4 Scope

This project will provide an extensible framework for producing EMF model instances from plain text and structured documents. Transforming a business document into an EMF model will facilitate more opportunities to exploit the business data contained in such a document. In some cases documents represent the specification of a system. Instead of retyping information to produce the corresponding model it will be possible to generate it. Doc2Model can be used to, for example, to import requirements from text files and transforming them into SysML requirements models. The documents file formats which will be managed by Doc2Model include Open source formats as docx, xlsx, odt, odf; Common formats as csv; And formats desired by the eclipse community. The Doc2Model API will provide extension mechanism to allow users to add custom parsers for specific tools. These Parsers could be contributed to Doc2model component if the license is compatible with EPL. 5 Description The target model type is specified using a configuration model which describes how the data is identified during the parsing. This configuration is a map indicating what the generator does when a matching rule is applied. Matching rules make use of regular expressions, special styles, columns (spreadsheet), and tags. Transformation proceeds as follows: 1. Read the matching configuration. 2. Analyse (parse) the input document to identify matching data base on the rules. 3. Produce an output EMF model instance from the data recognized into the input document. Cross references between the data is supported and additional data can be injected. 6 Sample input user document

doc2model mapping

result after execution

7 Relationship with Other Eclipse Projects/Components Doc2Model will be built on top of EMF. Doc2Model will exploit EMF Compare to obtain differences between models. 8 Third party libraries no third party librairies, standard java parsing. 9 Code Contributions Topcased is offering doc2model as an initial codebase (see http://gforge.enseeiht.fr/projects/doc2model). o The code is hosted at : http://gforge.enseeiht.fr/projects/doc2model o There is no existing domain names associated with this code. o There is no trademarks applied to the project name. o Doc2model was hosted under EPL license. o present contributors and committers: Benjamin Marconato: benjamin.marconato@atosorigin.com Thibault Landré: thibault.landre@atosorigin.com Emilien Perico: emilien.perico@atosorigin.com David Ribeiro Campelo: david.ribeirocampelo@atosorigin.com Tristan Faure: tristan.faure@atosorigin.com a flash demo of current version is available here, http://www.4shared.com/file/123030374/bd993fe4/doc2model.html 10 Organization 10.1 Committers Tristan Faure (Atos Origin), tristan.faure@atosorigin.com Tristan Faure is a software engineer at Atos Origin. Working for 3 years with Eclipse Technology and modelling (EMF, UML...). He is also an EMF, OCL and ATL consultant. Commiter and technical manager for TOPCASED open source project, he makes training about TOPCASED platform for international Atos Origin teams. Emilien Perico (Atos Origin), emilien.perico@atosorigin.com Emilien Perico is a software engineer at Atos Origin, working for 3 years with Eclipse platform and modeling technologies such as UML2, GEF and GMF and model transformation tools. He is a committer on TOPCASED project, has been in charge of the platform build and packaging. He also worked on projects about tests generation and document generation. He is now a committer on the Eclipse MDT Papyrus project.

Werner Keil, werner.keil@gmx.net Werner Keil is currently Eclipse RCP Develper and Consultant at mobilkom / Vodafone after having worked for governments or Fortune 500 companies like Daimler, Nokia, BEA/Oracle, GE, Shell, Commerzbank, Capita, Credit Suisse or Legal & General. He has worked for more than 20 years as project manager, software architect, analyst and consultant on leading-edge technologies for Banking, Insurance, Telco/Mobile, Media and Public sector. Among earlier clients are Sony Music where Werner designed and implemented micro-format based tags for their online music portal. He develops enterprise systems using Java, JEE, Oracle, IBM or Microsoft, does Web design and development using Adobe, Ajax/JavaScript or dynamic languages like Ruby, PHP, etc. Besides work for major companies he runs his own creative, talent and consulting agency Creative Arts & Technologies. In his spare time, Werner runs and supports open-source projects, writes song lyrics, novels, screenplays and technical articles. He is committing member at Eclipse Foundation, Babel Language Champion (German) and active member of the Java Community Process, including his role as JSR-275 Spec Lead, JavaEE 6 EG and Executive Committee Member (SE/EE). 10.2 Mentors Ed Merks, ed.merks@gmail.com Cedric Brun, cedric.brun@obeo.fr 10.3 Interested Parties Atos Origin, http://www.atosorigin.com/ Obeo, http://www.obeo.fr/