Migration of SAS Software From VMS to Windows NT : A Real Life Story

Similar documents
Inmagic Content Server v9 Workgroup Configuration Technical Guidelines

The Submission Data File System Automating the Creation of CDISC SDTM and ADaM Datasets

Sage Compatibility guide. Last revised: August 20, 2018

Sage 300 ERP. Compatibility Guide Version Revised: Oct 1, Version 6.0 Compatibility Guide i

Running FileMaker Pro 5.5 on Windows 2000 Terminal Services

Autodesk Revit Structure 2012 System Requirements and Recommendations. Minimum: Entry-level configuration. Operating System Microsoft Windows 7 32-bit

Liaison ECS. Architecture. Introduction

AccessVia Publishing Platform

Perceptive TransForm. Technical Specifications. Version: 8.x Compatible with ImageNow 6.5.x to 6.7.x and Perceptive Content 7.x

Administrator s Guide

System Requirements and Implementation Options

EMC Backup and Recovery for Microsoft Exchange 2007

Cisco Unified Provisioning Manager 2.2

Local Area Network (LAN) Deployment Hardware Requirements

Evaluating Client/Server Operating Systems: Focus on Windows NT Gilbert Held

Pension System/Windows. Installation Guide

SCLIntra Enterprise Product Summary

Sage ERP Accpac. Compatibility Guide Version 6.0. Revised: November 18, 2010

Sysgem Enterprise Manager

Sage ERP Accpac. Compatibility Guide Version 6.0. Revised: February 2, Version 6.0 Compatibility Guide

Control printing and copying effectively with uniflow Output Manager

WPS Workbench. user guide. "To help guide you through using the WPS user interface (Workbench) to create, edit and run programs"

Microsoft Office SharePoint Server 2007

Providing Users with Access to the SAS Data Warehouse: A Discussion of Three Methods Employed and Supported

Geneva 10.0 System Requirements

APOGEE Insight Advanced Workstation

System Requirements. SAS Profitability Management 2.3. Deployment Options. Supported Operating Systems and Versions. Windows Server Operating Systems

Adept 8/8.1 System Requirements

FREQUENTLY ASKED QUESTIONS FOR VERSION 4.0

Pension System/Windows. Installation Guide

Rapid SQL 7.5 Evaluation Guide. Published: September 28, 2007

VERITAS WinINSTALL 2000

Sage ERP Accpac. Compatibility Guide Versions 5.5 and 5.6. Revised: November 18, Compatibility Guide for Supported Versions

Performance Considerations

March MomentumPro V3.1 Technical Specification Guidelines

INSTALLATION AND SET UP GUIDE

EnterpriseLink Benefits

Paragon Protect & Restore

Technical Specification Data 1

Adobe Acrobat Connect Pro 7.5 and VMware ESX Server

The Impact of Disk Fragmentation on Servers. By David Chernicoff

D.N.A. 5.6 MANAGEMENT APPLICATIONS

DQpowersuite. Superior Architecture. A Complete Data Integration Package

Data Protection for Virtualized Environments

INSTALLATION AND SET UP GUIDE

Page 1 of 5. Rental Network Software Corp., Rental Management Software v9.0 (R90) Release Notes. Topics Covered:

Xytech MediaPulse Equipment Guidelines (Version 8 and Sky)

Quest Central for DB2

ViewNow X Server Setup Guide

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Risk. Systemystem. A SAS Datawarehouse on OpenVMS (AXP)

Veritas Storage Foundation for Windows by Symantec

Geographic Information Systems (GIS) - Hardware and software in GIS

IBM Internet Security Systems Proventia Management SiteProtector

Deltek Vision 7.4 Technical Overview & System Requirements: Advanced Deployment (150 or More Employees) 1/28/2015

System Requirements and Implementation Options

SAS Drug Development Program Portability

EMC Backup and Recovery for Microsoft Exchange 2007 SP1. Enabled by EMC CLARiiON CX4-120, Replication Manager, and VMware ESX Server 3.

Product Information for etrust Audit Components

2012 Microsoft Corporation. All rights reserved. Microsoft, Active Directory, Excel, Lync, Outlook, SharePoint, Silverlight, SQL Server, Windows,

Test Methodology We conducted tests by adding load and measuring the performance of the environment components:

TECHNICAL REQUIREMENTS

Enterprise print management in VMware Horizon

Oracle Field Sales/Laptop

Meridian 1 Attendant Hospitality Services Software Installation Guide

Symantec System Recovery 2013 Management Solution FAQ

Installation and User s Guide

MYOB Enterprise Solutions

Copyright 2009 by Scholastic Inc. All rights reserved. Published by Scholastic Inc. PDF0090 (PDF)

Database Server. 2. Allow client request to the database server (using SQL requests) over the network.

Creating Enterprise and WorkGroup Applications with 4D ODBC

The HP Blade Workstation Solution A new paradigm in workstation computing featuring the HP ProLiant xw460c Blade Workstation

Molecular Devices High Content Screening Computer Specifications

Notices. Third Party Project Usage. Sample Code in Documentation

The Impact of Disk Fragmentation on Servers. By David Chernicoff

Edexcel GCSE 2009 e-spec Introduction 2 Minimum requirements 2 Hardware 2 Software 2 How do I install the e-spec? 3 Installing e-spec on a network 3

Sage Estimating (SQL) v17.11

DesignPro Tools for Xerox Elixir Technologies Corporation. All rights reserved.

Sage 300 People & Web Self Service Technical Information & System Requirements

FOR NETWARE. Tested and. Approved

Phire Frequently Asked Questions - FAQs

VMware View Upgrade Guide

1. SALESLOGIX SYSTEM REQUIREMENTS

InfoPrint ProcessDirector V1.3.1 helps you manage print workflow with flexible control for efficiency and cost-savings

Construct a High Efficiency VM Disaster Recovery Solution. Best choice for protecting virtual environments

Pro Ecommerce Website Pack Fifteen pages. 60 initial products. Photography. Domain. VPS Hosting. SSL. Dedicated IP. .

Software Product Description

Purkinje Dossier System requirements. Version 5.0

SAS Data Integration Studio 3.3. User s Guide

Technical Brief: Specifying a PC for Mascot

Quick Start - NetApp File Archiver

PARTNER ACS R6.0 PC Administration Getting Started

Micro Focus Net Express

Using OptiView Console

IMPORTANT. Registration Settings: SERIAL NUMBER: COMPUTER ID: REGISTRATION NUMBER:

Best Practices for Implementing Autodesk Vault

Minimum Hardware and OS Specifications

VMware vfabric Data Director 2.5 EVALUATION GUIDE

PRODUCT DESCRIPTION. Version 0.01 (Alpha) By F. Scott Deaver Chief Technical Officer/Founder/Inventor February 27, 2017

Parallels Virtuozzo Containers 4.6 for Linux Readme

Transcription:

Migration of Software From VMS to Windows NT : A Real Life Story Paul Gilbert & Steve Light at DataCeutics, Inc., John Scott Grainger at ClinTrials Research Introduction At ClinTrials Research, Inc. clinical data is maintained in Ingress and Oracle data bases and extracted to for reporting and analysis. An Open VMS mainframe-based configuration is used on DEC VAX and AXP machines. In the VMS programming environment data, programs, output, and work libraries are resident on the VAX cluster and processing is performed on the DEC machines. With approximately 30 active programmers, the burden on the VAX system generated by programming is significant. To reduce the processing load on the VAX it was decided to move processing off the mainframe while retaining central storage of programs and data on the VAX cluster. The target environment that was selected for the migration was a network of Intel-based PCs running Windows NT 4.0 over a 100 MB LAN. This paper describes the specific goals of the project to move processing into NT, the selection of a client/server configuration, and the selection and implementation of tools supporting the distributed environment. Goals The primary goals of the project were as follows: 1) Implement a Windows-based programming environment, with desktop processors running locally to maximize processing distribution. 2) Maintain or reduce the cost of licensing the System software. 3) Provide the ability to easily, quickly and cheaply add more computing power to the programming environment. 4) Distribute processing across multiple locations to level the work load. This was addressed by including additional Pentium Pro dual processor servers dedicated to running programs. 5) Facilitate code and data management through storage of all code and data in shared, centralized directory structures. 6) Allow for a configuration which is scaleable for a small or large work group. 7) To maximize flexibility during the transition from VMS to NT, allow programmers to edit and run programs from either NT or VMS, using one set of data and programs. Migration Strategy The migration to NT was planned to occur in stages starting with implementation of the full NT infrastructure, including installation of a 100 MB network, NT workstations, Paths drives and software configuration. Once the NT environment was in place a single project involving a small number of programmers was implemented in NT. At the time of completion of this paper the initial project was underway, with a review planned following project completion. Any problems encountered in the initial project will be addressed before further implementation of the NT programming environment. Results of the initial project will be reported in the conference presentation of this paper. Following the initial project migration from VMS to NT will take place on a project-by-project basis. Projects already underway in VMS may be retained in VMS or moved over to NT, based on the expected duration or project-specific requirements. New projects will be implemented in NT. Selecting the distributed computing configuration A number of client/server configurations were considered before final selection of a two-tiered 1

model with work group process servers, shown in Figure 1. In this configuration data libraries, catalogs, and associated code and output are stored in Windows format on Paths drives on the VAX cluster. Paths drives can be accessed from either NT or VMS, allowing programmers to work with the data and programs from either platform. /CONNECT Software Remote Library Services is used to access the NT format data from VMS. Programs can be edited and run, output printed, etc. from either VMS or NT. /ACCESS is used to access Oracle and Ingress data. Two Tier Client/Server with Group Process Servers Windows NT Process Server Stack DEC Alpha AXP 8400 Software Data Libraries Code Clinical Database (Oracle & Ingress Software including /Connect and /Access library on 10 GB RAID level 3 drive VMS Server Specifications DEC Alpha AXP 8400 and DEC VAX 7720 Open VMS Software including /Access and /Connect Paths drives where all data, programs and output are stored The distinctive feature of this configuration is the use of workgroup process servers. These servers were introduced into a conventional two-tiered configuration to provide a platform for execution of resource-intensive programs that may exceed the capabilities of the standard workstation. The process servers have dual processors, increased memory and a large amount of disk space available for work libraries. programs are submitted for remote execution on the process servers using a third party scheduling tool, Seagate Ashwin. Ashwin provides the capability to essentially submit programs in manner analogous to a batch queue in VMS. Ashwin supports scheduling of programs for delayed execution, or repeated execution according to a calendar-based schedule. Ashwin automatically balances the load of executing programs across the pool of process servers. Figure 1 DEC Vax 7720 Specifications of the selected configuration include: Client station 200 MHz Pentium Pro, single processor 128 MB RAM Microsoft Windows NT station 4.x Software library on dedicated 2 GB SCSI drive group Process Server 200 MHz Pentium Pro, dual processor 128 MB RAM Microsoft Windows NT Server 4.x Extracting Ingress and Oracle Data into Datasets A combination of /Access and /Connect Remote Library Services is used to extract data. /Access for Oracle and and Ingress allow use of pass-through SQL directly to Oracle via SQL*NET. The resulting datasets are stored on the VAX Paths drives in Windows format. Accessing data and programs from NT and VMS data and programs are centrally located on a set of VAX-based Paths drives. These drives can be accessed from Windows NT or from VMS, allowing use of a single set of data and programs from the two platforms. This provides for maximum flexibility during the period of transition from VMS to NT. Either environment may be used to edit or execute programs, and to print output. The datasets are stored in Windows format. Using the datasets in Windows format from 2

VMS requires use of /Connect Remote Library Services. To allow use of individual programs from either NT or VMS, data libraries are allocated via a standard macro that assigns all libname statements. Programmers do not write libname statements in their programs. For programs executed from NT the data access macro determines the current operating environment and assigns libname statements with standard syntax, there are no special requirements. If the program is being executed from VMS an active /Connect session and a current remote system option are required. The remote option is assigned as part of initiation of a /Connect session and holds the name or address of the remote session being connected to. If remote has no current value the data access macro is terminated and the user is notified that a /Connect session must be active to access data from VMS. If the remote option has a valid value, libname statements are assigned to use Remote Library Services using the server=<remote> qualifier. Remote Library Services does not allow crossplatform access of Catalogs. This is an issue at CTR primarily for format libraries. To support execution of programs in both NT and VMS it has been necessary to maintain format libraries in both Windows and VMS formats. Maintenance and synchronization of the two copies of the catalogs is an ongoing issue. Tools supporting the NT programming environment Whereas VMS provides adequate support for editing, batch job execution and scheduling, Windows NT does not meet the requirements for a large programming shop. Part of the move to NT required selection of tools for use by programmers. Program Editors A text editor for program development is the most basic requirement for the NT programming environment. The text editor supplied as part of Windows NT, Notepad, is not adequate for use as a program editor. Fortunately, there is a wealth of third party program editors available for NT. Several options were considered including: the AWS environment, formerly referred to as Display Manager the System Viewer full featured professional programmer s editors such as CodeWright low priced text editors, generally shareware products The AWS was not considered to be the best program development environment for several reasons, including limited editing features and a restriction to a single program editor window. While the AWS can be a convenient way to code and run quickie programs, it was felt that a more robust and flexible editor was desirable. The System Viewer, supplied as a no cost standalone application, can be used as a text editor. This was not considered to be adequate due to a very limited number of editing features. Professional programmer s editors were found to be very powerful and loaded with features, but were expensive and somewhat complex to work with. Many of the included features were not of interest to programmers and only made them more difficult to use. Many low priced text editors are available for use in NT. These editors are generally very powerful and easy to use, providing many editing features which were not available at all in VMS. Significant editing features available in several of these editors include: unlimited number of open edit windows syntax highlighting with color drag and drop from the Windows Explorer auto-indent of code bookmarking of file positions file comparison multiple undo/redo of last edit action code template storage/insertion Three editors were selected for evaluation with the goal of identifying a single standard editor for use by all programmers. The editors evaluated included: WinEdit (Wilson Windowware), UltraEdit (IDMComp) and KEDIT (Mansfield Software). At the time of completion of this paper a final selection had not been made. Batch Processing Production jobs at CTR require execution of sequences of programs in a controlled order. These jobs can become quite large, and sometimes require several days for execution. Within the VMS environment, scheduling and control of production 3

jobs has been handled by use of DCL scripts and batch execution queues. Our strategy for handling the large production batch jobs in NT was through use of the workgroup process servers. Programmers submit production jobs to execute on the servers. This keeps their workstations clear and provides greater, dedicated resources for processing the jobs. One of the goals of the migration was to introduce a mechanism for balancing the server processing load which could be easily extended as additional servers were added to the system. NT 4.0 does not provide powerful scripting tools or scheduling tools such as the programmers were accustomed to using in VMS. Additionally, there is no equivalent to VMS batch queues which would allow programmers to submit large programs to run on remote processors. These are significant gaps in the NT environment which require third party solutions. As part of the initial project in NT Seagate Ashwin is being evaluated as a tool to provide job scheduling, batch processing and process load balancing. Ashwin enables users to submit (or schedule) individual programs or groups of programs for batch execution on the workgroup process servers. It provides scheduling facilities that allow users to define dependencies between jobs or schedules to control the sequence of program execution, and can provide email notification on success or failure of jobs. This functionality approximately substitutes for the batch queue facility and DCL programming done in VMS. Ashwin is capable of managing executable programs such as, as well as NT command files (BAT or CMD files). Command files are useful for automating time consuming tasks such as copying files or printing large volumes of output. Using Ashwin features command files can be set to run conditionally based on the success or failure of a production job. For example, a command file which prints all output from a large production job can be set to execute only upon successful completion of all components. Ashwin also provides load balancing by monitoring a defined set of networked machines and directing scheduled jobs to execute on the first machine available. Load levels allowed on each machine can be adjusted as needed and servers can be easily added or removed from the list of available machines. This provides an easily scaleable batch processing configuration, meeting one of the primary goals of the project. Printing Output generates text output as unformatted text files. Printing output for production work at CTR requires standardization of font, font size, page orientation, margins, etc. In cases where only a few files need to be printed it is a relatively easy matter to load each file into an editor or word processor, assign the formatting information and print from that application. However, the large number of files generated at CTR makes this method impractical. A batch method of printing files with accurate formatting information (that does not require a programmer to load each file into an interactive application) is required. NT does not provide good support for this kind of direct printing, leading us once again to look for third party solutions. The features of the ideal printing solution for text output in NT might be as follows: Allows command line or interactive execution. Uses an INI file to assign the target printer and page formatting information. This would allow one to store separate INI files with different page formats. Capable of printing multiple files as a single job in an order specified by the user. Capable of attaching network resources (printers). Allows interactive selection of files to be printed in an Explorer-like interface. Capable of reading a text file with an ordered list of files to print. Generates log of print history including files printed, date and time, target printer, INI file in use and success or failure of the print job. Unfortunately, if such a utility exists we have not found it. At the time of completion of this paper a shareware utility, CodePrint for Windows 95 and NT (Micro Software Solutions) is being evaluated. This utility provides most of the desired functionality, but does not have the capability of reading a text file with an ordered list of files to print and does not generate log files. Code Management Part of the initial project in NT included an evaluation of use of a code management system for 4

programs. In an effort to gain greater control over the program development process and to track project status Microsoft SourceSafe was implemented to manage all programs. SourceSafe stores source code in a central repository and tracks changes made to program files. This application provides a number of benefits to the programming environment including control of write-access to files tracking of modifications to files review file history revert to earlier file versions track which files are checked out for editing, current owner of file ability to label a project with a version number and to later restore entire project as a single operation Conference Presentation The conference presentation of this paper will discuss ongoing developments, results of our initial project and our experiences with moving more of the production environment from VMS into NT., /ACCESS and /CONNECT are registered trademarks or trademarks of / Institute in the USA and other countries. management, /PH-Clinical implementation, programming and clinical data management. Paul and Steve can be reached at or email DataCeutics, Inc. 244 High St Suite 302 Pottstown, PA 19464 gilbertp@dataceutics.com lights@dataceutics.com Scott Grainger is Vice President, Information Systems for ClinTrials Research, Inc., an international contract research services organization with approximately 1500 employees providing comprehensive preclinical and clinical research services including monitoring, data management, biostatistics, medical and regulatory to pharmaceutical, biotechnology and medical device clients. His 25 years of experience is in operations, systems management, application development, system deployment, and information technology planning in a variety of clinical environments. indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies. Biographies Paul Gilbert is Vice President of DataCeutics, Inc., a consulting group specializing in software solutions, system integration, programming and support in the areas of clinical data management and statistical reporting. His fifteen years of experience includes clinical data management, implementing and maintaining BBN/Clintrial, designing based biostatistics reporting systems, managing based NDA programming support, /PH-Clinical implementation and CANDA integration. Steve Light, a Project Manager at DataCeutics, Inc., is responsible for report macro development and biostatistics/clinical project management. His twelve years of experience with clinical information systems include systems development and validation, NDA driven project 5