Global Technology Services The IBM Resiliency Model Business Continuity & Resiliency Services Site & Facility Services 7 June 2007 - Lux Tech - Luxembourg Erwin Husmann Graham Aldridge The IBM Resiliency Model 11 June 2007 2006 IBM Corporation
Let us introduce a case A client has a datacenter fully operational. A new, large scale database solution has to be designed, installed and managed. The solution includes: Large number of Wintel and Unix servers Storage equipment Lotus Notes The solution must have a 99.999% uptime for the coming three years. Question: What issues should be discussed to achieve this solution? 2 The IBM Resiliency Model 11 June 2007
Business Resilience definition Business Resilience is the ability to rapidly adapt and respond to risks and opportunities, in order to maintain continuous business operations, be a more trusted partner, and enable growth. 3 The IBM Resiliency Model 11 June 2007
Computer prediction in 1964 (for the year 2004) 4 The IBM Resiliency Model 11 June 2007
In the beginning The first data centers were not designed, they were open areas to place these machines called computers No networking yet, computers were confined to these central rooms Organisation and processes were relatively simple 5 The IBM Resiliency Model 11 June 2007
Next phase Servers were developed requiring special environmental needs. Cooling & Humidity control The personal computers were introduced, IT spreaded to the workplaces Systems were interconnected, requiring a little more upfront planning and control Support and processes were becoming necessary 6 The IBM Resiliency Model 11 June 2007
Data center lifecycles Data Centers were built with a life cycle of 10+ years Desktops and laptops became powerfull enough to handle their own computations IT became part of everything and more systems were being run by computers More complex technology But no one thought about what was coming 7 The IBM Resiliency Model 11 June 2007
Data center lifecycles Then, the density age came not too long ago Or some call it the faster, hotter, power hungry age, power consumption has increased 7-fold in the past 7 yrs IT is intergrated in every business process, nothing works if the IT fails We were caught, not being ready to handle these new demands 8 The IBM Resiliency Model 11 June 2007
Inexpensive dense computing and increasing power costs are shifting requirements and spending What is the greatest facility problem with your primary data center? Excessive heat Insufficient power Insufficient raised floor Excessive facility cost Poor location None of the above Gartner 2006 0% 5% 10% 15% 20% 25% 30% 35% Power and cooling will be a top 3 issue with all CIOs in the next 6-12 months Michael Bell Gartner Group Power and cooling costs will increase to more than one-third of the total IT budget Robert Frances Group, January 2006 $80 $70 $60 $50 $40 $30 $20 $10 Power and cooling exceeds server Spending IDC 2006 Spending (US$B) $0 New server spending Power and cooling 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 Installed base (M units) 18 16 14 12 10 8 6 4 2 9 The IBM Resiliency Model 11 June 2007
And can everyone still be reached from the Voice outside? Recovery ACD Quickship Client Production Location LIVE AGENT Interim Call Handling Customers IBM BCRS Recovery Center ACD Telephone Sets Open Agent Telephone Workstations Supervisor & Reporting Functions Public Network Advance Agent Location: Customer Site IBM BCRS Mobile Recovery Center IBM BCRS ACD Servers Advance Agent Location: Open Agent Telephone Workstations Supervisor & Reporting Functions Phone Server Voice Services Module Home 10 The IBM Resiliency Model 11 June 2007
Applications becoming complexer, data becoming more critical, and everything needs to be more secure Data growth Growing every minute Applications developing Integrated Data becoming more critical Growing need to archive Back up & restore Back up windows shrinking Online methods becoming easier and cheaper Restore needs to be faster Virtualised Distributed Security Outside threats Data theft Longterm archiving 11 The IBM Resiliency Model 11 June 2007
But how does the workforce continue working? Workspace, Chair, and Telephone Copy Mail Office Environment F A X Technology PC Servers PC Workstations ACD Agents Printers LAN/WAN Connectivity Recovery Support Team Transportation Accommodations Restaurants Shopping Etc. Meeting Space, Office Space, Lounge 12 The IBM Resiliency Model 11 June 2007 Amenities and Services Nearby
Where should you go? Everything is connected, a good strategy addresses all issues. Can I overcome a business outage? After how much time will I loose business? Are all business processes evenly important? Do I want long term continuity, or short term profit? Should I make very strict security rules for maximum safety, or can I expect a self regulating workforce? Is my organisation up to the task, or should I outsource? I need to understand my risks and impacts if I m without my business processes but don t know where to get started. 13 The IBM Resiliency Model 11 June 2007
Recent events are driving the customer need for Business Resilience solutions. Infrastructure- and business-based interdependencies, Strategies that deal with the data explosion Anticipation of a primary facility loss Flexible architecture with ample capacity for growth Contingencies for potential personnel loss or unavailability Integration of dedicated solutions into the strategy Impacts of an Internet outage Compliance with industry regulations Customer expectations of 24x7 availability Integration of recovery, continuity, high availability (HA), and security 14 The IBM Resiliency Model 11 June 2007
IBM Business Continuity and Resiliency Seminar And it does go wrong, very often: December 2006 November 2006 A customer s switch implodes in the data center Power failure on a Single Point of Failure shuts down a data center of a large bank December 2006 Salt in a humidifier sprayed high concentrations of salt through a data center January 2007 Violent storm disrupts normal life in Europe November 2006 Large Dutch insurance company loses a data center We handle small and large disasters on a weekly basis 15 The IBM Resiliency Model 11 June 2007
A well designed, tested and maintained Business Continuity Plan is critical for the coming years A 10% increase in the number of power related failures means that 90% of companies will see data center operations halted in the next 5 years. Within the next five years, one out of every four data centers will experience a business disruption serious enough to affect the entire company s ability to continue business-as-usual By 2010, nearly 70% of all data centers will utilize some form of grid computing or other virtual processing Source: DataCenter Institute 16 The IBM Resiliency Model 11 June 2007
Evolution from Recovery Solutions to Business Process Availability and Resilience Resilience Virtual computing, Integrated business applications, Systemic operations Business Continuity Multi-platform computing, Extended business applications, Network centricity, Global presence Business Recovery Distributed computing, New business applications, Extended networking, Multiple locations Disaster Recovery Disaster Recovery Mainframe centric computing, Internal application support, Internal networking, Single physical location 1970 1980 1990 2000 2005 200? The Changing Dynamics Recovery Times Risk Profiles Approach to Planning Regulatory Pressures Online Computing Technology Capability Availability Requirements 17 The IBM Resiliency Model 11 June 2007
IBM Business Continuity and Resiliency Seminar Six Key Layers of an Enterprise that Must Be Addressed to Achieve Business Resilience Strategy and Vision Processes Applications and Data Technology Business Resilience Organization Facilities 18 The IBM Resiliency Model 11 June 2007
The IBM Business Resilience Lifecycle Determine potential risks Prioritize business impact by risk Evaluate current resiliency capabilities Design a resilient architecture Manage to resilience objectives Plan for Business Resilience Rehearse and review resilience program Implement the architecture 19 The IBM Resiliency Model 11 June 2007
BCRS / SFS Services and Service Products Service product group Service product line IT strategy and architecture services Service IBM Business Continuity Services Service product Business continuity assessment Business continuity program management Infrastructure access services Infrastructure management services Middleware services End user services Integrated communications services Business continuity & resiliency services Security and privacy services Site & facilities services IBM Regulatory Compliance Services IBM High Availability Services IBM Data Center and Facilities Strategy Services IBM IT Facilities Assessment, Design and Construction Services IBM IT Facilities Consolidation and Relocation Services Compliance and governance assessment Compliance and governance planning and design High availability assessment High availability planning and design Capacity services for high availability High density computing data center readiness assessment Data center strategy and plan Thermal analysis for high density computing Power/environmental protection services Data center relocation and consolidation Infrastructure support services Server services Storage and data services Maintenance and technical support services IBM Disaster Recovery Services IBM Crisis Management Services Data center recovery Workplace recovery Disaster recovery validation and testing Crisis management assessment Crisis management planning and design Crisis response 20 The IBM Resiliency Model 11 June 2007
Example case: Dutch insurance company (1) Customer issue: all important business processes should be up and running in 48 hours after a disruption. Disaster recovery was not adequately arranged Recovery times reduced to 24 hours and 12 hours for part of business 21 The IBM Resiliency Model 11 June 2007
Example case: Dutch insurance company (2) First step IBM consultants and customer team drew up the requirements Data center scenarios set up High level design, solution outline Each part signed of / agreed by customer management 22 The IBM Resiliency Model 11 June 2007
Example case: Dutch insurance company (3) Interim step for Asset Management / Bank: First disaster recovery test AM/B conducted: FAILED. Design of Fast track (to bring down recovery times High Availability). Short term solution: IBM BCRS- Almere Long term solution: DR Site -> Arnhem 23 The IBM Resiliency Model 11 June 2007
Example case: Large Dutch insurance company (4) IBM Services provided: Consulting Scenario selecting Process support (how to recover) Business Continuity Manager Data Center Services Data center assessment Data center design Data Center Disaster Recovery System z System p High Availability Services System x (Lotus Notes) Workplace Recovery 400 standard workplaces Financial workplaces (new service for IBM NL!) Call center recovery (Extra functionality) 24 The IBM Resiliency Model 11 June 2007
Service outline and positioning: High Availability Services Mirrored iseries systems Continuous online data replication Data available on primary and backup system Backup tapes can be made from the backup system Use of backup system for production purposes in case the primary system is upgraded Data loss reduced to a minimum Recovery process down to minutes instead of hours Enlarged processor capacity backup system during disaster recovery operations 25 The IBM Resiliency Model 11 June 2007
Characteristics IBM High Availability Services (1) _ 1. User sends data to server 2. Primary system sends data to backup system 3. Primary system fails 4. User sends data directly to backup system WAN VPN Tunnel / Dedicated line VPN Tunnel / Dedicated line Primary system Backup system 26 The IBM Resiliency Model 11 June 2007
Characteristics IBM High Availability Services (2) IBM takes care of: Installation mirroring software Management of software, backup system and mirroring process Regular tape backup (from backup system) Expanding the backup system during disaster recovery Fully supported yearly switch over test You need to take care of: Primary system Network connection with the IBM recovery center Notification of a calamity Change management Processor capacity Disk capacity Normal situation 27 The IBM Resiliency Model 11 June 2007 Disaster recovery situation
Differences between High Availability and traditional disaster recovery High Availability Online replication process Continuous backup Dedicated backup system, only need to expand processor capacity Disaster recovery Recovery from tapes Backup once per day Backup system only available after disaster notification Data loss minimal Recovery time in minutes Data loss up to 24 hours Recovery time in hours (up to days) 28 The IBM Resiliency Model 11 June 2007