Predictive Maintenance in a Mission Critical Environment

Similar documents
Vibration analysis goes mainstream

All Rights Reserved - 1

Mark Bitto / ABB Inc / PSPG / CIBO Technical Focus Group / June Instrumented and Actuated Systems Condition Monitoring of Rotating Equipment

VERTIV SERVICE CAPABILITY

5 STEPS for Turning Data into Actionable Insights

CSI 9330 Vibration Transmitter

Switchgear Survival Guide: Ten Tips to Optimize Switchgear and Enhance Reliability

AMS Suite: Machinery Health Manager 5.2 and 5.3 New Features and Enhancements

Applying handheld test tools to preventive maintenance

Welcome. Hardeep Singh Service and Parts Director Trane Middle East & Africa

CSI 6500 Machinery Health Monitor Chassis Options

Fair City Plaza. Task: Diagnose problems, recommend improvements for small municipal district cooling system

ЗАО «БЕРГ АБ», (495) , Thermal cameras. Detect hot spots before they cause trouble

Valve and Case Expansion Monitor Specifications

1.0 Executive Summary

POWERING NETWORK RESILIENCY WITH UPS LIFECYCLE MANAGEMENT

Schneider Electric Critical Power & Cooling Services. Services to keep your mission-critical applications operating at optimal performance

Data Center Assessment Helps Keep Critical Equipment Operational. A White Paper from the Experts in Business-Critical Continuity TM

ZKLWHýSDSHU. 3UHð)DLOXUHý:DUUDQW\ý 0LQLPL]LQJý8QSODQQHGý'RZQWLPH. +3ý 1HW6HUYHUý 0DQDJHPHQW. Executive Summary. A Closer Look

Assets Condition Monitoring Using ISA100.11A Wireless System. Standards Certification Education & Training Publishing Conferences & Exhibits

SE Engineering, PC strives to be a leader in the power system engineering field by providing our customers with the highest level of quality,

PowerVault MD3 Storage Array Enterprise % Availability

Energised diagnostic testing. Maximise your uptime.

THE ROLE OF INFRARED TESTING AT DATA CENTERS

Service Level Agreement

AMS 6500 Machinery Health Monitor Chassis Options

Powering Change in the Data Center. A White Paper from the Experts in Business-Critical Continuity TM

CSI 2600 Machinery Health Expert

SCADA Solutions for Water and Wastewater Treatment Plants

Power Quality of Commercial Buildings - Advanced

COST EFFECTIVE PREDICTIVE MAINTENANCE SOLUTIONS

The Future. Today. YZ MAGNETIC BEARING CENTRIFUGAL CHILLER

Pan Gulf Industrial Systems-RITEC

Power System Vulnerabilities : AHCA 2016 Copyright (c) 2016 SSR, Inc. All rights reserved

AMS 6500 ATG. Overview. API 670 compliant TSI protection system

DCIM Data Center Infrastructure Management

Provided services catalogue

Help your Cisco customers defend against downtime and reduce costs

Reasons to connect your tools

Remote Monitoring. for Building HVAC Systems. A unique combination of human expertise and technology helps ensure optimum operation.

Rosemount Transmitter Diagnostics Reduce Maintenance Costs

02 Why Does My IT Infrastructure Need a UPS?

Ovation Machinery Health Monitor for the Power Industry

M-series Foundation Fieldbus I/O

Tech Byte 20: The Benefits of Power Switching in Dual Bus UPS Designs

Anomaly Detection Solutions for Improved Equipment Availability

Remote Control Thermal Imaging Camera System - 12V - Wireless or Wired Remote - 6" LCD Monitor

1.0 Executive Summary. 2.0 Available Features & Benefits

Introduction. Delivers four ports to provide increased input/ output per card. Takes advantage of all smart device capabilities

WHY BUILDING SECURITY SYSTEMS NEED CONTINUOUS AVAILABILITY

Triax Accelerometer for Route-Based Vibration Analysis

DIA TECH Monitoring and Diagnosis System

2016 COOLING THE EDGE SURVEY. Conducted by Vertiv Publication Date: August 30, 2016

Network Performance, Security and Reliability Assessment

Executive Summary. Zuma Engineering and Research. Data Center Virtualization Assessment. Service Description

GE SmartSignal powered by Predix

Data Loss in a Virtual Environment

A Mission Critical Protection Investment That Pays You Back

CRITICAL INFRASTRUCTURE ASSESSMENTS

Symantec Security Monitoring Services

7 Best Practices for Increasing Efficiency, Availability and Capacity. XXXX XXXXXXXX Liebert North America

EPSON PreferredSM Limited Warranty Program for the Epson Stylus

Liebert. icom Thermal System Controls Greater Data Center Protection, Efficiency & Insight

THECHARGEHUB.COM. User Manual. For Square & Round Models

Maintain, Repair, Rebuild, Protect. Solutions for Industrial Asset Maintenance and Reliability

Manual Vibration Meter PCE-VT PCE Instruments

Installation and Operation Back-UPS Pro 900

Service Level Agreement

Pathfinder AWV air-cooled screw chillers

DATA CENTER COLOCATION BUILD VS. BUY

Why the Threat of Downtime Should Be Keeping You Up at Night

Gain better understanding and insight into your plant data

FREQUENTLY ASKED QUESTIONS

BUSINESS CONTINUITY: THE PROFIT SCENARIO

Machine Condition Monitoring = Improved Manufacturing

The challenge is that BAS are often very complex with 1000s of data points to review. EMS Optimization Software

A Dell Technical White Paper Dell

Power Audit & Thermography Test

National Service Center

AMS 2600 Machinery Health Expert

Test Equipment Depot Washington Street Melrose, MA TestEquipmentDepot.com. Deployment Planning Guide

Trane mytest Water-cooled Chiller Performance Validation What you spec is what you get *

Liebert NXC. 10 to 40kVA Compact and Reliable Power in a Fully Integrated Packaged Solution

AMS Suite: Machinery Health Manager v5.6

VIBSCANNER Data collection & machine diagnostics

Back-UPS RS 550 Installation & Operation

RAID systems within Industry

The SCADA Connection: Moving Beyond Auto Dialers

Engineered Energy Solutions An Overview. Controls, Automation and Energy Conservation

Tech Byte 25: UPS Systems - Key To Ensuring Business Continuity - Part 1

Great Western Painting Assured Equipment Grounding Conductor Program OR Ground Fault Circuit Interrupter (GFCI)

Advanced Monitoring Solutions for Critical Electric Power Transmission & Distribution Assets

SYSCO HOUSTON HOUSTON, TEXAS

Staying Connected with Mobile Devices in the Power Sector

Manage power your way

CLEANROOM CERTIFICATION AND ACCEPTANCE PROCEDURES

Liebert : The Right Cooling For IT Environments

Support Policy and Service Level Commitment

Molded Case and Low-Voltage Power Circuit Breaker Health

(Project Name & Location)

Transcription:

CSI 2130 Predictive Maintenance in a Mission Critical Environment This paper discusses the need to proactively protect mechanical and electrical support systems in data processing centers and data storage facilities against unexpected failure that could cause an interruption in essential operations. By implementing predictive maintenance programs based on the use of advanced information gathering technologies, these facilities can maintain critical equipment at high performance levels and prevent failures that could cause unplanned downtime with catastrophic results.

Protecting mechanical and electrical support systems against unexpected failure Preventive maintenance is a misnomer! Periodic lubrication of mechanical equipment and other checks may not actually prevent a breakdown, because serious internal faults and electrical overloads can go unnoticed for a long time. A more effective maintenance program is necessary for support equipment that is critical to the uninterrupted operation of a data processing center or data storage facility. Motors, pumps, HVAC units, chillers, electrical control cabinets, and power distribution panels seldom fail without providing some kind of warning! More often than not, the signs of impending failure, such as elevated vibration levels or high internal temperatures, go unnoticed during routine tasks. Yet, the results can be catastrophic. Information about the health of rotating machinery and electrical equipment is often concealed in physical signals, which can be recognized using well proven techniques for vibration monitoring and analysis as well as infrared thermography. If properly interpreted, these hidden signs can pinpoint the nature, location, and even the severity of developing problems, so steps can be taken in time to avoid losses of data and money. While every data center backs up current information and maintains redundant systems to prevent the loss of irreplaceable data, the possibility of a faulty piece of support equipment shutting down your round-the-clock operation may never have been considered. Yet, the timely identification of a potentially bad bearing in a motor, pump, fan, or other mechanical equipment could avoid a catastrophic failure and save your operation thousands of dollars. Predictive Maintenance Sometimes called reliability centered maintenance, predictive maintenance goes considerably beyond the generally accepted rules for preventive maintenance, performing service only when necessary as dictated by the condition of the equipment. If a failure is imminent, that unit may need to be repaired or replaced immediately. If performance is not significantly degraded, it may be possible to delay repairs until a backup can be arranged to avoid lost time. Machinery health management carries this approach a step further by prioritizing each machine according to its significance to the operation. Then, greater care is given to the most important machines to be sure they keep running. For example, vibration monitoring is much more frequent, and in some cases continuous online monitoring may be installed, on essential equipment that must continue to operate reliably. On support equipment deemed to be less important, collection of vibration data can be done less often. Information generated by the monitors is collected and used by analysts and maintenance managers to predict when a problem is likely to occur. With that insight, maintenance can be scheduled when needed before the problem becomes severe enough to adversely affect the performance of the asset. Predictive maintenance has been proved in the process industries as a great way to improve maintenance effectiveness while reducing costs for repairs and unexpected downtime. Reactive and preventive strategies are generally reserved for equipment that s not process-critical and will cause little or no collateral damage if allowed to run to failure. As shown in Figure 1, implementing a predictive maintenance program based on monitoring the health of machines similar to those found in data centers and storage facilities dramatically reduced the risk of downtime by making timely repairs. In this graph, Priority 1 calls for immediate repair action, Priority 2 requires repair at the earliest opportunity, and Priority 3 allows maintenance to be scheduled when convenient. Reactive and preventive strategies (Priorities 4 and 5) are generally reserved for equipment that s not process-critical and will cause little or no collateral damage if allowed to run to failure. In all cases, identifying potential problems early resulted in better maintenance at a lower overall cost. 2

Figure 1 Measuring Problems Found with Vibration Analysis The Reliability Challenge In order to handle the core business and operational data of an organization in the information technology age, data centers must reliably process the transactions and store important electronic information that allow a modern business and government to operate efficiently. Critical mechanical and electrical support assets in server rooms, data centers, and their UPS systems include: HVAC units for temperature and humidity control Chillers for cooling or dehumidification Cooling Towers to reject heat from chillers or HVAC units Pumps that support the cooling equipment Engine-Generator sets as source of backup power Electrical control cabinets, motor control centers and switch gear Uninterruptible power supplies (UPS), automatic transfer switches Power distribution panels 3

Preventing Equipment Failures Vibration Analysis is a non-intrusive technology by which users attach a spectrum analyzer to machinery and record waveform signatures. Analysis of this data makes it possible to diagnose: Mechanical wear in bearings, belts, couplings, gears, and support structures Imbalances and misalignments Other defects such as lubrication failure, bent shafts, broken motor rotor bars, resonances, and many more. Emerson was contracted by a major financial institution to perform vibration analysis on critical rotating machinery at their data center and trading floor. Maintenance technicians in the facility were trained to collect vibration data using a portable CSI 2130 to record waveform measurements along a predetermined route of selected machinery. The data is dumped periodically via a secured Internet connection or e-mailed to an Emerson machinery health analyst who performs vibration analysis using the AMS Machinery Health Manager software. A monthly machinery condition report to the facility manager includes recommendations for repairs. The trend analysis chart (Figure 2) illustrates periodic measurements recorded on an HVAC unit at this facility over a fivemonth period. In March 2008, the initial data collection and analysis showed that the motor had a severe bearing defect, which was reported to the facility manager as critical, along with a recommendation to replace the motor at the earliest opportunity. Figure 2 Vibration Analysis Reveals Failing Bearing That unit continued to deteriorate until the repair actually took place during a regularly scheduled outage in June 2008. A catastrophic failure might well have occurred if the warning signs had not been recognized in time. As a result, unexpected downtime in this mission critical operation was averted. 4

Infrared Thermography is an effective means of identifying potential problems before an incident or failure occurs. Applying this technology in a data center environment can be extremely beneficial when trying to maintain system availability 24 hours a day, 7 days a week. This non-intrusive technology can identify abnormal thermal rises in electrical, mechanical, or structural components. A successful infrared thermography program will identify: Loose and or poor connections Unbalanced electrical loads Defective components Irregular surface temperatures The benefit of performing infrared thermography is its ability to identify problems before they impact the facility. A tripped circuit breaker may protect equipment, but the subsequent downtime could have disastrous consequences within a data center. A Meta Group study identified the hourly impact on different organizations to system outages. (Figure 3) It s clear that any downtime in the data center of a retail brokerage comes with a very expensive price tag. With this amount of money at stake, infrared thermography should be a key component in the maintenance routine of any data center, especially those serving retail brokerage firms. Type of Business Average Hourly Impact Retail brokerage $6,450,000 Credit card sales authorization $2,600,000 Home shopping channel $113,750 Catalog sales centers $90,000 Airline reservations centers $89,500 Cellular service activation $41,000 Package shipping services $28,250 Online network connect fees $25,250 ATM service fees $14,500 Source Meta Practice, 25 February 2004 Figure 3 Financial Impact of Data Center Downtime Emerson Network Power was contracted by a major financial institution to perform an infrared thermographic survey on all electrical connections within their facility. The survey included electrical and mechanical equipment from the building s utility entrance through the branch circuit breakers within the data center and trading floor. Qualified maintenance technicians performed the survey using an infrared camera in accordance with the procedure set forth in the Standard for Maintenance Testing Specifications for Electrical Power Distribution Equipment and Systems by the International Electrical Testing Association. Scanning the electrical distribution panels that feed the IT equipment on the data center floor revealed a problem (Figure 4) where the camera indicated a temperature 140.2 F. Furthermore, current readings of the individual conductors revealed that Phase A of the circuit was carrying 120Amps, or 96 percent of the rated line capacity of 125Amps. The National Electrical Code 102-22(c) states a continuous load shall not exceed 80 percent of the branch circuit rating. 5

Figure 4 Infrared Camera Readings on a Data Center Floor The Emerson technician recommended the loads in the panel be distributed immediately to other circuits. If the load should exceed 125Amps, the circuit breaker will likely overheat and trip open, causing the panel fed from this device to lose power, as well as all the IT equipment fed from that panel. Depending on circuits fed from that panel, part or all of the data center s electronic equipment could be shut down due to that overheating situation. That disaster was averted by the timely infrared scan. Conclusion With no room for downtime, data centers need a strategy to ensure that supporting assets are reliable. Deteriorating machinery may cause severe problems, which can be avoided by utilizing advanced predictive technologies in a mission critical environment. A best-practices facility uses predictive maintenance for essential equipment where condition-monitoring is practical, limiting reactive and preventive strategies to equipment that s not mission-critical and will cause little or no collateral damage if allowed to run to failure. By applying infrared thermography and vibration monitoring, you can observe the health of critical support systems in order to detect impending failures and minimize the risk of a catastrophe. 6

About the Authors Frank Vitucci is a District Manager in New England & New York Metro regions for Emerson which helps process industries to better manage plants through automation systems, intelligent control systems and software, measurement devices, consulting and industry expertise. He has a B.S. from the University of Connecticut, Level 1 Certification in Infrared Thermography and Vibration Analysis and 20 years experience working with diverse industries to help implement Plant Asset Management Programs, Reliability Based Maintenance and Machinery Health Management Technologies. Brian Olsen is a Business Development Manager for Emerson Network Power, Liebert Service Division. He has a B.S. from the United States Merchant Marine Academy and a M.S. in Management from Stevens Institute of Technology. Brian has 8 years of experience in maintenance of critical data center equipment. Brandon Schaner is a Senior Machinery Health Analyst at Emerson. He has over 13 years experience in heavy and commercial industrial applications providing predictive & preventive maintenance implementation service, along with PdM mentoring services. In addition, he also provides troubleshooting, corrective and re-engineering expertise for problematic in-place machinery. Brandon is certified ISO Category II in Vibration Analysis and ASNT Level I in Thermography, Ultrasound and Visual Inspection. Emerson Reliability Solutions 835 Innovation Drive Knoxville, TN 37932 USA +1 865 675 2400 The Emerson logo is a trademark and service mark of Emerson Electric Co. The AMS logo is a mark of one of the Emerson family of companies. All other marks are the property of their respective owners. The contents of this publication are presented for informational purposes only, and while every effort has been made to ensure their accuracy, they are not to be construed as warranties or guarantees, express or implied, regarding the products or services described herein or their use or applicability. All sales are governed by our terms and conditions, which are available on request. We reserve the right to modify or improve the designs or specifications of our products at any time without notice. 2017, Emerson. All rights reserved.