Analysis of Stochastic Model on a Two-Unit Hot Standby Combined Hardware-Software System

Similar documents
Monu Kumar Department of Mathematics, M.D. University, Rohtak , INDIA

Analysis of Replication Control Protocols

Optimal Cost-Effective Design of Standby Systems Subject to Imperfect Fault-Coverage

VALIDATING AN ANALYTICAL APPROXIMATION THROUGH DISCRETE SIMULATION

Optimal Allocation for Reliability Analysis of Series Parallel System using Dynamic Programming

OPTIMAL ALLOCATION FOR RELIABILITY ANALYSIS OF SERIES PARALLEL SYSTEM USING DYNAMIC PROGRAMMING

Automatic Generation of Availability Models in RAScad

Integration of analytic model and simulation model for analysis on system survivability

Review of Software Fault-Tolerance Methods for Reliability Enhancement of Real-Time Software Systems

Availability of Coding Based Replication Schemes. Gagan Agrawal. University of Maryland. College Park, MD 20742

Computing Submesh Reliability in Two-Dimensional Meshes

RealTime. RealTime. Real risks. Data recovery now possible in minutes, not hours or days. A Vyant Technologies Product. Situation Analysis

Error Control System for Parallel Multichannel Using Selective Repeat ARQ

A Modelling and Analysis Environment for LARES

Fault Tolerant System for Sparse Traffic Grooming in Optical WDM Mesh Networks Using Combiner Queue

A Technique for Design Patterns Detection

Handling Multi Objectives of with Multi Objective Dynamic Particle Swarm Optimization

Module 4: Stochastic Activity Networks

Case Study #3: Analysis of Replicated Data with Repair Dependency

WITH THE proliferation and ubiquity of handheld devices

Discrete Time Batch Arrival Queue with Multiple Vacations

A Simple and Efficient Mechanism to Prevent Saturation in Wormhole Networks Λ

Evaluation of Moving Object Tracking Techniques for Video Surveillance Applications

Subset Cut Enumeration of Flow Networks with Imperfect Nodes

Analysis and Design Language (AADL) for Quantitative System Reliability and Availability Modeling

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, July 18, ISSN

Availability Model for Virtualized Platforms

Fault tolerance and Reliability

STRUCTURE-BASED software reliability analysis techniques

MATERIALS AND METHOD

An Ameliorated Methodology to Eliminate Redundancy in Databases Using SQL

International Journal of Advance Engineering and Research Development. Flow Control Loop Analysis for System Modeling & Identification

Adding Formal Requirements Modeling to SysML

An optimal novel Byzantine agreement protocol (ONBAP) for heterogeneous distributed database processing systems

The Design and Performance Analysis of QoS-Aware Edge-Router for High-Speed IP Optical Networks

Fault tolerant scheduling in real time systems

Keywords: Software reliability, Logistic Growth, Curve Model, Software Reliability Model, Mean Value Function, Failure Intensity Function.

Dynamic Deferred Acknowledgment Mechanism for Improving the Performance of TCP in Multi-Hop Wireless Networks

Available online at ScienceDirect. Procedia Computer Science 54 (2015 ) 24 30

A Review on Reliability Issues in Cloud Service

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 7, Issue 1, July 2017

SLL REDUCTION IN APERIODIC LINEAR ARRAY ANTENNA SYSTEM WITH SCAN ANGLES

Fault tolerant system with imperfect coverage, reboot and server vacation

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3 6, 2012

Synthesis of Supervisory Control of Discrete Event System for Static VAR Compansator

Open Access Extended Dynamic Fault Tree Algorithm Based on Stochastic Petri Net and Micro-Satellite On-Board Computer Case Analysis

Lifetime Enhancement of Wireless Sensor Networks using Duty Cycle, Network Coding and Cluster Head Selection Algorithm

A Fault Tolerant Approach for WSN Chain Based Routing Protocols

Integration of Dynamic Tariff Scheme in Congestion Control

Enhancing Availability Using Identity Privacy Preserving Mechanism in Cloud Data Storage

Backup vs. Business Continuity

Including Failure Correlation in Availability Modelling of a Software-Defined Backbone Network

An Optimal Locking Scheme in Object-Oriented Database Systems

A heuristic approach to find the global optimum of function

Comparison of pre-backoff and post-backoff procedures for IEEE distributed coordination function

Dynamic Reliability Block Diagrams VS Dynamic Fault Trees

Cost Reduction of Replicated Data in Distributed Database System

Gupta Nikita $ Kochhar

COMPARATIVE ANALYSIS OF EYE DETECTION AND TRACKING ALGORITHMS FOR SURVEILLANCE

Dynamic control and Resource management for Mission Critical Multi-tier Applications in Cloud Data Center

Robust Pole Placement using Linear Quadratic Regulator Weight Selection Algorithm

An Efficient Selective-Repeat ARQ Scheme for Half-duplex Infrared Links under High Bit Error Rate Conditions

Design and Implementation of Microcode based Built-in Self-Test for Fault Detection in Memory and its Repair

STUDY OF THE DEVELOPMENT OF THE STRUCTURE OF THE NETWORK OF SOFIA SUBWAY

Reliability-Based Software Rejuvenation Scheduling for Cloud-Based Systems

The effect of server s breakdown on the performance of finite-source retrial queueing systems

Impact of Source Counter on Routing Performance in Resource Constrained DTNs

AUTOMATIC repeat request (ARQ) techniques have been

Aggrandize the Reliability by Bug Retrieval (ARBR)

A simple mathematical model that considers the performance of an intermediate node having wavelength conversion capability

Reliability Estimation Model for Software Components Using CEP

ISSN: [Keswani* et al., 7(1): January, 2018] Impact Factor: 4.116

HYBRID PETRI NET MODEL BASED DECISION SUPPORT SYSTEM. Janetta Culita, Simona Caramihai, Calin Munteanu

International Journal of Scientific & Engineering Research Volume 8, Issue 5, May ISSN

Fault Tolerance. The Three universe model

Comparative Analysis of GKCS and EKCS

Enhanced Live Migration of Virtual Machine Using Comparison of Modified and Unmodified Pages

FEC Performance in Large File Transfer over Bursty Channels

A Co-Operative Cluster Based Data Replication Technique for Improving Data Accessibility and Reducing Query Delay in Manet s

Built-in Self-Test and Repair (BISTR) Techniques for Embedded RAMs

Study of Load Balancing Schemes over a Video on Demand System

Analysis of Link-Layer Backoff Algorithms on Point-to-Point Markov Fading Links: Effect of Round-Trip Delays

An ICA based Approach for Complex Color Scene Text Binarization

Design and Analysis of Kogge-Stone and Han-Carlson Adders in 130nm CMOS Technology

A Comparison of Mechanical Artefacts used in Calibration of Coordinate Measuring Machines (CMMs)

Near Optimal Repair Rate Built-in Redundancy Analysis with Very Small Hardware Overhead

Reliability of Software Fault Prediction using Data Mining and Fuzzy Logic

A Process Scheduling Algorithm Based on Threshold for the Cloud Computing Environment

A Novel Class-based Protection Algorithm Providing Fast Service Recovery in IP/WDM Networks

Automated Development of Markovian Chains for Fault- Tolerant Computer-Based Systems with Version- Structure Redundancy

Optimal Resource Allocation for Multimedia Cloud Based on Queuing Model

Implementation of Crosstalk Elimination Using Digital Patterns Methodology

A New Statistical Software Reliability Tool

A Level-wise Priority Based Task Scheduling for Heterogeneous Systems

Defects Per Million (DPM): A User-oriented Perspective of Telecommunication Systems

Discrete Event Simulation and Petri net Modeling for Reliability Analysis

Optimization of Process Time in the Creation and Updation of Technical Publication Documents Using Business Process Reengineering: Case Study

Optimal Design of E-Commerce Site Infrastructure from a Business Perspective

Survey on Incremental MapReduce for Data Mining

Transcription:

Analysis of Stochastic Model on a Two-Unit Hot Standby Combined Hardware-Software System Rajeev Kumar Department of Mathematics, M.D. University, Rohtak-124001, INDIA Sudesh Kumari Department of Mathematics, M.D. University, Rohtak-124001, INDIA ABSTRACT This paper deals with a stochastic model for a two-unit hot standby combined hardware-software system in which one unit is operative and the other is hot standby. The operative unit may have hardware or software failures and goes to repair. This leads to degradation of the system and then hot standby unit is under operation. Further on hardware or software failures of the hot standby unit the system goes to complete failures. Various measures of performance of the system are obtained using semi- Markov process and regenerative point techniques. Numerical results and graphs pertaining to a particular case are also included. General Terms Stochastic Process Keywords Two-Unit Hot Standby System, Combined hardwaresoftware system, reliability, mean up time, profit, semi- Markov and Regenerative point techniques. 1. INTRODUCTION In the recent years, a large number of hardware-software systems have been developed and adopted due to their practical applications and common man s affordability and inherent reliability. A hardware-software system consists of hardware and software subsystems. Most of the modern systems like mobiles, computers, robots, missiles, rockets, electronic meters, washing machines etc. are improvement over hardware-software systems. Today society and its Institutions are becoming more and more depend on many such systems. A few of the numerous applications of the hardware and software systems are control of communications and transport systems, automated plants operations and even in routine activities e.g. reservation of tickets, computation of various bills and in business. All these things demands high reliability since failure of any can be costly and hazardous. At present it is a matter of great importance to produce and provide the cost-effective, efficient, user friendly hardware-software system having higher reliability. For effective performance of the hardware-software system, both its hardware and software components must function with considerable reliability. As the hardwaresoftware systems are more complex and the demand on their reliability has increased exponentially. For the reliability analysis of the one-unit hardware-software system, combined reliability model i.e. considering both hardware and software components were discussed by some researchers such as Boyd and Monahan [1], Friedman and Tran [2], Hecht and Hecht [3], Huang et al.[4], Iyer and Velardi [5], Kanoun and Ortalo-Borrel [6], Kumar and Malik [7], Kumar and Kumar [8], Martin and Mathur [9], Teng et al. [10], Trivedi [11], Trivedi et al. [12], Welke et al. [13] etc. However, analysis of a twounit hot standby hardware-software system with respect to its reliability, availability and profit has not been reported in the literature of reliability modeling. Keeping this in view, in the present paper, a stochastic model for two identical units hot standby combined hardware-software system is analyzed. Initially, one unit is operative and other is hot standby and any one of the units may have a hardware or software component failure. On failure of a unit either due to hardware or software component failure, the system goes to degraded state and when both the units fails the system goes to complete failure. It is assumed here that hardware and software failures are independent of each other and occurs only due to hardware and software components, respectively. Further, it is assumed that there is single repair facility/engineer that handles all types of hardware or software failures and reaches the system in negligible time. Other assumptions of the model are: 1. After each hardware/software repair, the system is as good as new. 2. If a unit is under repair, it does not work for the system. 3. The times to failures are exponentially distributed whereas recovery/repair time distributions are general. 4. All random variables are mutually independent. 5. Switching is perfect and instantaneous. Various measures of system performance are obtained using semi Markov Process and regenerative point techniques. The expressions for various measures of system performance are obtained such as mean time to system failure, mean up time, expected number of software repairs, expected number of hardware repairs, expected number of visits by repairman. Expected profit incurred for the system is also computed using above measures. Various conclusions regarding reliability and profit of the system are drawn on the basis of graphical studies. 2. NOTATIONS O : Operative unit. F sr : Software failure is not recovered and unit is under manual repair. F hr : Hardware failure is not recovered and unit is under manual repair. F sr : Software repair is continuing from the previous state. 29

F hr : Hardware repair is continuing from the previous state. F sw : Unit is under software failure and it is waiting for repair. F hw : Unit is under hardware failure and it is waiting for repair. λ s : Software failure rate. λ h : Hardware failure rate. α s : Software repair rate. α h : Hardware repair rate. g s (t) : P.d.f. of time to software repair. g h (t) : P.d.f. of time to hardware repair. The transition diagram depicting the various states of the system is shown in the fig.1.the epochs of entry into the states 0, 1, 2 are regenerative points and thus the states 0, 1, 2 are regenerative states and 3, 4, 5, 6 are failed states. 3. MEASURES OF THE SYSTEM PERFORMANCE 3.1 Transition Probabilities The transition probabilities p ij are given by The unconditional mean time ( taken by the system to transition for any regenerative state j, when it (time) is counted from epoch of entrance into that state i is, mathematically, stated as: Thus, λ ; λ λ ; ; 4. OTHER MEASURES OF THE SYSTEM PERFORMANCE By probabilistic arguments for the regenerative process, the recursive relations for various measures of the system performance are obtained. Then on solving the relations using Laplace-Stieltjes transforms, we get the following measures: Mean Time to System Failure Mean Up Time Expected Number of Hardware Repair Expected Number of Software Repairs Expected Number of Visits by the Repairman where By these transition probabilities it can be verified that where S =, S 1 =, S 2 =, 3.2 Mean Sojourn Time The mean sojourn time in the i th state is the expected first passage time taken by the unit at the i th state before transiting to any other state. where T i is the p.d.f. of device life time. Therefore, ; ; ; ; 3.3. Unconditional Mean Time 5. PROFIT ANALYSIS The expected total profit (P 0 ) incurred to the system in steady state is given by P0 = where C 0 = revenue per unit up time of the system. C 1 = revenue per unit degradation time of the system. C 2 = cost per unit of hardware repair. C 3 = cost per unit of software repair. C 4 = cost per visit of the repairman. C 5 = installation cost. 30

6. PARTICULAR CASE For the analysis purpose, the following particular case is taken: ; The values of the various failure rates and repair rates are assumed as, i.e h =.01, h =.001, h =.002, s =.0002, s =.002, α h =.2, α h =.6, α h =.8, α h = 1.8, α s =.3, α s =.5, α s =.6, α s =.7, C 0 = 30000, C 1 = 30000, C 2 = 500, C 3 = 300, C 4 = 20, C 5 = 24000 etc. 7. GRAPHICAL ANALYSIS Using the above particular case and the assumed values of parameters, various graphs are plotted for mean time to system failure (T 0 ), mean up time (A 0 ), mean degradation time (D 0 ) and profit (P 0 ) of the system for different values of hardware and software failures rates (λ h, λ s ) and repair rates (α h, α s ), respectively. Fig. 2 and fig. 3 depicts the behavior of mean time to system failure (T 0 ) with respect to various hardware and software failure rates ( h, s ), respectively for different values of hardware and software repair rates ( α h, α s ). It can be seen that mean time to system failure decreases as hardware and software failure rates increase and further mean time to system failure increases with higher values of repair rates. Fig. 4 and fig. 5, shows the behavior of mean up time (A 0 ) with respect to various hardware and software failure rates ( h, s ), respectively for different values of hardware and software repair rates ( α h, α s ). From the graphs, it can be seen that mean up time decreases with the increase in the values of hardware and software failure rates and further mean up time increases with higher values of repair rates. Fig. 6 and fig. 7 reveals the pattern of the profit (P 0 ) incurred to the system with respect to hardware and software failure rates ( h, s ). It can be seen that the profit (P 0 ) of the system decreases as the failure rates increases whereas increases for higher values of repair rates. 8. CONCLUSIONS It can be observed that reliability, mean up time and profit of the two unit hot standby hardware-software system decrease as hardware and software failure rates increases and these increases with higher values of hardware and software repair rates. The limits of hardware and software failure/repair rates can be obtained for the system to give positive profit that may be quite useful for both the system developer and the system user. 9. REFERENCES [1] Boyd, M.A. and Monahan, C. M., 1995 Developing Integrated Hardware-Software System Reliability models: Difficulties and Issues [For Digital Avionics]. Proceeding of the Digital Avionics Systems Conference, 14 th DASC, pp.193-198, Cambridge, USA. [2] Friedman, M.A. and Tran, P., 1992. Reliability Techniques for Combined Hardware/Software Systems, Proceeding Annual Reliability and Maintainability Symposium, pp.290-293 [3] Hecht and Hecht, 1986. Software Reliability in The System Context, IEEE Transactions on Software Engineering, Vol.-12, pp.51-58. [4] Huang, B., Li, X., Li, M., Bernstein, J. and Smidts, C.,2005. Study of the Impact of Hardware Fault on Software Reliability, Proc. 16 th IEEE Int l Symp. Software Reliability Engineering. [5] Iyer, R.K. and Velardi, P., 1985. Hardware related Software Errors: Measurement and Analysis. IEEE Transactions on Software Engineering, Vol.11, pp.223-230. [6] Kanoun, K. and Ortalo-Borrel, M., 2000. Fault-Tolerant System Dependency-Explicit Modeling of Hardware and Software Component-Interactions, IEEE Transactions Reliability, Vol. 49, No. 4, pp. 363-376. [7] Kumar, A. and Malik, S.C., 2012. Stochastic Modeling of a Computer System with Priority to PM over S/W Replacement Subject to Maximum Operation and Repair, International Journal of Computer Applications, Vol.43, No.3, pp. 27-34. [8] Kumar, R. and Kumar, M. 2012. Performance and Cost Benefit Analysis of a Hardware-Software System Considering Hardware based Software Interactions Failures and Different Types of Recovery, International Journal of Computer Applications, Vol. 53, No.-17, pp.25-32 [9] Martin, R. and Mathur, A.P., 1990. Software and Hardware Quality Assurance: Towards a Common Platform for Increase the Usability of this Methodology, Proc. IEEE Conf. Comm. [10] Teng, X., Pham, H. and Jeske, D.R., 2006. Reliability Modeling of Hardware and Software Interactions and its Applications, IEEE Transactions Reliability, Vol.55, pp.571-577. [11] Trivedi, K. S., Vasireddy, R., Trindade, D., Nathan,S., and Rick Castro., 2006. Modeling High Availability Systems. Proceeding Pacific Rim Dependability Conference PRDC. [12] Trivedi, K.S., 2011. Probability and Statistics with Reliability, Queuing and Computer Science Applications, KPH Publishing. [13] Welke, S.R., Johnson, B.W. and Aylor, J.H., 1995. Reliability Modeling of Hardware/Software Systems, IEEE Trans. Reliability, Vol.44, pp.413-418. 31

MTSF(T 0 ) F sr,f hw 4 λ s g s (t) F hr,f hw 6 λ h g h (t) F sr,o 1 2λ s g s (t) O,O 0 2λ h g h (t) F hr,o 2 g s (t) λ s g h (t) λ s F sr,f sw 3 F hr,f sw 5 : Operative State : Failed State : Degraded State : Regenerative Point Fig. 1: State Transition Diagram 10000 MTSF VERSUS HARDWARE FAILURE RATE FOR DIFFERENT VALUES OF HARDWARE REPAIR RATE 9000 8000 7000 α s =. 5, λ s =.0002 6000 5000 4000 3000 2000 αh=.6 αh=1.3 αh=2 1000 0 0.01 0.012 0.014 0.016 0.018 HARDWARE FAILURE RATE (λ h ) Fig. 2: MTSF versus Hardware Failure Rate 32

MEAN UP TIME (A 0 ) MTSF (T 0 ) MTSF VERSUS SOFTWARE FAILURE RATE FOR DIFFERENT VALUES OF SOFTWARE REPAIR RATE 9500 9000 8500 α h = 1.8, λ h =.01 8000 7500 7000 6500 αs=.5 αs=1 αs=1.5 6000 5500 5000 0 0.0005 0.001 0.0015 0.002 SOFTWARE FAILURE RATE (λ s ) Fig. 3: MTSF Versus Software Failure Rate 1 MEAN UP TIME VERSUS HARDWARE FAILURE RATE FOR DIFFERENT VALUES OF HARDWARE REPAIR RATE 0.99 α s =.7, λ s =.002 0.98 0.97 0.96 0.95 0.94 αh=.8 αh=1.4 αh=2 0.93 0.92 0.01 0.015 0.02 0.025 0.03 0.035 HARDWARE FAILURE RATE (λ h ) Fig. 4: Mean Up Time Versus Hardware Failure Rate 33

PROFIT (P 0 ) MEAN UP TIME (A 0 ) 0.99 MEAN UP TIME VERSUS SOFTWARE FAILURE RATE FOR DIFFERENT VALUES OF SOFTWARE REPAIR RATE 0.989 α h = 1.8, λ h =.01 0.988 0.987 0.986 0.985 αs=.6 αs=1.05 αs=1.5 0.984 0.983 0 0.0005 0.001 0.0015 0.002 SOFTWARE FAILURE RATE (λ s ) Fig. 5: Mean Up Time Versus Software Failure Rate PROFIT VERSUS HARDWARE FAILURE RATE FOR DIFFERENT VALUES OF HARDWARE REPAIR RATE 2500000 2000000 α s =. 5, λ s =.002, C 0 = 30000, C 1 = 30000, C 2 = 500, C 3 = 300, C 4 = 20, C 5 = 24000 1500000 1000000 500000 αh=.6 αh=1.3 αh=2 0 0.01 0.012 0.014 0.016 0.018 0.02 HARDWARE FAILURE RATE (λ h ) Fig. 6: Profit Versus Hardware Faifure Rate 34

PROFIT (P 0 ) 2300000 2100000 1900000 1700000 PROFIT VERSUS SOFTWARE FAILURE RATE FOR DIFFERENT VALUES OF SOFTWARE REPAIR RATE α h = 1.8, λ h =.01, C 0 = 30000, C 1 = 30000, C 2 = 500, C 3 = 300, C 4 = 20, C 5 = 24000 1500000 1300000 1100000 900000 αs=.3 αs=.9 αs=1.5 700000 500000 0.002 0.004 0.006 0.008 0.01 SOFTWARE FAILURE RATE (λ s ) Fig. 7: Profit Versus Software Failure Rate IJCA TM : www.ijcaonline.org 35