
Monte Carlo Simulation in Dependability Analysis
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
System dependability is a complex task to grasp and analyze since it encompasses reliability, maintainability, availability, failure mode analysis and feared events. For operational safety analyses, reliability is a quantitative basis for the other disciplines of maintainability, availability and safety. Reliability metrics such as failure rate or MTBF are often misused as they are only valid for low-maintenance applications, and wrongly for others, as MTBF is only relevant for availability. In addition, in operational safety, many equations do not have explicit solutions, and Monte Carlo simulations are a little-used way of obtaining and/or confirming the solution obtained by numerical methods.
Monte Carlo Simulation in Dependability Analysis fills this gap as best as we can. This task is a difficult one, since operational safety is a cross-disciplinary activity in the engineering sciences - cross-disciplinary in that it must be present throughout a product's life cycle.
Franck Bayle trained as an electronic engineer. He has practiced for almost 15 years, working at Crouzet and then at Thalès in Valence, France. He has also worked as Design Authority in reliability and maturity.
Laurent Denis is the CEO of StatXpert, a consulting, training, and software company specializing in statistics and operational reliability based in Pessac, France.
Adrien Gigliati is Dependability Engineer at Thalès in Valence, France.
More details
Other editions
Additional editions

Content
Foreword by Philippe Bogdanik xi
Foreword by Gilles Zwingelstein xiii
List of Notations xv
List of Acronyms xvii
Definitions xix
Introduction xxi
Part 1. Reliability 1
Chapter 1. Predictive Reliability 3
1.1. Concept of predictive reliability 3
1.2. FIDES methodology 4
1.3. Application example 7
1.4. Maintaining a reliability specification 11
1.5. Summary 12
Chapter 2. Statistical Characteristics of Exponential and Weibull Distributions 13
2.1. Refresher about exponential and Weibull distributions 13
2.2. Parameter estimation for a reliability model using the maximum likelihood method 14
2.3. Estimator properties 17
2.4. Simulation of failure times by inverting the probability of failure 19
2.5. Impact of temperature 21
2.6. Relative bias and coefficient of variation of the Weibull parameters 22
2.7. Simulation scenarios considered from the different parameters 26
2.8. Summary 31
Chapter 3. System Reliability 33
3.1. Assumptions 34
3.2. Maintenance-free systems 34
3.3. Maintenance-free systems 45
3.4. Series/parallel system 73
3.5. Parallel/serial system 78
3.6. Use cases 86
Chapter 4. Impact of Temperature on Reliability 97
4.1. Arrhenius law 97
4.2. Operational life profile 98
4.3. Sedyakin's principle 98
4.4. Consequences for reliability estimates 100
4.5. Taking the effect of maintenance into consideration 107
4.6. Summary 114
Chapter 5. Aging Tests 115
5.1. Accelerated aging test 117
5.2. Aging test design 118
5.3. Sequential test at two constant temperatures 121
5.4. Constant-level parallel testing 124
5.5. Constant-level mixed testing 125
5.6. Summary 125
Chapter 6. Application of the Noncentral Beta Distribution 127
6.1. Context 127
6.2. The "noncentral beta" probability distribution 129
6.3. Measurement modeling 130
6.4. Rejection method 132
6.5. Confidence interval for noncentral beta distribution 136
6.6. Rationale for the choice of the noncentral beta distribution 137
6.7. Summary 139
Chapter 7. Statistical Characteristics of HPP and PLP Processes 141
7.1. Reminders about Poisson processes 141
7.2. HPP homogeneous Poisson process 141
7.3. PLP power process 143
7.4. Summary 148
Part 2. Maintainability 151
Chapter 8. Maintainability 153
8.1. Average number of failures 153
8.2. Serial system 154
8.3. Parallel system 159
8.4. k/n system 162
8.5. Avionics system 164
8.6. Summary 165
Part 3. Availability 167
Chapter 9. System Availability 169
9.1. Assumptions 169
9.2. Uptime and repair time: exponential distributions 171
9.3. Exponential distribution uptime and constant repair time 174
9.4. Exponential distribution uptime and uniform distribution repair time 176
9.5. Exponential distribution uptimes and normal distribution repair times 178
9.6. Uptimes exponential distribution and repair times Weibull distribution 182
9.7. Serial system 183
9.8. Parallel system 186
9.9. k-out-of-n redundancy 189
9.10. Series/parallel system 192
9.11. Parallel/serial system 193
9.12. Energy conversion 194
9.13. Summary 200
Part 4. Safety 203
Chapter 10. FMEA Concurrent Failure Mechanisms 205
10.1. Maintenance-free industrial applications 209
10.2. Industrial applications with maintenance 211
10.3. Consideration of physical contributions 215
10.4. Summary 219
Chapter 11. Feared Events (FTA) 221
11.1. Introduction 221
11.2. Regulatory aspects 222
11.3. Probability of the occurrence of a feared event 228
11.4. Practical application 235
11.5. Summary 239
Appendices 241
References 249
Index 251
Introduction
When Napoleon asked Laplace why his treatise on cosmology did not mention God, the latter replied, "Sire, I had no need for this hypothesis". At the end of the 19th century, physicists thought they had covered the whole of physics, and it was considered purely deterministic. There was however a small problem, but they thought it would be quickly resolved during the following century. Indeed, Young's famous slit experiment has remained somewhat forgotten. The problem encountered was that, depending on the conditions of the experiment, light behaves either like a particle or a wave, and there was no explanation for this at the time.
Owing to Planck, Einstein, Bohr, Heisenberg, Pauli, Dirac, Schrödinger and Born, quantum mechanics was born in the 1920s. It essentially states that at the subatomic level, we cannot know the position of a particle with great precision. Rather, only its probability of presence and determinism give way to uncertainty. Despite the fact it is little addressed in educational programs, quantum mechanics is nevertheless present in our daily lives. It fuels the doubt that makes us uncomfortable, and we all seek to reduce, or even better, eliminate it. We continued to monitor weather forecasts, depsite the fact we know that the weather is notoriously unpredictable and that forecasts are quite often inaccurate.
In industry, we more commonly speak of operational safety. This discipline, which acquired this name and its current form mainly over the last half century and in the defense, aeronautics, space, nuclear, telecommunications and transport sectors, is now useful, even essential, for all sectors of industry and even other activities. The goal that requires the use of operational safety is more recognizable under the term "risk control". Operational safety is, according to Villemeur (1988), "the ability of an entity to satisfy one or more required functions under given conditions". It mainly encompasses four components: reliability, maintainability, availability and safety.
Figure I.1. Various analyses of operational safety.
Reliability is thus the quantitative basis for the other three analyses of operational safety. It can itself be divided into three distinct levels depending on the phases of the system's lifecycle. This can be illustrated synthetically by Figure I.2.
Figure I.2. Different types of reliability and their positioning in the lifecycle of a system.
The maintainability of a system is also a major parameter. Of course, it only makes sense for systems subject to maintenance. It does not depend directly on reliability (the reason for the green arrow in Figure I.1). For a system to be optimally maintained, it is first essential that the spare stock is in line with the number of system failures. Thus, the estimation of the "right" number of systems in stock can be assessed on the basis of the reliability level.
Availability is a critical parameter for many industrial applications, such as aeronautics, railways, energy production and distribution. Availability represents the ability of a system to perform a required function under given conditions at a given time. It depends not only on the reliability of the system but also on its ability to be repaired. The number of repairs is generally random.
Safety, more commonly referred to as "safety" in certain industrial fields, is the ability of a product to respect, during all phases of a device's life, an acceptable level of accident risk likely to cause an attack on personnel or major degradation of the product or its environment. It is generally broken down into an analysis of failure modes and their effects and an estimation of the probability of occurrence of certain feared events specified by the client.
The authors, despite sometimes working in different industrial fields, have very often encountered methodological errors, resulting in the confusion of the major parameters of these different areas of operational safety. For example, the failure rate is often referred to as a key reliability indicator, particularly on the Internet, whereas its practical use is restricted to a few industrial applications, such as space or certain military applications (missiles). It is also possible that, for a new generation component, reliability tests are carried out on test benches to verify that its intrinsic reliability is indeed as expected. However, these tests are rare because they are time consuming and have a significant cost. The effect of maintenance, which is present in most industrial applications, is often neglected even though it plays a fundamental role in the reliability of an operating system.
There are books detailing in a very rigorous way the theoretical methods used to approach these different themes, but they often remain very academic and difficult to master in the industrial field. In addition, we often address theoretical equations that do not have explicit solutions. In some cases, numerical solutions exist, but they can present certain problems, such as inconsistent results, without really realizing them, and they do not allow us to directly understand which parameters intervene in the variable of interest.
However, some renowned scientists often discovered revolutionary theories simply through thought experiments, well before these theories were experimentally verified. Therefore, it seems quite legitimate to want to simulate random events such as moments of failure, repairs or, more generally, the appearance of the feared or hoped-for fact under the assumption of probability distributions. Monte Carlo simulations can be an effective tool for resolving the various problems mentioned above.
The objective of this book, in an industrial context of operational safety, is therefore twofold:
- We propose theoretical solutions adapted to the problems encountered.
- The corresponding Monte Carlo simulations are presented when explicit solutions are not available or to verify the proposed theoretical approximations or numerical calculations.
In accordance with Figure I.1, the structure of the book is composed of four distinct parts. For each of them, we present a theoretical analysis for systems operated without maintenance as well as a theoretical analysis and Monte Carlo simulations for systems with maintenance. If an explicit solution exists, we use only this one. If, on the other hand, only a numerical solution is possible, we will then use simulations to corroborate the results obtained. Finally, if no analytical solution is possible, either because it does not exist or because it is not known to us, we will systematically use simulations either by the failure probability inversion method or by the rejection method when the reciprocal function has no explicit expression. The simulations carried out are either created in Python code or using the "Weibull++" or "BlockSim" software from ReliaSoft®.
We begin with Part 1 on Reliability because if there is one area where uncertainty is very present, it is that of reliability. Regardless of how well we know the physics of the technology used, the failure times of technological entities being tested or operated are always random.
During the so-called specification phase, a reliability objective is generally required by the client, which translates either into a probability of successfully completing a specific mission for systems operating without maintenance or into an "MTBF" for those with maintenance. Thus, at the end of the design phase, since the final (series) version of the system is not yet available (only prototypes are functional), a predictive reliability analysis is carried out to verify that the objective is met. The realism of this predictive analysis is therefore very important, particularly the estimation of the levels of physical contributions (life profile) to which the system will be exposed during its operational life.
Thus, Chapter 1 propose to evaluate the sensitivity of the forecast reliability estimate to the parameters of the life profile of a system. Therefore, instead of constructing it from constant levels, we propose, when necessary, to assume a probability law chosen according to the information collected. We then use Monte Carlo simulations to estimate the variability of the system reliability.
On the other hand, in certain specific cases, reliability tests on a component deemed "at risk" (with respect to the reliability of the entire system) may be deemed necessary or even essential. This may be a component with an insufficient reliability level on a previous generation and that has undergone a design change, a new technology component, a single-source component for which the manufacturer does not provide reliability information, etc. In contrast to the general case where we want to minimize the number of system failures, particularly during the design phase, we will therefore do everything possible to induce failures to obtain an "intrinsic" reliability model of the component with a given accuracy. The notions of bias and variance of the estimators of the reliability model parameters are then of the utmost importance. Chapter 2 therefore proposes a theoretical approach with explicit solutions when possible and Monte Carlo simulations when this is not the case.
Finally, so-called...
System requirements
File format: ePUB
Copy protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (not Kindle).
The file format ePub works well for novels and non-fiction books – i.e., „flowing” text without complex layout. On an e-reader or smartphone, line and page breaks automatically adjust to fit the small displays.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our ebook Help page.