A Stochastic Approach For The Analysis of Fault

480 IEEE TRANSACTIONS ON RELIABILITY, VOL. 63, NO.
2, JUNE 2014
A Stochastic Approach for the Analysis of Fault

Trees With Priority AND Gates
Peican Zhu, Jie Han, Member, IEEE, Leibo Liu, Member, IEEE, and Ming J. Zuo, Senior Member, IEEE
Abstract—Dynamic fault tree (DFT) analysis has been used to cdf cumulative density function
account for dynamic behaviors such as the sequence-dependent,
functional-dependent, and priority relationships among the fail- BDD binary decision diagram
ures of basic events. Various methodologies have been developed
to analyze a DFT; however, most methods require a complex ana- SBDD sequential binary decision diagram
lytical procedure or a significant simulation time for an accurate MC Monte Carlo
analysis. In this paper, a stochastic computational approach is pro-
posed for an efficient analysis of the top event’s failure probability FPGA Field Programmable Gate Array
in a DFT with priority AND (PAND) gates. A stochastic model is
initially proposed for a two-input PAND gate, and a successive cas-
cading model is then presented for a general multiple-input PAND NOTATION
gate. A stochastic approach using the proposed models provides an
efficient analysis of a DFT compared to an accurate analysis or al- symbol for an inclusive precedence in a
gebraic approach. The accuracy of a stochastic analysis increases failure order
with the length of random binary bit streams in stochastic com-
putation. The use of non-Bernoulli sequences of random permuta- mission time
tions of fixed counts of 1s and 0s as initial input events’ probabili-
ties makes the stochastic approach more efficient, and more accu- basic events
rate than Monte Carlo simulation. Non-exponential failure distri- failure rate
butions and repeated events are readily handled by the stochastic
approach. The accuracy, efficiency, and scalability of the stochastic failure time of basic event
approach are shown by several case studies of DFT analysis.
failure probability in the time interval
Index Terms—Dynamic fault tree, reliability analysis, stochastic
computation, non-Bernoulli sequence, priority AND gate, sto-
chastic logic. binary sequence at
sequence length in the number of bits
ACRONYMS AND ABBREVIATIONS
FTA fault tree analysis I. INTRODUCTION
DFT dynamic fault tree
FDEP
PAND
functional dependency gate
priority AND gate
F AULT TREE ANALYSIS (FTA) was first proposed in
1962 for evaluating a system’s failure probability, the
probability that a system fails during a specified mission time
SEQ sequence enforcing gate [1]. Failures can be disastrous for systems such as chemical
plants, nuclear reactors, airplanes, and computer systems; or
WSP warm spare gate
costly for systems such as online sales, or commercial servers.
CSP cold spare gate FTA has developed rapidly, and gained much attention in many
pdf probability density function applications, especially in the analysis of large safety-critical
systems [2]–[7].
However, dynamic behaviors, such as sequence-dependent,
Manuscript received April 07, 2013; revised September 17, 2013, November functionally dependent, and priority relationships, cannot be
25, 2013; accepted November 29, 2013. Date of publication April 14, 2014; date
modeled properly by traditional FTA. To account for these
of current version May 29, 2014. This work was supported in part by the Natural
Sciences and Engineering Research Council of Canada. Associate Editor: O. dynamic behaviors, dynamic fault tree (DFT) analysis has been
Gaudoin. proposed by incorporating additional dynamic gates into FTA.
P. Zhu and J. Han are with the Department of Electrical and Computer
Dynamic gates include the priority AND gate (PAND), the
Engineering, University of Alberta, Edmonton, AB T6G 2R3 Canada (e-mail:
Peican@ualberta.ca; jhan8@ualberta.ca). sequence enforcing gate (SEQ), the standby or spare gates that
M. J. Zuo is with the Department of Mechanical Engineering, University of include the warm spare gate (WSP) and cold spare gate (CSP),
Alberta, Edmonton, B T6G 2R3 Canada (e-mail: ming.zuo@ualberta.ca).
and the functional dependency gate (FDEP) [8], [9]. The failure
L. Liu is with the Institute of Microelectronics, Tsinghua University, Beijing
100084, China (e-mail: liulb@tsinghua.edu.cn). of a system is determined by the states of basic events and the
Digital Object Identifier 10.1109/TR.2014.2313796 interactions among them. The interactions can be derived from
0018-9529 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
ZHU et al.: A STOCHASTIC APPROACH FOR THE ANALYSIS OF FAULT TREES WITH PRIORITY AND GATES 481
a system’s topology. A DFT relies on the interactions among is applicable in the general case of a multiple-input PAND gate
static gates (including AND, OR, and K-out-of-N voting gates), through a cascaded PAND model. Subsequently, non-Bernoulli
and dynamic gates (PAND, FDEP, SPARE, and SEQ) [10]. sequences of random permutations of fixed numbers of 1s and 0s
Due to their operational characteristics, the dynamic gates are are used to encode initial event probabilities for an efficient im-
divided into two categories: 1) PAND and FDEP, which are plementation of stochastic computation. It has been shown that
referred to as priority dynamic gates; and 2) WSP and SEQ, the use of the non-Bernoulli sequences as initial inputs provides
whose operations are dependent on the duration of failure a more efficient, more accurate evaluation than using Bernoulli
events [11]. For systems with a perfect fault coverage, FDEP sequences [24], [25]. This paper shows that the non-Bernoulli
has been modeled as an OR gate [12]–[15]. In a redundant sequences significantly improves the efficiency of a stochastic
system with imperfect fault coverage, however, uncovered or FTA. Hence, the advantage of the stochastic approach is not lim-
undetected faults can propagate, and may have a global effect ited to those due to parallelization or a specific hardware im-
on system failure [16]. In this case, the OR-gate model is not plementation. Furthermore, repeated events, as frequently en-
applicable, and a combinatorial method has been proposed for countered in a general DFT, are inherently modeled by the sto-
an efficient reliability analysis of systems with FDEP gates chastic sequences that preserve signal correlation. Finally, the
[16]. As a first study, this paper is focused on priority dynamic stochastic approach is general because it is applicable to any
gates, and in particular the PAND gate. failure distribution by encoding failure probabilities into sto-
Various methodologies using Markov [12] and Bayesian [17] chastic non-Bernoulli sequences.
models have been proposed for evaluating the dependability of The remainder of the paper is organized as follows. Section II
a fault tree. Due to the inevitable state-space explosion problem, reviews the definitions of PAND. Section III presents some
however, these approaches incur a large complexity for the anal- hypotheses considered in this paper. The stochastic approach
ysis of complex systems. Moreover, the evaluation of a large and the proposed model for the PAND gate are described in
DFT using a state-space based method becomes difficult when Section IV. In Section V, the accuracy and efficiency of the
a basic event’s failure behavior is not exponentially distributed. proposed approach are shown by the simulation results of
In [2], an Inclusion-Exclusion method is proposed for an several examples. Finally, Section VI concludes the paper.
exact analysis of a DFT that contains PAND gates and repeated
events. However, this method is limited to the analysis of sys- II. REVIEW OF THE PAND GATE
tems with exponentially distributed failure events; in addition, A PAND gate is a special type of AND gate for which an
detailed information on the minimal cut set is usually required input indicates the firing of a basic event that occurs in a prede-
in advance. In [7], an integral-based analysis is proposed for termined order, and the output indicates whether a failure occurs
handling any probability distribution; however, an analytical [5], [26]. Without the loss of generality, the predefined order is
expression is generally difficult to derive as a function of the assumed to be from left to right in this paper unless otherwise
basic events’ failure distributions. Several approaches have noted. The operational principles of a two-input PAND gate, as
been developed to simplify the process of deriving an exact well as its symbols, are shown in Fig. 1 for an inclusive con-
analytical expression. These approaches include those using bi- dition [11]. By an inclusive condition, if the two inputs of the
nary decision diagrams (BDDs) [18], sequential binary decision PAND gate fail simultaneously, the output fails at the same time
diagrams (SBDDs) [19], [20], and an algebraic analysis [11], as the inputs.
[21]. In particular, the SBDD approach has been applied to the As shown in Fig. 1, the output of the PAND gate is 1 (i.e.,
analysis of the PAND gate [22]. Monte Carlo (MC) simulation it fails) when the basic event fails before , or and
[23] has been widely used to evaluate complex DFTs; however, fail at the same time; otherwise, the output of the PAND gate
a long run time and a large sample size are needed to meet is 0, i.e., fault free. Let , and be the failure time
an accuracy requirement, because of the slow convergence of basic events , and respectively; the failure time of the
typically encountered in an MC simulation. Generally, it is PAND gate’s output, , is given by
challenging to efficiently and accurately evaluate the reliability
of a DFT.
A stochastic approach has been proposed in [15] for an effi-
cient evaluation of a system’s reliability. In particular, the se- (1)
rial and parallel implementations of stochastic computation are
considered, and a speedup in analysis is obtained by a parallel
implementation in field programmable gate arrays (FPGAs). In III. HYPOTHESES
[15], the PAND gate is modeled as a three-input AND gate, and Some hypotheses of the paper are as follows.
a sequential event is considered to be a basic event. In a general • The quantization level of a basic event is denoted by a
case, however, the input of a PAND gate is not limited to a basic binary variable , with 0 indicating no fault.
event. • All basic events are fault-free at the beginning of the mis-
In this paper, a new stochastic approach is presented for an sion time.
efficient analysis of fault trees with PAND gates. Initially, a sto- • The basic events are non-repairable [5]. This means that,
chastic computational model is proposed for the PAND gate; in if a basic event fails, the variable that indicates the status
this model, the output failure probability is obtained as a func- of the basic event takes 1. Let be the failure time
tion of the basic inputs’ failure probabilities. Thus, this model of a basic event ; the status variable of is 1 for time
482 IEEE TRANSACTIONS ON RELIABILITY, VOL. 63, NO. 2, JUNE 2014
Fig. 1. (a) Symbols for a two-input PAND gate [5], [26], (b) the expected behaviour of the two-input PAND gate for an inclusive condition (adapted from [11])
where 1, and 0 indicate faulty, and fault-free events respectively.
Fig. 3. A stochastic encoding using a sequence of 10 bits for a probability of

0.6.
Fig. 2. A timing diagram for a non-repairable basic event [5], [11], where a
value of 0 indicates no fault, while 1 means the event has failed and is
the failure time of the basic event .
IV. STOCHASTIC PAND MODEL
, and 0 otherwise. A generic timing diagram for A. Stochastic Computational Models

a non-repairable basic event is shown in Fig. 2.
Stochastic computation was introduced in the 1960s for reli-
• The probability density function (pdf), and cumulative den-
able circuit design [27]. In stochastic computation, real numbers
sity function (cdf) of an exponential distribution are given
or probabilities are encoded into the statistics of random binary
respectively by
bit streams [27]. In the random streams, a proportional number
(2) of bits are set to a specific value (usually 1) to indicate a proba-
and bility. Fig. 3 illustrates the stochastic encoding using a sequence
of 10 bits, although a longer sequence length is usually required
(3) in a practical application.
Fig. 4 shows several stochastic logic operations. In Fig. 4, the
where is the specified mission time, and is the (constant) input probabilities are coded in the form of binary sequences.
failure rate of a basic event for an exponential distribution. Each bit in a sequence is processed by a logic gate, and the
• The failure probability of a basic event in a selected time result of a probabilistic analysis is encoded in the output se-
interval is considered constant at the value quence. By this method, the dependencies among signals are in-
in the beginning of the time interval, i.e., the failure prob- herently maintained and propagated through a logic network; an
ability is given by for any time in this time example is shown in Fig. 4(c). Stochastic computation has sev-
interval. For simplicity, the time interval is eral advantages such as computational simplicity, high speed,
referred to as time in this paper. and fault-tolerance [24], [25].
Fig. 4. Stochastic logic in four examples: (a) an inverter with a random binary bit sequence as the input, (b) an AND gate with statistically independent inputs,
(c) an AND gate with totally correlated inputs, and (d) an OR gate with statistically independent inputs.
Due to inevitable stochastic fluctuations in stochastic com- Then, the failure time of the PAND gate’s output is given by
putation, the number of 1s in the output sequence is not de- , due to the model
terministic but probabilistic. Hence, the output of a stochastic considered in Case 2 in Fig. 1(b).
analysis follows approximately a Gaussian distribution when If , and , the basic event fails at
long random binary bit streams are used [25]. However, the use time while fails before time . The failure time of the basic
of non-Bernoulli sequences for initial input probabilities leads event is then
to a faster convergence of the result, as stated in the following
lemma. (5)
Lemma 1 (Theorem 1 in [25]): Compared to the case when The relationship between the failure times of the basic events
Bernoulli sequences are used to represent initial input probabil- and is given by
ities, the use of large non-Bernoulli sequences as random per-
mutations of fixed numbers of 1s and 0s results in an output (6)
sequence with the same mean number of 1s, and a smaller vari-
ance for an AND gate with statistically independent inputs. Thus, , due to (1), and the model consid-
Lemma 1 leads to the conclusion that, to meet a specific accu- ered in Case 1 in Fig. 1(b).
racy requirement, a smaller sequence length is required by using For the other possible scenario, i.e., the basic event fails
the non-Bernoulli sequences compared to the use of Bernoulli after , the top event of the PAND gate would not fail, i.e., with
sequences for encoding initial input probabilities of an AND a failure time of infinity, due to the model considered in Case 3
gate [25]. in Fig. 1(b).
It is trivial to show that Lemma 1 is also applicable to an in- Because the basic events are non-repairable, the state of the
verter, thus any logic network (as combinations of inverters and two-input PAND gate’s output event is affected by the gate’s
AND gates) can be more efficiently and accurately evaluated output at the previous time, hence the output of the PAND gate
by using the non-Bernoulli sequences as initial input probabili- at time , is determined by three factors:
ties. When the inputs of a gate are correlated, the output is also 1) the current states of the input basic events and at time
determined by the correlation between the input signals. How- and ;
ever, signal correlation (usually caused by the re-convergence 2) the inverted state of basic event at time
of signals) is handled efficiently in stochastic computation. This ; and
feature is a particularly favorable property for handling the re- 3) the output of the PAND gate at time .
peated input events in a complex DFT. Hence, the output of the PAND gate at time is given by
(7)
B. A Two-Input PAND Gate Model A stochastic logic model can be constructed to determine the
failure of the two-input PAND gate, as shown in Fig. 5.
Let , and respectively indicate the states of basic Following the hypothesis in Section III, all basic events are
events , and at time , and , and for the states at fault free at the beginning of the mission time; thus, the input
time . If both and fail at time , i.e., , and signals of the model in Fig. 5 are zeros. In Fig. 5(a), if the basic
, then the failure time of the basic events and is event fails before time if , and .
given by Then, at time . However, if , which
indicates that fails before or both events fail simultaneously
(4) at time , then , and . Because
number of 1s in these sequences for the two probabilities are

given by
(8)
The difference of the number of 1s is then
(9)
Further assume that the non-Bernoulli sequence for the prob-

ability in is given by . Then the sequence
for the probability in can be obtained by ran-
domly assigning 1s to replace the 0s in . Be-
cause the 1s in are a subset of those in , we
obtain
(10)
Fig. 5. (a) A stochastic logic model for a two-input PAND gate, and (b) the
decomposition of the three-input AND gate in (a) into two-input AND gates. 3) Stochastic Model Validation:
Theorem 1: Compared to an accurate analysis, a stochastic
simulation of the two-input PAND gate model in Fig. 5, using
and cannot be 1 at the same time, either or large non-Bernoulli sequences of random permutations of fixed
results in . Otherwise, the state of the top numbers of 1s and 0s as initial input probabilities, produces the
event remains zero. From this analysis, it can be seen that the same increment in the failure probabilities of two adjacent time
stochastic PAND model in Fig. 5 computes (7), thus it accurately intervals when .
implements the function of the PAND gate. Proof: Assume that the failure probabilities of the PAND
gate at time and are given by , and
, respectively; we show that the failure proba-
bility of in the stochastic model in Fig. 5 is the same as the in-
C. Model Validation crement in the output failure probability of the PAND gate from
time to , i.e., .
To validate the proposed stochastic PAND model, the dis- Given the basic events , and with the probability density
cretization of a continuous probability distribution and the gen- functions (pdfs) , and respectively, the failure prob-
eration of stochastic non-Bernoulli sequences are introduced ability for the two-input PAND gate’s output (when both
next, followed by a theoretical proof. and fail or fails before , i.e., ), is given by
1) Discretization: Assume that the mission time is divided
into equal time intervals, i.e., . Due to the nature (11)
of discretization, a failure probability is estimated more pre-
cisely at time with a larger . However, a longer run time
is required as more stochastic sequences need to be generated. For an exponential distribution, (11) becomes
Hence, is determined by a tradeoff between accuracy and
efficiency. With a reasonable , the discretization provides a (12)
relatively accurate estimation of the continuous failure proba-
bility of a basic event. which leads to the failure probability of the sequential event
2) Generation of Non-Bernoulli Sequences: Assume that the as (13). Equation (13) can be obtained by using an
failure probabilities for the two adjacent time intervals, analytical approach [7] or an algebraic analysis [11].
, and , are given by , and By discretization, the failure probabilities of the sequential
respectively. If we use non-Bernoulli sequences of bits, as a event at time , and are given by (14) and (15),
random permutation of a fixed number of 0s and 1s, then the respectively. Equation (14) can also be written as (16).
(13)
Because for any is negligible when Let be the stochastic sequence generated for the prob-
, applying a Taylor series expansion on (16) leads to ability of the basic event at time , and be
(17). the stochastic sequences for the basic event at time , and
From (15) and (17), the probability increment for two adja- respectively. In the model of Fig. 5, the inverter’s output
cent times is obtained as (18). sequence, , is given by
Next, the stochastic analysis of the increased probability
between two adjacent times is pursued. Let , and (24)
indicate the cumulative density functions (cdfs) for the con-
tinuous, and discretized distributions respectively. By applying For the three-input AND gate in Fig. 5(a), its output sequence
the discretization process to the exponential distributions of the is obtained as (25).
basic events (i.e., and ), we have Similar to (10), the probability encoded in the sequence
is given by ,
(19) i.e., the probability increment for the basic event in two ad-
jacent times.
By (22) and (23), this probability increment is thus
(20)
where is the number of equally discretized time intervals . (26)

Hence, the input probabilities of , and at time and
are given by Considering as the probability encoded in the se-
quence , together with (21) and (26), the probability
(21) increment in is given by (27).
(22) The application of a Taylor series expansion on (27) leads
to a first-order approximation given by (18). This result shows
(23) that the proposed stochastic model accurately implements the
(14)
(15)
(16)
(17)
(18)
(25)
function of a two-input PAND gate for exponentially distributed When , we have

events, i.e.,
(28)
(32)
Next, the proof of the theorem is pursued in the general case
when the basic events are non-exponentially distributed. By an
integral analysis, the failure probability of the two input PAND (33)
gate at time is given by (29).
Similarly, this failure probability at time is given by In this case, (31) becomes (34).
(30). When , see (35).
The increment of the failure probabilities between and Because , we obtain (36). The right
is then (31). hand side of (36) is the failure probability increment computed
(27)
(29)
(30)
(31)
(34)
by the stochastic model of PAND in Fig. 5. This result proves

Theorem 1 in the general case.
4) Analysis of the Increment in Failure Probability: If
, and are the non-Bernoulli sequences for the
failure probabilities of the basic event , and ,
at time , and respectively, the mean number of elements
equal to 1 in the non-Bernoulli sequence of bits is
then , and the variance is 0 (by the nature of the
non-Bernoulli sequence). This result indicates that the use of
non-Bernoulli sequences results in a deterministic initial value.
Because there is no variation in the input signal of the inverter,
the variance in the inverter’s output sequence is 0 as
. Hence, the mean and variance of
the number of 1s in the sequence are given by
(37)
(38)
respectively. In Fig. 5(b), the first AND gate’s output sequence Fig. 6. (a) A three-input PAND gate, and (b) a successive cascading model of
is given by , the three-input PAND gate in (a).
where is statistically dependent on , as discussed
previously. The mean, and variance of the number of elements
equal to 1 in the first AND gate’s output sequence are then given Fig. 5(b), and thus at the output of the three-input AND gate in
by Fig. 5(a), than using Bernoulli sequences.
If the expected probability of is given by ,
(39) where indicates the number of 1s in the sequence ,
through a combinatorial analysis and the application of Stir-
ling’s formula [28], [29], the number of elements equal to 1 in
(40)
the output stochastic sequence of bits follows approx-
respectively. (39) and (40) indicate that is also a non- imately a Gaussian distribution, i.e.,
Bernoulli sequence.
For the basic event , a non-Bernoulli sequence at time (41)
, is generated for the failure probability . For the
last AND gate in Fig. 5(b), the input sequences and with (42) and (43), as well as with a mean, and variance
are for two statistically independent signals. Per Lemma given by and
1, therefore, the use of non-Bernoulli sequences produces a
more accurate result at the output of the last AND gate in respectively.
(35)
(36)
(42)
(43)
Fig. 7. The failure probabilities obtained by using the stochastic, Monte Carlo (MC) [23], and analytical [7] methods for the two-input PAND gate in Fig. 1(a).
D. Generalization of the PAND Model ities of basic events are assumed to be exponentially distributed,
A multiple-input PAND gate can be converted to a succes- with . The mission time is 300 hours,
sively cascaded model of two-input PAND gates. Take a three- and the time interval for discretization is one hour, i.e.,
input PAND as an example, as shown in Fig. 6(a); its cascaded hour.
model is shown in Fig. 6(b). Assume that the failure order of the A quantitative analysis of the two-input PAND gate is first
three inputs is from left to right, i.e., . Then, if the performed using the stochastic PAND model. The results are
failures of the input events occur in this order, the output is compared with those obtained by using the Monte Carlo (MC)
1; otherwise, is 0. [23] and analytical [7] methods, as shown in Fig. 7. In Fig. 7
In the cascaded model in Fig. 6(b), a 1 at the gate output (and all subsequent figures and tables, wherever applicable),
indicates that the intermediate event fails before the basic is the number of simulation runs for the MC method, and is
event , or both and fail at the same time. Because the sequence length for the stochastic approach. It can be seen
is caused by the fact that the basic event fails before , or that the stochastic approach produces very accurate results com-
both and fail at the same time, the gate output pared to the MC and accurate analysis methods.
means that the sequential event occurs; thus the Because a continuous failure distribution is discretized into
cascaded model implements the function of a three-input PAND time intervals, the stochastic analysis results in a vector of
gate. This model can be generalized for an arbitrary multiple- the failure probability of the top event at every time interval,
input PAND gate. . Let , and denote
In summary, for a DFT with priority relationships, the sto- the failure probability vectors obtained by the stochastic ap-
chastic two-input PAND model and the successive cascading proach, an accurate analysis [7], and the MC method [23]. While
model can be utilized in an FTA using the non-Bernoulli an accurate result can be efficiently obtained by using an SBDD
sequences generated for discretized probabilities of the basic method [19], [20] or an algebraic analysis [11], a direct integral
events. The failure probability of the top event is encoded in the method is used in this work for an accurate analysis. Albeit very
statistics, i.e., the proportion of the number of 1s in the output fast for a simple DFT analysis, such accurate analysis may be-
sequence of the stochastic analysis. come cumbersome in the evaluation of large DFTs. Further, let
indicate the discrepancies between the failure prob-
V. CASE STUDIES AND VALIDATION RESULTS ability vectors obtained from the MC method in [23] and the
In this section, several case studies are presented to show accurate analysis in [7], and let indicate the discrepan-
the accuracy, efficiency, and the ability of dealing with repeated cies between the failure probability vectors obtained from the
basic events of the stochastic PAND model. Simulations are per- stochastic approach and the accurate analysis in [7]. The three
formed for both exponential and non-exponential distributions norms, , and , are then used to measure the
of basic events. The results are compared with those obtained by differences of the failure probability vectors. For a vector , the
using accurate analysis and simulation-based approaches. Sim- norms are defined as ,
ulations are run on a computer with a 3.10 GHz i3-2100 micro- and .
processor and 6 GB memory. The results are shown in Table I for the two-input PAND gate
with various sequence lengths for the stochastic approach. The
A. Validation of the Stochastic PAND Models average run time is also shown for comparing the efficiency. Un-
Example 1: For a two-input PAND gate and a three-input less otherwise noted, ten experiments are run in each case study
PAND gate, as shown in Figs. 1(a) and 6(a), the failure probabil- for obtaining the norm values and average run time. As shown
TABLE I
ACCURACY AND RUN TIME OF THE STOCHASTIC APPROACH AND MONTE CARLO (MC) SIMULATION [23],
COMPARED TO AN ACCURATE ANALYSIS [7], FOR THE TWO-INPUT PAND GATE IN FIG. 1(A)
Fig. 8. The differences in the failure probability obtained by using the stochastic approach and an accurate analysis at different mission times for the two-input
PAND gate.
Fig. 9. The differences in the failure probability obtained by using the stochastic approach and an accurate analysis at different mission times for the three-input
PAND gate.
in Table I, the smaller norm values and shorter run time indicate in Fig. 8, the stochastic approach can produce very accurate
that the stochastic analysis using the non-Bernoulli sequences results as compared to the accurate analysis [7] by using a large
is more accurate and more efficient than the MC method. sequence length (e.g. 100k bits) for the two-input PAND gate.
The accuracy of the stochastic approach can further be Next, the failure probability of a three-input PAND gate is
improved by using longer stochastic sequences. As shown evaluated by using the successive cascading PAND model and
TABLE II
ACCURACY AND RUN TIME OF THE STOCHASTIC APPROACH AND MONTE CARLO (MC) SIMULATION [23],
COMPARED TO AN ACCURATE ANALYSIS [7], FOR THE THREE-INPUT PAND GATE IN EXAMPLE 1
the stochastic approach. Simulations are run for different se-

quence lengths, and the obtained failure probability vectors are
compared with those obtained by an accurate analysis. As re-
vealed in Table II, the norms of the differences of the computed
failure probability vectors indicate that a stochastic analysis of
the PAND model is more accurate and more efficient than an
MC method. As shown in Fig. 9, moreover, the accuracy of the
stochastic approach can further be improved by using longer
stochastic sequences.
B. A DFT With Repeated Events

A DFT with PAND gates and repeated events is analyzed next
using the stochastic approach.
Example 2 (From [2]): A DFT consists of 5 logic gates (4 OR
gates, 1 AND gate) and 2 dynamic gates (PANDs) with 9 basic
events, as shown in Fig. 10. The failure rates of the basic events
are exponentially distributed with for .
The basic events and are repeated events. The maximum
mission time is 300 hours.
The simulation results by the stochastic approach with dif-
ferent sequence lengths and the MC method [23] with different
numbers of simulations are shown in Table III for several mis-
sion times. As can be seen, the stochastic approach computes
the failure probability of the top event with a better efficiency
than the MC method. This indicates that the stochastic ap-
proach using the non-Bernoulli sequences as initial inputs can
efficiently evaluate the reliability of a dynamic system with
repeated events. The accuracy improves with the increase of
the length of the stochastic sequences.
Fig. 10. Example 2: a DFT with repeated events and [2].
C. A DFT With Events of Non-Exponential Distributions
The presence of a large number of basic events makes it very
difficult to derive the top event’s failure probability using an ac- significantly with the number of basic events. It is also diffi-
curate analysis approach, because a large number of states need cult to evaluate a PAND gate with intermediate events as in-
to be considered, and the complexity of an analysis increases puts. The problem becomes even more challenging when the
TABLE III
THE TOP EVENT’S FAILURE PROBABILITY OF THE DFT IN FIG. 10, WITH THE TOTAL MISSION TIME OF 300 HOURS
TABLE IV
THE FAILURE RATES OF THE BASIC EVENTS IN EXAMPLE 3 [23]
The failure probability of the top event can be obtained by the

algebraic analysis in [11] as
(44)
where
(45)
(46)
In a practical system, a non-exponential distribution may be

required for a more accurate modeling of a basic event’s failure.
Although an approximate result can be obtained by using an al-
gebraic analysis, it becomes cumbersome for an algebraic anal-
ysis to accurately evaluate such systems due to the complexity
involved in deriving a closed form of analytical expressions.
Fig. 11. Example 3, a DFT with intermediate events as the inputs of a PAND In this section, the Weibull distribution is considered to show
gate [23]. that a DFT with non-exponentially distributed basic events can
be handled by the stochastic approach. The probability den-
sity function (pdf) and cumulative density function (cdf) of the
basic events’ failures are not exponentially distributed. In this Weibull distribution is given by
section, it is shown that these issues are effectively addressed
by the stochastic approach, as illustrated by Example 3. (47)
Example 3 (From [23]): A DFT consists of a relatively large
number of basic events, while the inputs of a PAND gate are two and
intermediate events, as shown in Fig. 11. (48)
Fig. 12. The failure probability of the top event with non-exponentially distributed basic events.
respectively, where , and are the shape, and scale parameters

of the Weibull distribution respectively.
Assume that, in the DFT in Fig. 11, the basic events J, K, L
follow a Weibull distribution with and , while
the other basic events are exponentially distributed with failure
rates given in Table IV [23].
For this system, the failure probability of the top event is
plotted for a mission time of 300 hours, as shown in Fig. 12,
for both the stochastic approach and the MC method [23]. The
norms of the differences of the failure probability vectors ob-
tained by the stochastic and Monte Carlo (MC) methods are
, and .
Because the encoding of a failure probability into a stochastic
sequence is not limited to those of exponential distributions,
a DFT with non-exponentially distributed basic events can be
accurately evaluated by the stochastic approach, as shown in
Fig. 12. Hence, the proposed stochastic approach is applicable Fig. 13. Example 4, a fault tree with repeated events and non-exponentially
to both exponential and non-exponential distributions in a DFT distributed ones.
analysis.
D. A Fault Tree With Repeated Events and Non-Exponential As revealed in Fig. 14 and Table V, a stochastic analysis of
Distributed Ones the fault tree is more accurate and more efficient than an MC
Finally, a fault tree without dynamic gates, but with repeated method compared with the accurate analysis [7], as shown by
events and non-exponentially distributed ones, is considered. the run time and norms of the differences in the failure proba-
This fault tree is developed from the DFT in Fig. 11 by replacing bility vectors. Hence, a DFT with non-exponentially distributed
the PAND gate with an AND gate and inserting a repeated event basic events and repeated events can be efficiently evaluated by
, as Example 4 shown in Fig. 13. The failure rates of the ex- the stochastic approach.
ponentially-distributed basic events are assumed to be the same
as those in Example 3, while the non-exponentially distributed VI. CONCLUSION
events J, K, L follow a Weibull distribution with and A stochastic model is proposed for the analysis of a two-input
. PAND gate in a dynamic fault tree (DFT). This model is then
For this fault tree, the failure probability of the top event is used in a successive cascading structure for the analysis of a
plotted for a mission time of 300 hours, as shown in Fig. 14, for general multiple-input PAND gate. For a DFT with PAND gates,
both the stochastic approach and the MC method [23]. A more a stochastic approach using the proposed models provides an ef-
detailed comparison is given in Table V. ficient analysis of the DFT compared to an accurate or algebraic
Fig. 14. The failure probability of the top event with non-exponentially distributed basic events.
TABLE V
ACCURACY COMPARISON AND RUN TIME OF THE STOCHASTIC APPROACH AND MONTE CARLO (MC) SIMULATION [23] FOR THE DFT IN EXAMPLE 4
approach. The use of non-Bernoulli sequences of random per- REFERENCES

mutations of fixed numbers of 1s and 0s as initial input event
probabilities makes the stochastic approach more efficient and [1] C. A. Ericson, II, “Fault tree analysis—A history,” in Proc. 17th Int. l
Syst. Safety Conf., Aug. 16–21, 1999.
accurate than Monte Carlo simulation. The stochastic approach [2] T. Yuge and S. Yanagi, “Quantitative analysis of a fault tree with
has the following features. priority AND gates,” Rel. Eng. Syst. Safety, vol. 93, no. 11, pp.
• The failure probability of a basic event is not limited to 1577–1583, 2008.
[3] N. G. Leveson, Safeware: Syst. Safety Computers. : Addison-Wesley,
an exponential distribution; any failure distribution can be 1995.
analyzed by an appropriate sampling and coding into the [4] H. Boudali, P. Crouzen, and M. Stoelinga, “A rigorous, compositional,
stochastic non-Bernoulli sequences. and extensible framework for dynamic fault tree analysis,” IEEE Trans.
Dependable Secure Comput., vol. 7, no. 2, pp. 128–143, 2010.
• Repeated events are correctly and readily handled in a DFT [5] M. Stamatelatos and W. Vesely, NASA Office of Safety and Mission As-
analysis, because signal correlation is maintained in the surance. : Fault Tree Handbook With Aerospace Applications, 2002,
random binary bit streams and the propagation of the sto- vol. 1.1, pp. 1–205.
[6] E. J. Henley and H. Kumamoto, Reliability Engineering and Risk Ass-
chastic sequences in a fault tree analysis. esment. Englewood Cliffs, NJ, USA: Prentice Hall, 1981.
• The stochastic approach avoids the state-space explosion [7] S. Amari, G. Dill, and E. Howald, “A new approach to solve dynamic
problem or the large computational complexity typically fault trees,” in Proc. Int. Annu. IEEE Rel. Maintainability Symp., 2003,
pp. 374–379.
encountered in a Markov or analytical method, thus it is [8] M. A. Boyd, “Dynamic Fault Tree Models: Techniques for Analyses of
scalable for use in a general DFT analysis. Advanced Fault Tolerant Computer Systems,” Ph.D. dissertation, Dept.
Ongoing work includes the stochastic modeling of other types Comput. Sci., Duke Univ., Durham , NC, USA, 1991.
[9] J. B. Dugan, S. J. Bavuso, and M. A. Boyd, “Dynamic fault-tree models
of gates in a DFT, and the incorporation of repair schemes and for fault-tolerant computer systems,” IEEE Trans. Rel., vol. 41, no. 3,
common cause failures into an FTA. pp. 363–377, Sep. 1992.
[10] J. B. Dugan, K. J. Sullivan, and D. Coppit, “Developing a low-cost Peican Zhu received the B.S. degree in 2008, and the M.Sc. degree in 2011,
high-quality software tool for dynamic fault-tree analysis,” IEEE both from the Northwestern Polytechnical University (NWPU), Xi’an, Shaanxi,
Trans. Rel., vol. 49, no. 1, pp. 49–59, Mar. 2000. China. He is currently working towards the Ph.D. degree in the Department of
[11] G. Merle, J.-M. Roussel, J.-J. Lesage, and A. Bobbio, “Probabilistic al- Electrical and Computer Engineering, University of Alberta, Edmonton, AB,
gebraic analysis of fault trees with priority dynamic gates and repeated Canada. His current research interests include stochastic computational models
events,” IEEE Trans. Rel. , vol. 59, no. 1, pp. 250–261, Mar. 2010. for system reliability analysis, gene network models, and pathway analysis.
[12] H. Boudali, P. Crouzen, and M. Stoelinga, “Dynamic fault tree analysis
through input/output interactive Markov chains,” in Proc. Int. Conf.
Dependable Syst. Netw. (DSN 2007), 2007, pp. 25–38.
[13] A. Ejlali and S. Miremadi, “FPGA-based Monte Carlo simulation for Jie Han received the B.Sc. degree in electronic engineering from Tsinghua Uni-
fault tree analysis,” Microelectron. Rel., vol. 44, no. 6, pp. 1017–1028, versity, Beijing, China, in 1999, and the Ph.D. degree from Delft University of
2004. Technology, The Netherlands, in 2004. He is currently an assistant professor in
[14] G. Merle, J.-M. Roussel, and J.-J. Lesage, “Improving the efficiency the Department of Electrical and Computer Engineering at the University of Al-
of dynamic fault tree analysis by considering gates FDEP as static,” berta, Edmonton, AB, Canada. His research interests include reliability and fault
in Proc. Eur. Safety Rel. Conf. 2010 (ESREL2010), Rhodes, Greece, tolerance, nanoelectronic circuits and systems, and novel computational models
2010, pp. 845–851. for nanoscale and biological applications. Dr. Han was nominated for the 2006
[15] H. Aliee and H. R. Zarandi, “A fast and accurate fault tree analysis Christiaan Huygens Prize of Science by the Royal Dutch Academy of Science
based on stochastic logic implemented on field-programmable gate ar- (Koninklijke Nederlandse Akademie van Wetenschappen (KNAW) Christiaan
rays,” IEEE Trans. Rel., vol. 62, no. 1, pp. 13–22, Mar. 2013. Huygens Wetenschapsprijs). His work was recognized by the 125th anniversary
[16] L. Xing and G. Levitin, “Combinatorial algorithm for reliability anal- issue of Science, for developing theory of fault-tolerant nanocircuits. Dr. Han
ysis of multistate systems with propagated failures and failure isolation served as a General Chair and Technical Program Chair in IEEE International
effect,” IEEE Transa. Syst., Man, Cybern., Part A: Syst. Humans 2011, Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Sys-
vol. 41, no. 6, pp. 1156–1165. tems (DFTS) 2013 and 2012, respectively, and as a Technical Program Com-
[17] L. Xing, “An efficient binary-decision-diagram-based approach for mittee Member in several other international symposia and conferences.
network reliability and sensitivity analysis,” IEEE Trans. Syst., Man,
Cybern., vol. 38, no. 1, pp. 105–115, Jan. 2007.
[18] H. Boudali and J. B. Dugan, “A discrete-time Bayesian network reli-
ability modeling and analysis framework,” Rel. Eng. Syst. Safety, vol.
87, no. 3, pp. 337–349, 2005. Leibo Liu received the B.S. degree in electronic engineering from Tsinghua
[19] O. Tannous, L. Xing, and J. B. Dugan, “Reliability analysis of warm University, Beijing, China, in 1999 and the Ph.D. degree in Institute of Micro-
standby systems using sequential BDD,” in Proc. 57th Annu. Rel. electronics, Tsinghua University, in 2004. He currently serves as an Associate
Maintainability Symp., FL, USA, 2011. Professor in Institute of Microelectronics, Tsinghua University. His research in-
[20] L. Xing, O. Tannous, and J. B. Dugan, “Reliability analysis of non- terests include Reconfigurable Computing, Mobile Computing and VLSI DSP.
repairable cold-standby systems using sequential binary decision dia- Dr. Liu has published more than 70 refereed papers, and served as TPC member
grams,” IEEE Trans. Syst., Man Cybern.—Part A: Syst. Humans, vol. or reviewers for several international key conferences and leading journals.
42, no. 3, May 2012.
[21] A. Rauzy, “Sequence algebra, sequence decision diagrams and dy-
namic fault trees,” Rel. Eng. Syst. Safety, vol. 96, no. 7, p. 8, 2011.
[22] L. Xing, A. Shrestha, and Y. Dai, “Exact combinatorial reliability anal- Ming J. Zuo received the Bachelor of Science degree in agricultural engi-
ysis of dynamic systems with sequence-dependent failures,” Rel. Eng. neering in 1982 from Shandong Institute of Technology, China; and the Master
Syst. Safety, vol. 96, no. 10, pp. 1375–1385, 2011. of Science degree in 1986, and the Ph.D. degree in 1989, both in industrial en-
[23] R. K. Durga, V. Gopika, and R. V. Sanyasi, “Dynamic fault tree anal- gineering from Iowa State University, Ames, Iowa, U.S.A. He is currently Pro-
ysis using Monte Carlo simulation in probabilistic safety assessment,” fessor in the Department of Mechanical Engineering at the University of Al-
Rel. Eng. Syst. Safety, vol. 94, no. 4, pp. 872–883, 2009. berta, Canada. His research interests include system reliability analysis, main-
[24] H. Chen and J. Han, “Stochastic computational models for accurate re- tenance modeling and optimization, signal processing, and fault diagnosis. He
liability evaluation of logic circuits,” in Proc. Great Lakes Symp. VLSI is an Associate Editor of IEEE Transactions on Reliability, Department Ed-
(GLVLSI), Providence, RI, USA, 2010, pp. 61–66. itor of IIE Transactions (2005–2008, 2011–present), Regional Editor for North
[25] J. Han, H. Chen, J. Liang, P. Zhu, Z. Yang, and F. Lombardi, “A sto- and South American region for International Journal of Strategic Engineering
chastic computational approach for accurate and efficient reliability Asset Management, and Editorial Board Member of Reliability Engineering
evaluation,” IEEE Trans. Comput., 2013, in press. Advance access in and System Safety, Journal of Traffic and Transportation Engineering, Inter-
IEEE xplore. national Journal of Quality, Reliability and Safety Engineering, and Interna-
[26] J. B. Dugan, S. J. Bavuso, and M. A. Boyd, “Fault trees and sequence tional Journal of Performability Engineering. He is Fellow of the Institute of
dependencies,” in Proc. Rel. Maintainable Symp., 1990, pp. 286–293. Industrial Engineers (IIE), Fellow of the Engineering Institute of Canada (EIC),
[27] B. R. Gaines, “Stochastic computing systems,” Adv. Inf. Syst. Sc., vol. Founding Fellow of the International Society of Engineering Asset Management
2, pp. 37–172, 1969. (ISEAM), and Senior Member of IEEE.
[28] J. von Neumann, “Probabilistic logics and the synthesis of reliable
organisms from unreliable components,” in Automata Studies, C. E.
Shannon and J. McCarthy, Eds. Princeton, NJ, USA: Princeton Univ.
Press, 1956, pp. 43–98.
[29] J. Han, Fault-Tolerant Architectures for Nanoelectronic and Quantum
Devices. Veenendaal, The Netherlands: Universal Press, 2004, A
Ph.D. dissertation of the Delft University of Technology, 1-135. ISBN:
90-9018888-6.

A Stochastic Approach For The Analysis of Fault

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

A Stochastic Approach For The Analysis of Fault

Uploaded by

Copyright:

Available Formats

480 IEEE TRANSACTIONS ON RELIABILITY, VOL. 63, NO.

A Stochastic Approach for the Analysis of Fault

Fig. 3. A stochastic encoding using a sequence of 10 bits for a probability of

, and 0 otherwise. A generic timing diagram for A. Stochastic Computational Models

number of 1s in these sequences for the two probabilities are

The difference of the number of 1s is then

Further assume that the non-Bernoulli sequence for the prob-

where is the number of equally discretized time intervals . (26)

function of a two-input PAND gate for exponentially distributed When , we have

by the stochastic model of PAND in Fig. 5. This result proves

the stochastic approach. Simulations are run for different se-

B. A DFT With Repeated Events

The failure probability of the top event can be obtained by the

In a practical system, a non-exponential distribution may be

respectively, where , and are the shape, and scale parameters

approach. The use of non-Bernoulli sequences of random per- REFERENCES

You might also like