Failure Modes, Effects and Diagnostic Analysis: Project: Plant-STOP 9469
Failure Modes, Effects and Diagnostic Analysis: Project: Plant-STOP 9469
Project:
Plant-STOP 9469
Customer:
R. STAHL Schaltgeräte GmbH
Waldenburg
Germany
Jan Hettenbach
The document was prepared using best effort. The authors make no warranty of any kind and shall not be liable in
any event for incidental or consequential damages in connection with the application of the document.
© All rights on the format of this technical report reserved.
Management summary
This report summarizes the results of the hardware assessment carried out on the Universal
Module HART 9469 with hardware version as listed in the drawings referenced in section 2.5.1.
Table 1 gives an overview of the considered variants.
The hardware assessment consists of a Failure Modes, Effects and Diagnostics Analysis
(FMEDA). A FMEDA is one of the steps taken to achieve functional safety assessment of a
device per IEC 61508. From the FMEDA, failure rates are determined and consequently the
Safe Failure Fraction (SFF) can be calculated for a subsystem. For full assessment purposes all
requirements of IEC 61508 must be considered.
Table 1: Configuration overview
Configuration Description
2-wire Iout (channel 0-7) and 4-wire supply output (channel 4-7), only Plant
9469/35-08-12
Stop Function considered
Channel 0 1 2 3 4 5 6 7
Output (+24 V) - - - - 9 13 17 21
Iout 1 3 5 7 10 14 18 22
Earth (-) 2 4 6 8 12 16 20 24
Because of the functionality, in a mixed input output configuration, the inputs (Iin) are switched
off as well, but this is not part of the safety function!
For safety applications only the described variant of the Universal Module HART 9469 have
been considered. All other possible variants and configurations are not covered by this report.
The failure modes used in this analysis are from the exida Electrical Component Reliability
Handbook (see [N2]). The failure rates used in this analysis are the basic failure rates from the
Siemens standard SN 29500 (see [N3]).
The Universal Module HART 9469 can be considered to be Type A1 element with a hardware
fault tolerance of 0. Only the Plant Stop Function of the Universal Module HART 9469 was
considered.
The failure rates are valid for the useful life of the Universal Module HART 9469 (see
Appendix A) when operating as defined in the considered scenarios
The following table show how the above stated requirements are fulfilled for the considered
Universal Module HART 9469.
1
Type A element: “Non-complex” element (all failure modes are well defined); for details see 7.4.4.1.2 of
IEC 61508-2.
© exida.com GmbH Stahl_9469_1505-054_R036_V1R1.docx; February 11, 2019
Jan Hettenbach Page 2 of 23
Table 2: Failure rates of Universal Module HART 9469, 2- wire output per IEC 61508:2010
SIL AC 5 SIL2
2
DU includes 5% common cause failures between redundant components of disconnection circuit.
3
Failures of redundant components of the Plant- STOP function are part of Annunciation Undetected failures.
4
The complete subsystem will need to be evaluated to determine the overall Safe Failure Fraction. The number listed
is for reference only.
5
SIL AC (architectural constraints) means that the calculated values are within the range for hardware architectural
constraints for the corresponding SIL but does not imply all related IEC 61508 requirements are fulfilled. In addition it
must be shown that the device has a suitable systematic capability for the required SIL and that the entire safety
function can fulfill the required PFD values.
© exida.com GmbH Stahl_9469_1505-054_R036_V1R1.docx; February 11, 2019
Jan Hettenbach Page 3 of 23
Table 3: Failure rates of Universal Module HART 9469, 4- wire output per IEC 61508:2010
SIL AC 9 SIL2
6
DU includes 5% common cause failures between redundant components of disconnection circuit.
7
Failures of redundant components of the Plant- STOP function are part of Annunciation Undetected failures.
8
The complete subsystem will need to be evaluated to determine the overall Safe Failure Fraction. The number listed
is for reference only.
9
SIL AC (architectural constraints) means that the calculated values are within the range for hardware architectural
constraints for the corresponding SIL but does not imply all related IEC 61508 requirements are fulfilled. In addition it
must be shown that the device has a suitable systematic capability for the required SIL and that the entire safety
function can fulfill the required PFD values.
© exida.com GmbH Stahl_9469_1505-054_R036_V1R1.docx; February 11, 2019
Jan Hettenbach Page 4 of 23
Table of Contents
Management summary .................................................................................................... 2
1 Purpose and Scope ................................................................................................... 6
2 Project management .................................................................................................. 7
2.1 exida ...............................................................................................................................7
2.2 Roles of the parties involved ...........................................................................................7
2.3 Standards / Literature used .............................................................................................8
2.4 exida tools used .............................................................................................................8
2.5 Reference documents .....................................................................................................9
2.5.1 Documentation provided by the customer ............................................................9
2.5.2 Documentation generated by the customer and exida ........................................9
3 Product Description.................................................................................................. 10
4 Failure Modes, Effects, and Diagnostic Analysis ..................................................... 11
4.1 Description of the failure categories ..............................................................................11
4.2 Methodology – FMEDA, Failure rates ...........................................................................12
4.2.1 FMEDA...............................................................................................................12
4.2.2 Failure rates .......................................................................................................12
4.2.3 Assumptions.......................................................................................................13
4.3 Results ..........................................................................................................................14
4.3.1 Failure rates of Universal Module HART 9469, 2- wire output per IEC
61508:2010 ........................................................................................................15
4.3.2 Failure rates of Universal Module HART 9469, 4- wire output per IEC
61508:2010 ........................................................................................................16
5 Using the FMEDA results......................................................................................... 17
5.1 Example PFDAVG / PFH calculation ...............................................................................18
6 Terms and Definitions .............................................................................................. 20
7 Status of the document ............................................................................................ 21
7.1 Liability ..........................................................................................................................21
7.2 Releases .......................................................................................................................21
7.3 Release Signatures.......................................................................................................21
Appendix A: Lifetime of Critical Components................................................................. 22
Appendix B: Proof tests to detect dangerous undetected faults .................................... 23
This document shall describe the results of the hardware assessment carried out on the
Universal Module HART 9469 with hardware version as listed in the drawings referenced in
section 2.5.1.
The FMEDA builds the basis for an evaluation whether an element including the described
Universal Module HART 9469 meets the average Probability of Failure on Demand (PFDAVG) /
Probability of dangerous Failure per Hour (PFH) requirements and if applicable the architectural
constraints / minimum hardware fault tolerance requirements per IEC 61508 / IEC 61511. It
does not consider any calculations necessary for proving intrinsic safety.
2.1 exida
exida is one of the world’s leading accredited Certification Bodies and knowledge companies
specializing in automation system safety, availability, and cybersecurity with over 500 person
years of cumulative experience in functional safety, alarm management, and cybersecurity.
Founded by several of the world’s top reliability and safety experts from manufacturers,
operators and assessment organizations, exida is a global corporation with offices around the
world. exida offers training, coaching, project oriented consulting services, safety engineering
tools, detailed product assurance and ANSI accredited functional safety and cybersecurity
certification. exida maintains a comprehensive failure rate and failure mode database on
electronic and mechanical equipment and a comprehensive database on solutions to meet
safety standards such as IEC 61508.
[D1] 9400 0 000 050 0_00.docx Product and functional safety description, Index
0 of 09.01.2019
[D2] 9469 6 020 010 0_03.pdf Circuit diagram, Index 3 of 05.06.2018
[D3] 946960310010_00_de_en.pdf Datasheet V0 of 19.02.2018
The list above only means that the referenced documents were provided as basis for the
FMEDA but it does not mean that exida checked the correctness and completeness of these
documents.
[R1] FMEDA_Stahl-9469-V1R1.efmx
The Universal Module HART 9469 is universal supply unit for 2-wire or 4-wire signal
transformer. It can also be used as supply for output for final elements. It is intended to be used
in explosion hazardous areas. Safe state of the Universal Module HART 9469 is to switch-off
the supply voltage of the output signals, which is leading to signal loss in case of use as signal
transformer or power down in case of use as supply for final elements.
The usecase of the available outputs can be configured by software. The configuration and the
controller part of the Universal Module HART 9469 is not part of the safety function, so only the
power- down part was considered and thereby, it can be considered to be Type A element with
a hardware fault tolerance of 0. Decoupling components are also part of the safety function.
The safety function of the Universal Module HART 9469 is to switch-off the supply voltage of the
output signals by using the Plant-STOP function. This function is only available, if the analog or
digital output of the Universal Module HART 9469 is used in an application.
Power Supply
Emergency stop
button Supply
output
Signal
conditioning
Disconnection circuit
Decoupling Decoupling
Part of FMEDA
Microcontroller
The Failure Modes, Effects, and Diagnostic Analysis was done together with R. STAHL
Schaltgeräte GmbH and is documented in [R1].
A Failure Modes and Effects Analysis (FMEA) is a systematic way to identify and evaluate the
effects of different component failure modes, to determine what could eliminate or reduce the
chance of failure, and to document the system in consideration.
An FMEDA (Failure Mode Effect and Diagnostic Analysis) is an FMEA extension. It combines
standard FMEA techniques with extension to identify online diagnostics techniques and the
failure modes relevant to safety instrumented system design. It is a technique recommended to
generate failure rates for each important category (safe detected, safe undetected, dangerous
detected, dangerous undetected, fail high, fail low) in the safety models. The format for the
FMEDA is an extension of the standard FMEA format from MIL STD 1629A, Failure Modes and
Effects Analysis.
The failure modes used in this analysis are from the exida Electrical Component Reliability
Handbook (see [N2]). The failure rates used in this analysis are the basic failure rates from the
Siemens standard SN 29500 (see [N3]). The rates were chosen in a way that is appropriate for
safety integrity level verification calculations and the intended applications. It is expected that
the actual number of field failures due to random events will be less than the number predicted
by these failure rates.
For hardware assessment according to IEC 61508 only random equipment failures are of
interest. It is assumed that the equipment has been properly selected for the application and is
adequately commissioned such that early life failures (infant mortality) may be excluded from
the analysis.
Failures caused by external events should be considered as random failures. Examples of such
failures are loss of power or physical abuse.
The assumption is also made that the equipment is maintained per the requirements of IEC
61508 or IEC 61511 and therefore a preventative maintenance program is in place to replace
equipment before the end of its “useful life”.
The user of these numbers is responsible for determining their applicability to any particular
environment. Some industrial plant sites have high levels of stress. Under those conditions the
failure rate data is adjusted to a higher value to account for the specific conditions of the plant.
Accurate plant specific data may be used for this purpose. If a user has data collected from a
good proof test reporting system such as exida SILStatTM that indicates higher failure rates, the
higher numbers shall be used.
The following assumptions have been made during the Failure Modes, Effects, and Diagnostic
Analysis of the Universal Module HART 9469.
Failure rates are constant, wear out mechanisms are not included.
Propagation of failures is not relevant.
Sufficient tests are performed prior to shipment to verify the absence of vendor and/or
manufacturing defects that prevent proper operation of specified functionality to product
specifications or cause operation different from the design analyzed.
Practical fault insertion tests can demonstrate the correctness of the failure effects assumed
during the FMEDA and the diagnostic coverage provided by the automatic diagnostics.
External power supply failure rates are not included.
The Mean Time To Restoration (MTTR) is considered to be 24 hours.
The Universal Module HART 9469 is installed per the manufacturer’s instructions.
The listed failure rates are valid for operating stress conditions typical of an industrial field
environment with temperature limits within the manufacturer’s rating and an average
temperature over a long period of time of 40°C. For higher average temperatures, the failure
rates should be multiplied with an experience based factor of e.g. 1.5 for 50°C, 2.5 for 60°C
and 5 for 80°C.
Only the described variants can be used for safety applications.
Only the shut-off path of the Universal Module HART 9469 was considered. The controller
part is decoupled from the signal path and thereby not part of the safety function.
Leakage currents ≤ 1.5 mA and 5.3 V for 4-wire output or ≤ 0.1 mA and 5.3 V for 2-wire
output are assumed to be uncritical and thereby safe failures.
According to IEC 61508 the architectural constraints of an element must be determined. This
can be done by following the 1H approach according to 7.4.4.2 of IEC 61508-2 or the 2H
approach according to 7.4.4.3 of IEC 61508-2.
The 1H approach involves calculating the Safe Failure Fraction for the entire element.
The 2H approach involves assessment of the reliability data for the entire element according to
7.4.4.3.3 of IEC 61508-2.
This assessment supports the 1H approach.
According to 3.6.15 of IEC 61508-4, the Safe Failure Fraction is the property of a safety related
element that is defined by the ratio of the average failure rates of safe plus dangerous detected
failures and safe plus dangerous failures. This ratio is represented by the following equation:
SFF = (ΣλS avg + ΣλDD avg) / (ΣλS avg + ΣλDD avg + ΣλDU avg)
When the failure rates are based on constant failure rates, as in this analysis, the equation can
be simplified to:
SFF = (ΣλS + ΣλDD) / (ΣλS + ΣλDD + ΣλDU)
Where:
λS = Fail Safe
λDD = Fail Dangerous Detected
λDU = Fail Dangerous Undetected
As the Universal Module HART 9469 is only one part of an element, the architectural constraints
should be determined for the entire sensor element.
The FMEDA carried out on the Universal Module HART 9469 under the assumptions described
in section 4.2.3 and the definitions given in section 4.1 and 4.3 leads to the following failure
rates:
SIL AC 13 SIL2
10
DU includes 5% common cause failures between redundant components of disconnection circuit.
11
Failures of redundant components of the Plant- STOP function are part of Annunciation Undetected failures.
12
The complete subsystem will need to be evaluated to determine the overall Safe Failure Fraction. The number
listed is for reference only.
13
SIL AC (architectural constraints) means that the calculated values are within the range for hardware architectural
constraints for the corresponding SIL but does not imply all related IEC 61508 requirements are fulfilled. In addition it
must be shown that the device has a suitable systematic capability for the required SIL and that the entire safety
function can fulfill the required PFD values.
© exida.com GmbH Stahl_9469_1505-054_R036_V1R1.docx; February 11, 2019
Jan Hettenbach Page 15 of 23
4.3.2 Failure rates of Universal Module HART 9469, 4- wire output per IEC 61508:2010
The FMEDA carried out on the Universal Module HART 9469 under the assumptions described
in section 4.2.3 and the definitions given in section 4.1 and 4.3 leads to the following failure
rates:
SIL AC 17 SIL2
14
DU includes 5% common cause failures between redundant components of disconnection circuit.
15
Failures of redundant components of the Plant- STOP function are part of Annunciation Undetected failures.
16
The complete subsystem will need to be evaluated to determine the overall Safe Failure Fraction. The number
listed is for reference only.
17
SIL AC (architectural constraints) means that the calculated values are within the range for hardware architectural
constraints for the corresponding SIL but does not imply all related IEC 61508 requirements are fulfilled. In addition it
must be shown that the device has a suitable systematic capability for the required SIL and that the entire safety
function can fulfill the required PFD values.
© exida.com GmbH Stahl_9469_1505-054_R036_V1R1.docx; February 11, 2019
Jan Hettenbach Page 16 of 23
5 Using the FMEDA results
Using the failure rate data displayed in section 4.3, and the failure rate data for the associated
element devices, an average the Probability of Failure on Demand (PFDAVG) calculation can be
performed for the entire safety function.
Probability of Failure on Demand (PFDAVG) calculation uses several parameters, many of which
are determined by the particular application and the operational policies of each site. Some
parameters are product specific and the responsibility of the manufacturer. Those manufacturer
specific parameters are given in this third party report.
Probability of Failure on Demand (PFDAVG) calculation is the responsibility of the owner/operator
of a process and is often delegated to the SIF designer. Product manufacturers can only
provide a PFDAVG by making many assumptions about the application and operational policies
of a site. Therefore use of these numbers requires complete knowledge of the assumptions and
a match with the actual application and site.
For SIL2 the overall PFDAVG shall be better than 1.00E-02. As the Universal Module HART 9469
is contributing to the entire safety function they should only consume a certain percentage of the
allowed range. Assuming 10% of this range as a reasonable budget they should be better than
or equal to 1.0E-03. The calculated PFDAVG / PFH values are within the allowed range for SIL 2
according to table 2 of IEC 61508-1 and do fulfill the assumption to not claim more than 10% of
the allowed range, i.e. to be better than or equal to 1.0E-03.
The resulting PFDAVG graph is generated for a proof test of 1 year are displayed in Figure 2.
7.1 Liability
exida prepares reports based on methods advocated in International standards. Failure rates
are obtained from a collection of industrial databases. exida accepts no liability whatsoever for
the use of these numbers or for the correctness of the standards on which the general
calculation methods are based.
Due to future potential changes in the standards, best available information and best practices,
the current FMEDA results presented in this report may not be fully consistent with results that
would be presented for the identical product at some future time. As a leader in the functional
safety market place, exida is actively involved in evolving best practices prior to official release
of updated standards so that our reports effectively anticipate any known changes. In addition,
most changes are anticipated to be incremental in nature and results reported within the
previous three year period should be sufficient for current usage without significant question.
Most products also tend to undergo incremental changes over time. If an exida FMEDA has not
been updated within the last three years and the exact results are critical to the SIL verification
you may wish to contact the product vendor to verify the current validity of the results.
7.2 Releases
Version History: V1R1: Results updated; February 11, 2019
V1R0. Review comments included; February 5, 2019
V0R1: Initial version; December 20, 2018
Author: Jan Hettenbach
Review: V0R1: Andreas Bagusch (R. STAHL Schaltgeräte GmbH), Stephan
Aschenbrenner (exida)
Release status: V1R0: Released to R. STAHL Schaltgeräte GmbH
Dipl. -Ing. (Univ.) Jan Hettenbach Dipl.-Ing. (Univ.) Stephan Aschenbrenner, Partner
According to section 7.4.9.5 of IEC 61508-2, a useful lifetime, based on experience, should be
assumed.
Although a constant failure rate is assumed by the probabilistic estimation method (see section
4.2.3) this only applies provided that the useful lifetime18 of components is not exceeded.
Beyond their useful lifetime, the result of the probabilistic calculation method is meaningless, as
the probability of failure significantly increases with time. The useful lifetime is highly dependent
on the component itself and its operating conditions.
This assumption of a constant failure rate is based on the bathtub curve. Therefore it is obvious
that the PFDAVG calculation is only valid for components which have this constant domain and
that the validity of the calculation is limited to the useful lifetime of each component.
It is assumed that early failures are detected to a huge percentage during the installation period
and therefore the assumption of a constant failure rate during the useful lifetime is valid.
There are no components with reduced life-time within the considered safety function.
18
Useful lifetime is a reliability engineering term that describes the operational time interval where the failure rate of a
device is relatively constant. It is not a term which covers product obsolescence, warranty, or other commercial
issues.
© exida.com GmbH Stahl_9469_1505-054_R036_V1R1.docx; February 11, 2019
Jan Hettenbach Page 22 of 23
Appendix B: Proof tests to detect dangerous undetected faults
According to section 7.4.5.2 f) of IEC 61508-2 proof tests shall be undertaken to reveal
dangerous faults which are undetected by diagnostic tests. This means that it is necessary to
specify how dangerous undetected faults which have been noted during the FMEDA can be
detected during proof testing.
A suggested proof test consists of the following steps, as described Table 7.
Table 7 Steps for Proof Test
Step Action
1. Bypass the safety function and take appropriate action to avoid a false trip.
2. Force the Universal Module HART 9469 to go to the safe state and verify that the
safe state is reached.
3. Measure the output current of each channel if it is less than 1.5 mA and 5.3 V for 4-
wire output and less than 0.1 mA and 5.3 V for 2-wire output
4. Restore the loop to full operation.
5. Remove the bypass from the safety PLC or otherwise restore normal operation.