

Speaking Of Reliability: Friends Discussing Reliability Engineering Topics | Warranty | Plant Maintenance
Reliability.FM: Accendo Reliability, focused on improving your reliability program and career
Gain the experience of your peers to accelerate improvement of your program and career. Improve your product development process, reliability or warranty performance; or your plant uptime or asset performance. Learn about reliability and maintenance engineering practical approaches, skills, and techniques. Join the conversation today.
Episodes
Mentioned books

May 6, 2024 • 0sec
Proving HALT Works
Proving HALT Works
Abstract
Kirk and Fred discuss the challenge of showing those new to limit discovery using HALT and proving does find relevant future field issues that either already have occurred in a new released product, or in a product under development.
Key Points
Join Kirk and Fred as they discuss finding potential weaknesses in a new or established product using HALT, and how we can connect the weakness to field reliability, first, if the field issue has already been corrected and all products have been retrofitted with a fix, and second, those weaknesses in development that are found in “conditions the product will never experience in the field (HALT)”
Topics include:
The challenge of proving the relevance of a failure under HALT is very dependent on the weakness found. Failures such as component spacing and shorting are typically catastrophic, and most engineers will quickly correct them in the design. Other failures, such as a significant repeating transient voltage spike that damages an I/O interface, will be more challenging to link to field issues if they have not already been observed.
Comparing limits and observing large distributions of those limits among the three or more samples used in HALT can help establish the case for the lot or manufacturing variation leading to weak products.
Many rush into new product development to HALT before known failures and weaknesses are corrected. Before HALT can be useful, all the prototypes must function correctly, and all time-zero failures must be corrected.
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
Please click on this link to access a relatively new analysis of traditional reliability prediction methods article from the US ARMY and CALCE titled “Reliability Prediction – Continued Reliance on a Misleading Approach”. It is in the public domain, so please distribute freely. Trying to predict reliability for development is a misleading a costly approach.
Here is a link to Kirk’s article “Thermal HALT A Tool for Discovery of Signal Integrity and Software Reliability Issues”
You can now purchase the most recent recording of Kirk Gray’s Hobbs Engineering 8 (two 4 hour sessions) hour Webinar “Rapid and Robust Reliability Development 2022 HALT & HASS Methodologies Online Seminar” from this link.
For more information on the newest discovery testing methodology here is a link to the book “Next Generation HALT and HASS: Robust design of Electronics and Systems” written by Kirk Gray and John Paschkewitz.
The post SOR 963 Proving HALT Works appeared first on Accendo Reliability.

May 3, 2024 • 0sec
Limits of Block Diagrams
Limits of Block Diagrams
Abstract
Chris and Fred discuss how we go about modeling the reliability of systems … particularly with things called ‘block diagrams.’ Might this help you?
Key Points
Join Chris and Fred as they discuss how we can go about modelling a system, mainly in response to a listener question. The question revolves around modeling a ‘complex’ system that involves a relief valve (which means it only needs to work at certain times), and other valves that redirect things in pipes to three different processes. Where do we start?
Topics include:
What are you trying to achieve? As in … what decision are you trying to inform? Is this to optimize maintenance? … or see if you meet reliability requirements? … or to minimize downtime? … what is it?
So what is a Reliability Block Diagram (RBD)? It’s like a fault tree (if you have heard of that) which essentially tells us what combinations of components need to work for the system to work. Now RBDs can’t of themselves tell us if a system is (for example) a parallel system. An RBD might look the same for a two-component load-sharing system as it does for a two-component parallel system. It’s up to you to work out how to model it.
And it can be a little complicated. If your emergency relief valve has failed, then your system could still be ‘happily’ working. Until an emergency comes along. So is your system that is still working with a failed relief valve … failed? Your system will only fail when an ’emergency’ comes along (if nothing else fails). So you need to know how often those emergencies come along … There is nothing wrong with an RBD. It’s just that it can’t do all the thinking.
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 962 Limits of Block Diagrams appeared first on Accendo Reliability.

Apr 29, 2024 • 0sec
Where do Confidence Bounds Come From
Where do Confidence Bounds Come From
Abstract
Chris and Fred discuss where the ideas of ‘confidence bounds’ come from … and perhaps what they mean.
Key Points
Join Chris and Fred as they discuss how we come up with things we call ‘confidence bounds.’ What are they? … and how do they help?
Topics include:
What are ‘confidence bounds’? Confidence bounds are usually explained as limits on what we believe some actual value is. A simple example might be when we judge a distance. For example, if you are standing in a field and see a tree, you might think to yourself that the tree might be 70 – 100 meters away. Your best guess might be around 85 meters, but the values 75 and 100 represent the ‘confidence bounds’ on this best guess because you know there is uncertainty involved.
So how do we get ‘confidence bounds’? .. it starts with ‘likelihoods.’ Let’s say that you find a size 6.5 shoe in the street (adjusting for the difference in how manufacturers calculate their shoe sizes fore male and female shoes.) We also know that women’s feet tend to need shoes of sizes of 6.5 to 7.5. We also know that men’s feet tend to need shoes of sizes 9 to 10. Some women will randomly have large feet that can exceed sizes 10, 11, 12 and so on. And likewise, some men will randomly have small feet that are less than sizes 6.5, 6, 5.5 and so on. But that said, we know that the shoe with size 6.5 that we found is more likely to be worn by a woman. And that is the basis of everything!
Gammas, Chi-squareds, Students-t … what are we talking about? Some really smart people have been able to take the concept of likelihood for things like the mean (times to failure) for random processes. And probability distributions have been developed to help us get confidence bounds based on how each thing fails. These probability distributions quantify the likelihood that potential mean (times to failure) values are ‘true.’ Which can be really helpful … sometimes.
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 961 Where do Confidence Bounds Come From appeared first on Accendo Reliability.

Apr 26, 2024 • 0sec
FMEA Approaches Debate
Differing FMEA Approaches
Abstract
Carl and Fred discuss their overall approach to FMEA, what works and doesn’t work.
Key Points
Join Carl and Fred as they discuss how they approach FMEA to keep it lean, effective and workable. Topics include:
Does a longer FMEA make for a better FMEA?
If you have the right FMEA team and no one is concerned about a problem, no need to include it in the FMEA.
Not seeing forest for trees is high risk
Bottom-up FMEA vs Top-down FMEA
Problems with “Bottom-up FMEA”
Even lower-level FMEAs begin with functions
FMEAs use engineering judgment
Human judgment of FMEA team members is critical part of process
There is not the time or bandwidth to FMEA everything
Benefits of starting with System FMEA
FMEA “nesting”
Tracing component failure propagation to next level and up to the system is critical to assessing risk
If one person on an FMEA team is concerned about an issue, the team needs to discuss the issue
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 960 Differing FMEA Approaches appeared first on Accendo Reliability.

Apr 22, 2024 • 0sec
Knotty Detection
Knotty Detection
Abstract
Carl and Fred discuss reader questions on FMEA detection, a subject which can be challenging and confusing. Detection is a key part of FMEA during product development as well as in operation. This podcast will discuss some of the “knottiest” challenges with understanding detection in FMEA.
Key Points
Join Carl and Fred as they discuss when and how to use detection in FMEAs. Topics include:
Where and when are we detecting the problem?
Detection scales can appear reversed: high likelihood of detection is low score
MIL-STD 1629a does not use Detection scale during product development
There is risk from lack of detection during product development
Subject of detection during product development vs detection in operation
Example of oil light in vehicle
Monitoring and System Response (MSR)
Case studies where confusion exists with detection with tests and detection in operation
How to detect intermittent problems
What to do when conducting an FMEA and the “answer is not in the room”
Detection scale can be 1 to 5 or 1 to 10, the key is prioritizing risk
You want to detect the problem early in product development, if possible
Keep focus on creating a better product
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 959 Knotty Detection appeared first on Accendo Reliability.

Apr 19, 2024 • 0sec
Learning Weibull Analysis
Learning Weibull Analysis
Abstract
Chris and Fred discuss Weibull Analysis and how it can help you can first take your ‘tentative’ steps to learn more about it.
Key Points
Join Chris and Fred as they discuss Weibull analysis. This is perhaps one of the most talked about forms of analysis reliability engineers talk about. And so for some people who are first starting to do reliability stuff, it can be a little intimidating to not know about this analysis methodology that everyone else seems to use. So where do you start?
Topics include:
Usually we start by saying ‘find your decision’ … but perhaps this is not the thing you need to do when it comes to trying to learn about what Weibull analysis does. How can you know if your decision can even be helped by Weibull analysis..
Humans are visual creatures who can see patterns in things. This is really important. Computers aren’t close to what we can do when it comes to finding corners in straight lines and things like that.
Weibull analysis (at its best) is all about turning things like failure data into visual patterns we can see. Data starts looking like a table of numbers in a spreadsheet or something similar. Weibull analysis turns these numbers into points that can be visualized as curves. And these patterns can tell you things like … what should my servicing interval be? … what percentage of products experience infant mortality? … what is the likely dominant failure mechanism?
… and it’s not all about software/numbers. Reliability engineering isn’t about force-feeding numbers into Weibull analysis plotting software. If you put numbers in and get numbers out, and don’t know what those numbers mean (or if they are relevant) you will make bad decisions. All the time.
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 958 Learning Weibull Analysis appeared first on Accendo Reliability.

Apr 15, 2024 • 0sec
Learning From Those Closest
Learning From Those Closest
Abstract
Kirk and Fred discuss the fact that many times those on the assembly and production lines are the ones that have the most information for assembly issues and causes of failures, yet the information they have is not heard by the engineers and management that could improve it.
Key Points
Join Kirk and Fred as they discuss getting the information on reliability issues from those workers and technicians assembling the product or running production equipment to the engineers who made the assembly procedures.
Topics include:
Getting engineers to sit on the production lines and perform the procedure they wrote can be difficult even though watching the challenges and potential difficulty of the procedure and failures can be extremely beneficial and can help them relate to the assembly issues.
Management by walking around is a common method for knowing the real issues on the production floor, and allows managers and engineers to have a more macro perspective of the entire manufacturing process.
Fred tells of his experience finding a solution from a line worker for floating components in a wave solder using a ceramic bead bag that was very cost-effective, even though the engineers had come up with a much more expensive fixture.
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
Please click on this link to access a relatively new analysis of traditional reliability prediction methods article from the US ARMY and CALCE titled “Reliability Prediction – Continued Reliance on a Misleading Approach”. It is in the public domain, so please distribute freely. Trying to predict reliability for development is a misleading a costly approach.
You can now purchase the most recent recording of Kirk Gray’s Hobbs Engineering 8 (two 4 hour sessions) hour Webinar “Rapid and Robust Reliability Development 2022 HALT & HASS Methodologies Online Seminar” from this link.
For more information on the newest discovery testing methodology here is a link to the book “Next Generation HALT and HASS: Robust design of Electronics and Systems” written by Kirk Gray and John Paschkewitz.
The post SOR 957 Learning From Those Closest appeared first on Accendo Reliability.

Apr 12, 2024 • 0sec
Getting Failure Feedback
Getting Failure Feedback
Abstract
Kirk and Fred discuss the many required tests before market release and post market ongoing reliability testing and why testing is so necessary.
Key Points
Join Kirk and Fred as they discuss the reasons we have to do so many tests to get the feedback on failures sometimes long after the tests have no failures for long periods.
Topics include:
Some companies have big investments in chambers and processes to perform “burn-in” testing, which may have a poor ROI, but they found a reliability issue months ago and that justifies it forever.
Testing to find margins and improve them where it is possible (HALT) is the most cost-effective early testing and helps products withstand component and vendor variations. The test should always be compared to what failures are occurring in the field and if not relevant to the field should be eliminated.
Field failures are the best and most valuable data on reliability issues, but getting failed parts back for failure analysis can be extremely difficult, and field service engineers are rewarded for quick repair and sending back failed parts is a low priority.
Sometimes when a company has an issue with a particular component type, such as Al Electrolytic capacitors, which drives them to develop ongoing vendor highly focused test requirements for every vendor that makes that component type, and while no failures occur, the past fears require them to keep testing regardless of the fact that 100% pass.
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
Please click on this link to access a relatively new analysis of traditional reliability prediction methods article from the US ARMY and CALCE titled “Reliability Prediction – Continued Reliance on a Misleading Approach”
You can now purchase the most recent recording of Kirk Gray’s Hobbs Engineering 8 (two 4 hour sessions) hour Webinar “Rapid and Robust Reliability Development 2022 HALT & HASS Methodologies Online Seminar” from this link.
For more information on the newest discovery testing methodology here is a link to the book “Next Generation HALT and HASS: Robust design of Electronics and Systems” written by Kirk Gray and John Paschkewitz.
The post SOR 956 Getting Failure Feedback appeared first on Accendo Reliability.

Apr 8, 2024 • 0sec
Data Analysis Assumptions
Data Analysis Assumptions
Abstract
Greg and Fred discuss the importance and context of assumptions in data analysis.
Key Points
Join Greg and Fred as they discuss assumptions making decisions using non parametric data. Topics include:
What is non parametric data?
Why, when, and how to use non parametric data?
How to make good decisions using non parametric data and their assumptions?
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 955 Data Analysis Assumptions appeared first on Accendo Reliability.

Apr 5, 2024 • 0sec
Ai Challenges and Opportunities
AI Challenges and Opportunities
Abstract
Greg and Fred discuss AI – both the challenges and opportunities for quality and reliability professionals.
Key Points
Join Greg and Fred as they discuss how AI may have the same impact as the discovery of electricity or the steam engine. Both created economic disruption and personal change. So, hang on. Be ready for a bumpy ride.
Topics include:
What’s the AI proof of concept for product dev?
What is AI RMF and RAG?
Why is the AI RMF the hub for all AI product dev in the US?
Enjoy an episode of Speaking of Reliability. Where you can join friends as they discuss reliability topics. Join us as we discuss topics ranging from design for reliability techniques to field data analysis approaches.
Download Audio RSS
Show Notes
The post SOR 954 Ai Challenges and Opportunities appeared first on Accendo Reliability.