Prognostics and Health Management — An Interactive Primer

1

Forward in time.

A bearing has a crack. It is small. The machine is running. The question that matters is not only is something wrong — diagnostics answered that — but how long do I have? Weeks? Hours? Until the next shift? Prognostics is the discipline of answering that question, quantitatively, with honest uncertainty.

Prognostics and Health Management (PHM) sits where diagnostics becomes a planning tool. A diagnostic system catches a fault the moment it becomes distinguishable from noise. A prognostic system reads the fault's trajectory and projects forward — combining what it has seen with what it knows about how this kind of fault tends to grow — to estimate when the asset will reach the end of its useful life.

The central object is the health indicator (HI), a scalar that captures the asset's current condition, and its evolution — degradation — over time. The central output is Remaining Useful Life (RUL): a distribution, ideally, over the time until the health indicator crosses a failure threshold.

Now t = 45

Figure 1 · InteractiveThe canonical PHM picture. Black is what has happened, teal is what might happen. Drag the “now” slider. As more of the past is observed, the uncertainty cone narrows and the predicted RUL distribution sharpens. The RUL is not a number; it is a probability distribution over the time-to-threshold-crossing.

Two distinctions worth making up front. Diagnostics vs. prognostics: diagnostics is a present-tense claim about the system’s state; prognostics is a future-tense claim, necessarily uncertain. And point estimate vs. distribution: a prognostic that outputs “42 hours” without a confidence band is almost useless. A prognostic that outputs “RUL is log-normal with median 42 h and 90% CI [18, 97]” lets a maintenance planner actually decide something.

2

Remaining Useful Life — a first-passage time.

RUL is the time from “now” until the asset first crosses a failure threshold. Every prognostic method, whatever its machinery, is ultimately trying to estimate this distribution.

RUL(t) = inf { τ > 0 : H(t + τ) ≤ Hfail } given everything observed up to time t

Three families dominate in practice. They differ in whether they rely on a physics model, a population of historical units, or raw pattern recognition.

2.1Particle filter — physics plus Bayes

The particle filter works when you have a stochastic degradation model and noisy measurements. At each time step, it carries a cloud of particles — each one a hypothesis about the system’s current hidden state. Particles are propagated forward by the degradation model. When a measurement arrives, particles are weighted by how well they explain it, and the cloud is resampled to concentrate near the likely state.

To estimate RUL, each particle is extrapolated further into the future (without new measurements) until it crosses the failure threshold. The collected crossing times form an empirical RUL distribution.

Particles N = 80

Figure 2 · InteractiveEach thin teal line is a particle. Step time forward to watch them propagate; click “forecast RUL” to extrapolate every particle to the threshold and build the RUL histogram on the right. Try it with 20 vs. 200 particles — fewer is faster, more is smoother.

2.2Similarity-based — the library of histories

Sometimes you have no physics model but you do have a library of run-to-failure histories from similar units. For the current asset, you find the historical units whose trajectory most closely resembles the one you are observing, and use their remaining lives to estimate yours.

This is simple, surprisingly effective on real data, and scales beautifully as the library grows. It assumes the current unit will age like the library did — an assumption that needs checking when populations drift.

2.3Deep learning — learn the mapping

Given enough labeled run-to-failure data, deep networks can map raw sensor windows directly to RUL. CNN and LSTM architectures dominate the public benchmarks (C-MAPSS, PRONOSTIA, PHM Society challenges). Transformers and temporal convolutional networks have joined the fray more recently.

How they compare in practice

Particle filterPrincipled uncertainty, requires a physics or stochastic model, computationally heavy for large N.
Similarity-basedMinimal modeling, needs a run-to-failure library, honest uncertainty via nearest-neighbor spread.
Deep learningHigh performance when data is abundant and stationary, uncertainty estimation is an open research problem.

3

Degradation modeling — the shape of getting old.

Before predicting when an asset will fail, you need a model of how it ages. Three stochastic processes dominate the literature, each capturing a different character of aging.

Wiener processDamage accumulates as a drift plus Brownian noise. Can fluctuate up and down — appropriate when the health indicator is noisy or reversible in the short term. First-passage-time to a threshold is inverse-Gaussian distributed, giving a closed-form RUL.
Gamma processDamage arrives in strictly positive random increments. Paths are monotonically increasing and often jumpy. The right model for wear, crack growth, corrosion — anything that cannot spontaneously heal.
Inverse Gaussian processA generalization with a specific moment structure; useful when the data shows heavier right tails in its failure times than Gamma can capture.

Figure 3 · InteractiveClick between models and resample. Notice how Wiener paths wiggle up and down, Gamma paths march monotonically, and the inverse-Gaussian failure-time distribution has a characteristic long right tail. Different asset types demand different models.

4

Health indicators — squeezing many sensors into one scalar.

A machine is a forest of sensors. Vibration RMS, bearing temperature, motor current harmonics, oil particle counts, acoustic envelopes. None of them, alone, tells the whole story of aging. A health indicator is a single scalar derived from the raw sensor set that captures “how healthy is this thing, overall.”

Construction is part craft, part mathematics. The common recipes:

Weighted sumEngineering-assigned weights on normalized sensors. Transparent, tunable, common in production.
PCA projectionProject onto the first principal component of the sensor matrix during degradation. Data-driven, captures the dominant degradation direction.
Autoencoder latentTrain a neural network on healthy data; use the reconstruction error or a latent-space distance as HI.
Mahalanobis distanceDistance from the current sensor vector to the healthy-distribution centroid, scaled by covariance.

The key property of a good health indicator is monotonicity: it should trend consistently from “healthy” to “failed” over the asset’s life, with as little back-and-forth as possible. A non-monotonic HI makes RUL estimation brittle and maintenance decisions confusing.

Vibration RMS Bearing Temp Current Harmonic Acoustic Env.

Toggle sensors on and off. As more sensors are fused, the HI becomes smoother and more monotonic. Removing a noisy sensor can actually help; removing too many sensors leaves noise dominant.

Figure 4 · InteractiveFour raw sensors (top), each noisy and partially informative. Their weighted fusion (teal, bottom) produces a smoother, more monotonic health indicator. The monotonicity score quantifies how well-behaved the HI is — a crucial property for downstream RUL estimation.

5

Uncertainty — often more important than the estimate.

A prognostic that says “RUL = 50 hours” is not actionable. A prognostic that says “RUL is 50 hours ± 3 hours with 95% confidence” schedules a maintenance visit. A prognostic that says “RUL is 50 hours ± 40 hours with 95% confidence” schedules a second measurement. The width of the interval is the content.

Two kinds of uncertainty must be tracked. Aleatoric uncertainty is the irreducible randomness of the degradation process — even perfect knowledge of the current state cannot predict the future Brownian noise. Epistemic uncertainty is the modeler’s own ignorance — uncertain parameters, mismatched models, insufficient observations. Aleatoric uncertainty narrows as you approach the threshold; epistemic uncertainty narrows as you acquire more data.

total variance = aleatoric (irreducible) + epistemic (reducible by more data)

The practical output is a reliability function: the probability that the asset will still be operating at time τ in the future. This is the object a maintenance planner actually uses.

Planning horizon τ τ = 45h

Figure 5 · InteractiveThe left panel shows the RUL distribution with the shaded region representing P(fail before τ). The right panel shows the reliability curve R(τ) = 1 − CDF. Slide τ and watch the probability update. A maintenance planner might require R(τ) ≥ 0.98 before dispatching an aircraft — the horizon at which that holds is the inspection deadline.

6

Condition-based and predictive maintenance.

The economic point of PHM. A health indicator that no one reads and an RUL distribution that no one acts on is a research project, not an engineering system. Maintenance strategies are how PHM pays for itself.

Four archetypes span the landscape:

ReactiveRun to failure. Cheapest until the failure occurs; then it is the most expensive, with unplanned downtime and possible collateral damage.
Scheduled (time-based)Maintain on a fixed calendar — every 5,000 hours, every 12 months. Prevents most failures but wastes remaining life on everything replaced early.
Condition-based (CBM)Maintain when the current health indicator crosses a warning threshold. Uses diagnostics but not forecasting. Good improvement over scheduled.
Predictive (prognostic-driven, PdM)Maintain when the RUL distribution predicts failure within a planning horizon. Consumes maximum useful life and minimizes surprise failures.

Prognostic accuracy 75%

Figure 6 · InteractiveA simulated 10-year operation. Ochre diamonds mark scheduled or triggered maintenance; burgundy stars mark unplanned failures. Each strategy accumulates cost differently. Slide prognostic accuracy — as it drops, predictive maintenance degrades toward (but not below) condition-based, because it is still acting on a signal that exists.

7

References & further reading

A curated list. Textbooks first, then topic-specific surveys and applications.

Foundational texts

Vachtsevanos, G., Lewis, F. L., Roemer, M., Hess, A., Wu, B. — Intelligent Fault Diagnosis and Prognosis for Engineering Systems. Wiley, 2006. The classic PHM textbook.
Pecht, M. — Prognostics and Health Management of Electronics. Wiley, 2008. The standard reference for electronics-specific PHM.
Kim, N.-H., An, D., Choi, J.-H. — Prognostics and Health Management of Engineering Systems: An Introduction. Springer, 2017. Well-structured modern pedagogical treatment with worked examples.
Si, X.-S., Wang, W., Hu, C.-H., Zhou, D.-H. — “Remaining useful life estimation — a review on the statistical data driven approaches.” European J. Operational Research, 2011. The canonical survey.

RUL estimation

Orchard, M. E. and Vachtsevanos, G. J. — “A particle-filtering approach for on-line fault diagnosis and failure prognosis.” Trans. Institute of Measurement and Control, 2009. Foundational particle-filter-for-PHM paper.
Wang, T., Yu, J., Siegel, D., Lee, J. — “A similarity-based prognostics approach for Remaining Useful Life estimation of engineered systems.” PHM, 2008. The similarity-based method applied to C-MAPSS.
Lei, Y., Li, N., Guo, L., Li, N., Yan, T., Lin, J. — “Machinery health prognostics: A systematic review from data acquisition to RUL prediction.” Mechanical Systems and Signal Processing, 2018. Current-decade survey of data-driven RUL.
Saxena, A., Goebel, K., Simon, D., Eklund, N. — “Damage propagation modeling for aircraft engine run-to-failure simulation.” PHM, 2008. The C-MAPSS benchmark dataset paper — still the community standard.

Degradation modeling

Ye, Z.-S. and Xie, M. — “Stochastic modelling and analysis of degradation for highly reliable products.” Applied Stochastic Models in Business and Industry, 2015. Modern survey of Wiener / Gamma / IG processes for degradation.
Wang, X. — “Wiener processes with random effects for degradation data.” J. Multivariate Analysis, 2010. Heterogeneity-aware Wiener models.
van Noortwijk, J. M. — “A survey of the application of gamma processes in maintenance.” Reliability Engineering & System Safety, 2009. The reference for Gamma-process degradation.

Deep learning for PHM

Zhao, R., Yan, R., Chen, Z., Mao, K., Wang, P., Gao, R. X. — “Deep learning and its applications to machine health monitoring.” Mechanical Systems and Signal Processing, 2019.
Fink, O., Wang, Q., Svensen, M., Dersin, P., Lee, W.-J., Ducoffe, M. — “Potential, challenges and future directions for deep learning in prognostics and health management.” Engineering Applications of Artificial Intelligence, 2020. Honest assessment of deep-learning PHM including its weaknesses.

Uncertainty quantification

Sankararaman, S. — “Significance, interpretation, and quantification of uncertainty in prognostics and remaining useful life prediction.” Mechanical Systems and Signal Processing, 2015. The reference paper on aleatoric/epistemic decomposition for PHM.
Saxena, A., Celaya, J., Saha, B., Saha, S., Goebel, K. — “Metrics for offline evaluation of prognostic performance.” Intl. J. Prognostics and Health Management, 2010. The canonical metrics — including the asymmetric scoring that rewards early, penalizes late.

Applications

Xing, Y., Ma, E. W. M., Tsui, K. L., Pecht, M. — “Battery management systems in electric and hybrid vehicles.” Energies, 2011. A readable entry to Li-ion SoH estimation.
Plumley, C. E. — “Condition monitoring of wind turbines: A review.” various editions and updates. Wind-turbine monitoring is PHM’s most visible industrial win.
Jardine, A. K. S., Lin, D., Banjevic, D. — “A review on machinery diagnostics and prognostics implementing condition-based maintenance.” Mechanical Systems and Signal Processing, 2006. The bridge from prognostics to CBM/PdM business logic.