First-hitting-time model

Last updated January 03, 2025

Colloquially, a first passage time (or first hitting time) in a stochastic system is the time taken for a state variable to first reach a certain value. Understanding this quantity gives insight into the physical system under observation, and it has consequently been studied in fields as diverse as economics and ecology.^[1]

The idea that a first hitting time of a stochastic process might describe the time to occurrence of an event has a long history, starting with an interest in the first passage time of Wiener diffusion processes in economics and then in physics in the early 1900s.^[2]^[3]^[4] Modeling the probability of financial ruin as a first passage time was an early application in the field of insurance.^[5] Interest in the mathematical properties of first hitting times, and in statistical models and methods for the analysis of survival data, grew steadily between the middle and the end of the 20th century.^[6]^[7]^[8]^[9]^[10]

Examples

A common example of a first-hitting-time model is a ruin problem, such as Gambler's ruin. In this example, an entity (often described as a gambler or an insurance company) has an amount of money which varies randomly with time, possibly with some drift. The model considers the event that the amount of money reaches 0, representing bankruptcy. The model can answer questions such as the probability that this occurs within finite time, or the mean time until it occurs.
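
Both questions can be explored numerically. The following is a minimal Python sketch, assuming the money performs a simple ±1 random walk and adding an upper stopping level `target` (an assumption introduced here so that every simulated path terminates; it is not part of the description above). The function names and parameters are illustrative. For a fair game with these two barriers, the standard closed forms are a ruin probability of 1 − start/target and a mean duration of start × (target − start), which the simulation should approximately reproduce.

```python
import random

def simulate_ruin(start, target, p_win=0.5, n_paths=2000, seed=0):
    """Monte Carlo estimate of (ruin probability, mean number of bets).

    Assumption: each bet changes the money by +1 (probability p_win) or -1,
    and play stops when the money first hits 0 (ruin) or `target`.
    """
    rng = random.Random(seed)
    ruined = 0
    total_steps = 0
    for _ in range(n_paths):
        money, steps = start, 0
        while 0 < money < target:
            money += 1 if rng.random() < p_win else -1
            steps += 1
        ruined += (money == 0)
        total_steps += steps
    return ruined / n_paths, total_steps / n_paths

def exact_fair_game(start, target):
    """Closed-form ruin probability and mean duration for p_win = 0.5."""
    return 1 - start / target, start * (target - start)

if __name__ == "__main__":
    print("simulated:", simulate_ruin(3, 10))
    print("exact:    ", exact_fair_game(3, 10))
```

Setting `p_win` below 0.5 introduces a drift toward ruin; the closed forms in `exact_fair_game` then no longer apply.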

First-hitting-time models can be applied to the expected lifetimes of patients or mechanical devices. When the process reaches an adverse threshold state for the first time, the patient dies or the device breaks down.

A financial application of the first hitting time probability has been developed by Marcello Minenna in order to compute the minimum investment time horizon.^[11]^[12]

First passage time of a 1D Brownian particle

One of the simplest and most ubiquitous stochastic systems is the Brownian particle in one dimension. This system describes a particle that moves stochastically in one-dimensional space, with equal probability of moving to the left or to the right. Because Brownian motion is often used as a tool to understand more complex phenomena, it is important to understand the distribution of the time at which the particle first reaches some position distant from its starting location. It is obtained as follows.

The probability density function (PDF) for a particle in one dimension is found by solving the one-dimensional diffusion equation. (This equation states that the position probability density diffuses outward over time. It is analogous to, say, cream in a cup of coffee that is initially concentrated in one small region. After a long time the cream has diffused evenly throughout the entire drink.) Namely,

{\frac {\partial p(x,t\mid x_{0})}{\partial t}}=D{\frac {\partial ^{2}p(x,t\mid x_{0})}{\partial x^{2}}},

given the initial condition $p(x,t=0\mid x_{0})=\delta (x-x_{0})$, where $x(t)$ is the position of the particle at time $t$, $x_{0}$ is the tagged particle's initial position, and $D$ is the diffusion constant, with S.I. units $m^{2}s^{-1}$ (an indirect measure of the particle's speed). The bar in the argument of the probability density denotes conditioning on the initial position. The diffusion equation states that the rate of change in time of the probability density at position $x$ is proportional to the second spatial derivative (the curvature) of the density at that position.

It can be shown that the one-dimensional PDF is

p(x,t;x_{0})={\frac {1}{\sqrt {4\pi Dt}}}\exp \left(-{\frac {(x-x_{0})^{2}}{4Dt}}\right).

This states that the probability density of finding the particle at $x(t)$ is Gaussian, with a width that depends on time. More specifically, the full width at half maximum (FWHM) of the distribution in $x$ scales like

{\rm {FWHM}}\sim {\sqrt {t}}.
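
The Gaussian solution and its width can be checked numerically. The sketch below (illustrative function names, central finite differences with a hypothetical step `h`) evaluates the residual of the diffusion equation for the Gaussian PDF above, and uses the fact that the Gaussian falls to half its peak where $(x-x_{0})^{2}/(4Dt)=\ln 2$, which gives an explicit width of $4{\sqrt {Dt\ln 2}}$.

```python
import math

def gaussian_pdf(x, t, x0=0.0, D=1.0):
    """Free-space solution p(x, t; x0) of the 1D diffusion equation."""
    return math.exp(-(x - x0) ** 2 / (4 * D * t)) / math.sqrt(4 * math.pi * D * t)

def diffusion_residual(x, t, x0=0.0, D=1.0, h=1e-3):
    """Central-difference estimate of dp/dt - D * d2p/dx2 (close to 0 for a solution)."""
    dp_dt = (gaussian_pdf(x, t + h, x0, D) - gaussian_pdf(x, t - h, x0, D)) / (2 * h)
    d2p_dx2 = (gaussian_pdf(x + h, t, x0, D)
               - 2 * gaussian_pdf(x, t, x0, D)
               + gaussian_pdf(x - h, t, x0, D)) / h ** 2
    return dp_dt - D * d2p_dx2

def fwhm(t, D=1.0):
    """Full width at half maximum of the Gaussian in x at time t."""
    return 4 * math.sqrt(D * t * math.log(2))

if __name__ == "__main__":
    print("residual:", diffusion_residual(0.7, 1.3, x0=0.2, D=0.5))
    print("FWHM at t=1 and t=4:", fwhm(1.0), fwhm(4.0))
```

Quadrupling the time doubles the width, in line with the square-root scaling.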

Using the PDF one is able to derive the average of a given function, $L$ , at time $t$ :

\langle L(t)\rangle \equiv \int _{-\infty }^{\infty }L(x,t)p(x,t)\,dx,

where the average is taken over all space (or any applicable variable).
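
As an illustration of such an average, the sketch below integrates $L(x,t)\,p(x,t)$ with the trapezoidal rule over a finite window around $x_{0}$. The window size `n_sigma` and the grid size `n` are arbitrary choices made here. Taking $L=(x-x_{0})^{2}$ should recover the well-known mean squared displacement $2Dt$ of one-dimensional diffusion.

```python
import math

def gaussian_pdf(x, t, x0=0.0, D=1.0):
    """Free-space Gaussian PDF of the 1D Brownian particle."""
    return math.exp(-(x - x0) ** 2 / (4 * D * t)) / math.sqrt(4 * math.pi * D * t)

def average(L, t, x0=0.0, D=1.0, n_sigma=12.0, n=20001):
    """Trapezoidal estimate of <L(t)> = integral of L(x, t) p(x, t) dx.

    The infinite range is truncated to x0 +/- n_sigma standard deviations,
    where the standard deviation of the Gaussian is sqrt(2 D t).
    """
    sigma = math.sqrt(2 * D * t)
    a = x0 - n_sigma * sigma
    b = x0 + n_sigma * sigma
    dx = (b - a) / (n - 1)
    total = 0.0
    for i in range(n):
        x = a + i * dx
        weight = 0.5 if i in (0, n - 1) else 1.0  # end points count half
        total += weight * L(x, t) * gaussian_pdf(x, t, x0, D)
    return total * dx

if __name__ == "__main__":
    x0, D, t = 1.0, 0.5, 3.0
    print("normalisation:", average(lambda x, t: 1.0, t, x0, D))
    print("mean position:", average(lambda x, t: x, t, x0, D))
    print("MSD:", average(lambda x, t: (x - x0) ** 2, t, x0, D), "vs 2Dt =", 2 * D * t)
```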

The first passage time density (FPTD) is the probability density that a particle first reaches a point $x_{c}$ at exactly time $t$ (rather than at some time during the interval up to $t$). This probability density can be calculated from the survival probability (a more common probability measure in statistics). Consider the absorbing boundary condition $p(x_{c},t)=0$. (The subscript c for the absorption point $x_{c}$ abbreviates "cliff", used in many texts as an analogy for an absorption point.) The PDF satisfying this boundary condition is given by

p(x,t;x_{0},x_{c})={\frac {1}{\sqrt {4\pi Dt}}}\left(\exp \left(-{\frac {(x-x_{0})^{2}}{4Dt}}\right)-\exp \left(-{\frac {(x-(2x_{c}-x_{0}))^{2}}{4Dt}}\right)\right),

for $x<x_{c}$ . The survival probability, the probability that the particle has remained at a position $x<x_{c}$ for all times up to $t$ , is given by

S(t)\equiv \int _{-\infty }^{x_{c}}p(x,t;x_{0},x_{c})\,dx=\operatorname {erf} \left({\frac {x_{c}-x_{0}}{2{\sqrt {Dt}}}}\right),

where $\operatorname {erf}$ is the error function. The relation between the survival probability and the FPTD is as follows: the probability that a particle reaches the absorption point between times $t$ and $t+dt$ is $f(t)\,dt=S(t)-S(t+dt)$. Using the first-order Taylor approximation $S(t+dt)\approx S(t)+{\frac {\partial S(t)}{\partial t}}\,dt$, the definition of the FPTD follows:

f(t)=-{\frac {\partial S(t)}{\partial t}}.
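
The closed form of the survival probability can be checked by integrating the absorbing-boundary PDF directly. The following sketch assumes $x_{0}<x_{c}$ and uses a trapezoidal rule on a truncated range; the names and the truncation `n_sigma` are illustrative choices, not part of the derivation.

```python
import math

def absorbed_pdf(x, t, x0, xc, D):
    """Image solution: free Gaussian minus its mirror image about xc."""
    norm = 1.0 / math.sqrt(4 * math.pi * D * t)
    direct = math.exp(-(x - x0) ** 2 / (4 * D * t))
    image = math.exp(-(x - (2 * xc - x0)) ** 2 / (4 * D * t))
    return norm * (direct - image)

def survival_numeric(t, x0, xc, D, n_sigma=12.0, n=20001):
    """Trapezoidal integral of the absorbed PDF over (-infinity, xc], truncated below."""
    a = x0 - n_sigma * math.sqrt(2 * D * t)  # far enough left that the PDF is negligible
    dx = (xc - a) / (n - 1)
    total = 0.0
    for i in range(n):
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * absorbed_pdf(a + i * dx, t, x0, xc, D)
    return total * dx

def survival_exact(t, x0, xc, D):
    """Closed form S(t) = erf((xc - x0) / (2 sqrt(D t)))."""
    return math.erf((xc - x0) / (2 * math.sqrt(D * t)))

if __name__ == "__main__":
    for t in (0.5, 2.0, 10.0):
        print(t, survival_numeric(t, 0.0, 1.0, 0.5), survival_exact(t, 0.0, 1.0, 0.5))
```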

By using the diffusion equation and integrating, the explicit FPTD is

f(t)\equiv {\frac {|x_{c}-x_{0}|}{\sqrt {4\pi Dt^{3}}}}\exp \left(-{\frac {(x_{c}-x_{0})^{2}}{4Dt}}\right).

The first-passage time for a Brownian particle therefore follows a Lévy distribution.
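
Comparing the FPTD with the Lévy density ${\sqrt {c/2\pi }}\,t^{-3/2}e^{-c/(2t)}$ identifies the scale parameter as $c=\Delta x^{2}/(2D)$, with $\Delta x=|x_{c}-x_{0}|$ and location parameter zero. The sketch below checks the FPTD against a numerical derivative of the survival probability, and draws first passage times using the standard representation of a Lévy variate as $c/Z^{2}$ with $Z$ a standard normal variable. Function names, the step `h` and the seed are illustrative assumptions.

```python
import math
import random

def survival(t, delta_x, D):
    """S(t) = erf(delta_x / (2 sqrt(D t)))."""
    return math.erf(delta_x / (2 * math.sqrt(D * t)))

def fptd(t, delta_x, D):
    """Explicit first passage time density."""
    return (delta_x / math.sqrt(4 * math.pi * D * t ** 3)
            * math.exp(-delta_x ** 2 / (4 * D * t)))

def fptd_from_survival(t, delta_x, D, h=1e-5):
    """Central-difference estimate of -dS/dt."""
    return -(survival(t + h, delta_x, D) - survival(t - h, delta_x, D)) / (2 * h)

def sample_first_passage_times(delta_x, D, n, seed=0):
    """Draw n first passage times as Levy variates c / Z**2, c = delta_x**2 / (2 D)."""
    rng = random.Random(seed)
    c = delta_x ** 2 / (2 * D)
    samples = []
    while len(samples) < n:
        z = rng.gauss(0.0, 1.0)
        if z != 0.0:  # guard against division by zero
            samples.append(c / z ** 2)
    return samples

if __name__ == "__main__":
    print(fptd(1.5, 1.0, 0.5), fptd_from_survival(1.5, 1.0, 0.5))
    times = sample_first_passage_times(1.0, 0.5, 20000)
    empirical = sum(1 for s in times if s <= 2.0) / len(times)
    print("P(T <= 2):", empirical, "vs 1 - S(2) =", 1 - survival(2.0, 1.0, 0.5))
```

Because the distribution is heavy-tailed, the sample mean of `times` does not settle down as the sample grows, whereas empirical quantiles do.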

For $t\gg {\frac {(x_{c}-x_{0})^{2}}{4D}}$ , it follows from above that

f(t)={\frac {\Delta x}{\sqrt {4\pi Dt^{3}}}}\sim t^{-3/2},

where $\Delta x\equiv |x_{c}-x_{0}|$ . This equation states that the probability density for a Brownian particle achieving a first passage at some long time (defined in the paragraph above) becomes increasingly small, but never vanishes.

The first moment of the FPTD diverges (it is a so-called heavy-tailed distribution), so the average first passage time cannot be calculated. Instead, one can calculate the typical time, the time at which the FPTD is at its maximum ( $\partial f/\partial t=0$ ), i.e.,

\tau _{\rm {ty}}={\frac {\Delta x^{2}}{6D}}.
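
The maximum can be located by working with the logarithm of the explicit FPTD, $\ln f(t)={\text{const}}-{\tfrac {3}{2}}\ln t-\Delta x^{2}/(4Dt)$, and setting its time derivative to zero:

{\frac {\partial \ln f}{\partial t}}=-{\frac {3}{2t}}+{\frac {\Delta x^{2}}{4Dt^{2}}}=0\quad \Longrightarrow \quad t={\frac {\Delta x^{2}}{6D}}.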

First-hitting-time applications in many families of stochastic processes

First hitting times are central features of many families of stochastic processes, including Poisson processes, Wiener processes, gamma processes, and Markov chains, to name but a few. The state of the stochastic process may represent, for example, the strength of a physical system, the health of an individual, or the financial condition of a business firm. The system, individual or firm fails or experiences some other critical endpoint when the process reaches a threshold state for the first time. The critical event may be an adverse event (such as equipment failure, congestive heart failure, or lung cancer) or a positive event (such as recovery from illness, discharge from a hospital stay, childbirth, or return to work after traumatic injury). The lapse of time until that critical event occurs is usually interpreted generically as a ‘survival time’. In some applications, the threshold is a set of multiple states, so one considers competing first hitting times for reaching the first threshold in the set, as is the case when considering competing causes of failure in equipment or death for a patient.

Threshold regression: first-hitting-time regression

Practical applications of theoretical models for first hitting times often involve regression structures. When first-hitting-time models are equipped with regression structures that accommodate covariate data, the resulting regression structure is called threshold regression.^[13] The threshold state, the parameters of the process, and even the time scale may depend on the corresponding covariates. Threshold regression as applied to time-to-event data has emerged since the start of the 21st century and has grown rapidly, as described in a 2006 survey article^[13] and its references. Connections between threshold regression models derived from first hitting times and the ubiquitous Cox proportional hazards regression model^[14] have also been investigated.^[15] Applications of threshold regression range over many fields, including the physical and natural sciences, engineering, social sciences, economics and business, agriculture, health and medicine.^[16]^[17]^[18]^[19]^[20]

Latent vs observable

In many real-world applications, a first-hitting-time (FHT) model has three underlying components: (1) a parent stochastic process $\{X(t)\}\,\,$ , which might be latent, (2) a threshold (or barrier) and (3) a time scale. The first hitting time is defined as the time when the stochastic process first reaches the threshold. It is very important to distinguish whether the sample path of the parent process is latent (i.e., unobservable) or observable, and this distinction is a characteristic of the FHT model. Latent processes are by far the most common. To give an example, we can use a Wiener process $\{X(t),t\geq 0\,\}\,$ as the parent stochastic process. Such a Wiener process can be defined with the mean parameter ${\mu }\,\,$ , the variance parameter ${\sigma ^{2}}\,\,$ , and the initial value $X(0)=x_{0}>0\,$ .
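
The three components can be made concrete with a small simulation. The sketch below is only illustrative: it assumes the threshold is at zero (suggested by $x_{0}>0$ but not stated above), discretises the Wiener process with a fixed step `dt`, and treats paths that have not hit the threshold by `t_max` as censored. In an application with a latent process, only the returned hitting time would be observed, not the path. For negative drift, a standard result for the continuous process is a mean hitting time of $x_{0}/|\mu |$; the discretised estimate is slightly biased upward because crossings between grid points are missed.

```python
import math
import random

def simulate_hitting_time(x0, mu, sigma, dt=0.01, t_max=100.0, rng=None):
    """First time a discretised Wiener process started at x0 reaches 0.

    Returns None if the threshold is not reached before t_max (censored).
    """
    rng = rng or random.Random()
    x, t = x0, 0.0
    step_sd = sigma * math.sqrt(dt)
    while t < t_max:
        x += mu * dt + step_sd * rng.gauss(0.0, 1.0)  # Euler step of the latent path
        t += dt
        if x <= 0.0:
            return t
    return None

def mean_hitting_time(x0, mu, sigma, n_paths=1000, seed=0, **kwargs):
    """Mean of the observed hitting times and the fraction of uncensored paths."""
    rng = random.Random(seed)
    times = [simulate_hitting_time(x0, mu, sigma, rng=rng, **kwargs)
             for _ in range(n_paths)]
    observed = [t for t in times if t is not None]
    if not observed:
        return None, 0.0
    return sum(observed) / len(observed), len(observed) / n_paths

if __name__ == "__main__":
    mean_t, frac = mean_hitting_time(x0=2.0, mu=-1.0, sigma=1.0)
    print("mean hitting time:", mean_t, "uncensored fraction:", frac)
```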

Operational or analytical time scale

The time scale of the stochastic process may be calendar or clock time or some more operational measure of time progression, such as mileage of a car, accumulated wear and tear on a machine component or accumulated exposure to toxic fumes. In many applications, the stochastic process describing the system state is latent or unobservable and its properties must be inferred indirectly from censored time-to-event data and/or readings taken over time on correlated processes, such as marker processes. The word ‘regression’ in threshold regression refers to first-hitting-time models in which one or more regression structures are inserted into the model in order to connect model parameters to explanatory variables or covariates. The parameters given regression structures may be parameters of the stochastic process, the threshold state and/or the time scale itself.
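
A regression structure of this kind can be sketched as follows. The example assumes a Wiener parent process with unit variance and a threshold at zero, a log link for the initial value ($\ln x_{0}=z\cdot \gamma$) and an identity link for the drift ($\mu =z\cdot \beta$). These link choices and all coefficient values are assumptions made for illustration, not a fitted model. The survival function used is the standard result for the first hitting time of a Wiener process with drift, $S(t)=\Phi ((x_{0}+\mu t)/{\sqrt {t}})-e^{-2\mu x_{0}}\Phi ((\mu t-x_{0})/{\sqrt {t}})$, where $\Phi$ is the standard normal distribution function.

```python
import math

def normal_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2))

def fht_survival(t, x0, mu):
    """P(T > t) for a unit-variance Wiener process from x0 > 0, drift mu, threshold 0."""
    root_t = math.sqrt(t)
    return (normal_cdf((x0 + mu * t) / root_t)
            - math.exp(-2 * mu * x0) * normal_cdf((mu * t - x0) / root_t))

def threshold_regression_survival(t, z, gamma, beta):
    """Survival probability for covariate vector z (first entry 1 for an intercept).

    Hypothetical links: ln(x0) = z . gamma and mu = z . beta.
    """
    x0 = math.exp(sum(g * zi for g, zi in zip(gamma, z)))
    mu = sum(b * zi for b, zi in zip(beta, z))
    return fht_survival(t, x0, mu)

if __name__ == "__main__":
    gamma = [0.5, 0.0]    # hypothetical coefficients for ln(x0)
    beta = [-0.2, -0.5]   # hypothetical coefficients for the drift
    for z in ([1.0, 0.0], [1.0, 1.0]):  # intercept plus one binary covariate
        print(z, [round(threshold_regression_survival(t, z, gamma, beta), 3)
                  for t in (1.0, 3.0, 10.0)])
```

With zero drift and $D=1/2$ the survival function reduces to the error-function form of the Brownian particle, and with positive drift it tends to $1-e^{-2\mu x_{0}}$, the probability that the threshold is never reached.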

References

[1] Redner, S. (2001). A Guide to First-Passage Processes. Cambridge University Press.
[2] Bachelier, L. Théorie de la spéculation. Annales scientifiques de l'École Normale Supérieure, Serie 3, Volume 17 (1900), pp. 21-86. doi:10.24033/asens.476. http://www.numdam.org/articles/10.24033/asens.476/
[3] Von E 1900
[4] Smoluchowski 1915
[5] Lundberg, F. (1903) Approximerad Framställning av Sannolikehetsfunktionen, Återförsäkering av Kollektivrisker, Almqvist & Wiksell, Uppsala.
[6] Tweedie 1945
[7] Tweedie 1957–1
[8] Tweedie 1957–2
[9] Whitmore 1970
[10] Lancaster 1972
[11] "Extended abstract".
[12] "A Quantitative Framework to Assess the Risk-Reward Profile of non Equity Products".
[13] Lee 2006
[14] Cox 1972
[15] Lee 2010
[16] Aaron 2010
[17] Chambaz 2014
[18] Aaron 2015
[19] He 2015
[20] Hou 2016

Whitmore, G. A. (1986). "First passage time models for duration data regression structures and competing risks". The Statistician. 35 (2): 207–219. doi:10.2307/2987525. JSTOR 2987525.
Whitmore, G. A. (1995). "Estimating degradation by a Wiener diffusion process subject to measurement error". Lifetime Data Analysis. 1 (3): 307–319. doi:10.1007/BF00985762. PMID 9385107. S2CID 28077957.
Whitmore, G. A.; Crowder, M. J.; Lawless, J. F. (1998). "Failure inference from a marker process based on a bivariate Wiener model". Lifetime Data Analysis. 4 (3): 229–251. doi:10.1023/A:1009617814586. PMID 9787604. S2CID 43301120.
Redner, S. (2001). A Guide to First-Passage Processes. Cambridge University Press. ISBN 0-521-65248-0.
Lee, M.-L. T.; Whitmore, G. A. (2006). "Threshold regression for survival analysis: Modeling event times by a stochastic process". Statistical Science. 21 (4): 501–513. arXiv: 0708.0346 . doi:10.1214/088342306000000330. S2CID 88518120.
Bachelier, L. (1900). "Théorie de la Spéculation". Annales Scientifiques de l'École Normale Supérieure. 3 (17): 21–86. doi: 10.24033/asens.476 .
Schrödinger, E. (1915). "Zur Theorie der Fall- und Steigversuche an Teilchen mit Brownscher Bewegung". Physikalische Zeitschrift. 16: 289–295.
Smoluchowski, M. V. (1915). "Notiz über die Berechnung der Brownschen Molekularbewegung bei der Ehrenhaft-millikanschen Versuchsanordnung". Physikalische Zeitschrift. 16: 318–321.
Lundberg, F. (1903). Approximerad Framställning av Sannolikehetsfunktionen, Återförsäkering av Kollektivrisker. Almqvist & Wiksell, Uppsala.
Tweedie, M. C. K. (1945). "Inverse statistical variates". Nature. 155 (3937): 453. Bibcode:1945Natur.155..453T. doi: 10.1038/155453a0 .
Tweedie, M. C. K. (1957). "Statistical properties of inverse Gaussian distributions – I". Annals of Mathematical Statistics. 28 (2): 362–377. doi: 10.1214/aoms/1177706964 .
Tweedie, M. C. K. (1957). "Statistical properties of inverse Gaussian distributions – II". Annals of Mathematical Statistics. 28 (3): 696–705. doi: 10.1214/aoms/1177706881 .
Whitmore, G. A.; Neufeldt, A. H. (1970). "An application of statistical models in mental health research". Bull. Math. Biophys. 32 (4): 563–579. doi:10.1007/BF02476771. PMID 5513393.
Lancaster, T. (1972). "A stochastic model for the duration of a strike". J. Roy. Statist. Soc. Ser. A. 135 (2): 257–271. doi:10.2307/2344321. JSTOR 2344321.
Cox, D. R. (1972). "Regression models and life tables (with discussion)". J R Stat Soc Ser B. 34 (2): 187–220. doi:10.1111/j.2517-6161.1972.tb00899.x.
Lee, M.-L. T.; Whitmore, G. A. (2010). "Threshold Proportional hazards and threshold regression: their theoretical and practical connections". Lifetime Data Analysis. 16 (2): 196–214. doi:10.1007/s10985-009-9138-0. PMC 6447409 . PMID 19960249.
Aaron, S. D.; Ramsay, T.; Vandemheen, K.; Whitmore, G. A. (2010). "A threshold regression model for recurrent exacerbations in chronic obstructive pulmonary disease". Journal of Clinical Epidemiology. 63 (12): 1324–1331. doi:10.1016/j.jclinepi.2010.05.007. PMID 20800447.
Chambaz, A.; Choudat, D.; Huber, C.; Pairon, J.; van der Laan, M. J. (2014). "Analysis of occupational exposure to asbestos based on threshold regression modeling of case-control data". Biostatistics. 15 (2): 327–340. doi:10.1093/biostatistics/kxt042. PMID 24115271.
Aaron, S. D.; Stephenson, A. L.; Cameron, D. W.; Whitmore, G. A. (2015). "A statistical model to predict one-year risk of death in patients with cystic fibrosis". Journal of Clinical Epidemiology. 68 (11): 1336–1345. doi:10.1016/j.jclinepi.2014.12.010. PMID 25655532.
He, X.; Whitmore, G. A.; Loo, G. Y.; Hochberg, M. C.; Lee, M.-L. T. (2015). "A model for time to fracture with a shock stream superimposed on progressive degradation: the Study of Osteoporotic Fractures". Statistics in Medicine. 34 (4): 652–663. doi:10.1002/sim.6356. PMC 4314426 . PMID 25376757.
Hou, W.-H.; Chuang, H.-Y.; Lee, M.-L. T. (2016). "A threshold regression model to predict return to work after traumatic limb injury". Injury. 47 (2): 483–489. doi:10.1016/j.injury.2015.11.032. PMID 26746983.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
