2026-06-30

Fokker-Planck Equation

The Fokker-Planck equation (FPE) is a partial differential equation that describes the time evolution of the probability density function of a stochastic process defined by a [[Stochastic Differential Equation (SDE)|stochastic differential equation]]. It provides the deterministic macroscopic description of the stochastic microscopic dynamics, bridging the gap between random particle trajectories and their ensemble distribution.

1. Core Concept

1.1 From Stochastic Trajectories to Deterministic Density

A single particle governed by an [[Stochastic Differential Equation (SDE)|SDE]] follows a random path:

d X_{t} = μ (X_{t}, t) d t + σ (X_{t}, t) d W_{t}

While each trajectory is random, the probability density $p (x, t)$ of finding the particle at position $x$ at time $t$ evolves deterministically according to the Fokker-Planck equation:

\frac{\partial p (x, t)}{\partial t} = - \frac{\partial}{\partial x} [μ (x, t) p (x, t)] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [σ^{2} (x, t) p (x, t)]

[!NOTE] Key Insight
The Fokker-Planck equation transforms a stochastic trajectory problem into a deterministic density evolution problem, much as the [[Schrödinger Equation]] gives a deterministic evolution law for quantum probability amplitudes. It is the forward Kolmogorov equation for continuous-state, continuous-time [[Markov Process|Markov processes]] with continuous sample paths (diffusion processes).

1.2 Physical Interpretation

Term	Expression	Physical Meaning
Drift term	$- \frac{\partial}{\partial x} [μ p]$	Probability flows in the direction of deterministic trend
Diffusion term	$\frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [σ^{2} p]$	Probability spreads out due to random noise

$μ (x, t)$ = drift coefficient: pushes the distribution’s mean
$σ (x, t)$ = diffusion coefficient: broadens the distribution
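
As a quick sanity check of the two terms, the sketch below samples the constant-coefficient SDE $d X_{t} = μ d t + σ d W_{t}$ exactly and compares the sample mean and variance with $x_{0} + μ t$ and $σ^{2} t$. The parameter values, sample size, and seed are illustrative assumptions, not values from this note.

```python
import numpy as np

# Illustrative values (assumptions, not from the text above)
mu, sigma, x0, t = 0.5, 0.8, 1.0, 2.0
rng = np.random.default_rng(0)

# With constant coefficients the SDE has the exact solution
# X_t = x0 + mu*t + sigma*W_t with W_t ~ N(0, t), so no time stepping is needed.
n_samples = 200_000
X_t = x0 + mu * t + sigma * np.sqrt(t) * rng.standard_normal(n_samples)

sample_mean = X_t.mean()   # drift term: expected value x0 + mu*t
sample_var = X_t.var()     # diffusion term: expected value sigma**2 * t
print(sample_mean, x0 + mu * t)
print(sample_var, sigma**2 * t)
```

The drift only moves the centre of the ensemble, while the diffusion only widens it; with state-dependent coefficients the two effects are no longer separable in this simple way.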

2. One-Dimensional Fokker-Planck Equation

2.1 Standard Form

For an [[Stochastic Differential Equation (SDE)|SDE]] with drift $μ (x, t)$ and diffusion $σ (x, t)$ :

\frac{\partial p}{\partial t} = - \frac{\partial}{\partial x} [μ (x, t) p] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [σ^{2} (x, t) p]

2.2 Probability Current Form

The FPE can be rewritten as a continuity equation (conservation of probability):

\frac{\partial p}{\partial t} + \frac{\partial J}{\partial x} = 0

where the probability current (probability flux) $J (x, t)$ is:

J (x, t) = μ (x, t) p (x, t) - \frac{1}{2} \frac{\partial}{\partial x} [σ^{2} (x, t) p (x, t)]

[!NOTE] Conservation of Probability
Just like the continuity equation in fluid dynamics ( $\frac{\partial ρ}{\partial t} + \nabla \cdot (ρ v) = 0$ ), the FPE ensures that total probability is conserved: $\int_{- \infty}^{\infty} p (x, t) d x = 1$ for all $t$ .
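
The continuity form can be checked numerically. The sketch below assumes constant coefficients, for which the density started from a point mass at the origin is the Gaussian $N (μ t, σ^{2} t)$, and evaluates the residual $\frac{\partial p}{\partial t} + \frac{\partial J}{\partial x}$ with finite differences. Grid size, step sizes, and parameter values are illustrative choices.

```python
import numpy as np

mu, sigma = 0.5, 0.8          # illustrative constant coefficients (assumed)

def p(x, t):
    """Gaussian density N(mu*t, sigma^2 t): the solution started from a point mass at 0."""
    var = sigma**2 * t
    return np.exp(-(x - mu * t) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

x = np.linspace(-10.0, 12.0, 4401)
dx = x[1] - x[0]
t, dt = 1.0, 1e-5

# Probability current J = mu*p - (1/2) d/dx[sigma^2 p]
J = mu * p(x, t) - 0.5 * sigma**2 * np.gradient(p(x, t), dx)

# Continuity residual dp/dt + dJ/dx, using a central difference in time
dp_dt = (p(x, t + dt) - p(x, t - dt)) / (2 * dt)
residual = dp_dt + np.gradient(J, dx)

total_probability = np.sum(p(x, t)) * dx   # Riemann sum of the density
print(np.max(np.abs(residual)), total_probability)
```

The residual should be at the level of the discretization error, and the total probability should stay at 1 because the current vanishes at both ends of the grid.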

2.3 Derivation from SDE

Step 1 — The infinitesimal generator $L$ of the [[Stochastic Differential Equation (SDE)|SDE]] is:

L f (x) = μ (x, t) \frac{\partial f}{\partial x} + \frac{1}{2} σ^{2} (x, t) \frac{\partial^{2} f}{\partial x^{2}}

Step 2 — Its adjoint operator $L^{*}$ governs the density evolution:

\frac{\partial p}{\partial t} = L^{*} p = - \frac{\partial}{\partial x} [μ p] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [σ^{2} p]

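Step 3 — Boundary conditions specify how probability behaves at the edges of the domain. The first two conserve total probability; the third removes it:

Natural boundary: $J (x, t) \to 0$ as $| x | \to \infty$
Reflecting boundary: $J = 0$ at boundary
Absorbing boundary: $p = 0$ at boundary (probability that reaches the boundary is lost, so $\int p d x$ decreases over time)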

3. Key Special Cases

3.1 Pure Diffusion ([[Wiener Process|Wiener Process]] / Brownian Motion)

For $μ = 0$ , $σ = 1$ (standard [[Wiener Process|Wiener Process]]):

\frac{\partial p}{\partial t} = \frac{1}{2} \frac{\partial^{2} p}{\partial x^{2}}

This is the heat equation (or diffusion equation). The solution with initial condition $p (x, 0) = δ (x - x_{0})$ is:

p (x, t) = \frac{1}{\sqrt{2 π t}} \exp (- \frac{(x - x_{0})^{2}}{2 t})

3.2 Ornstein-Uhlenbeck Process

For $μ (x) = - θ x$ with $θ > 0$ (mean-reverting) and constant $σ$ :

\frac{\partial p}{\partial t} = \frac{\partial}{\partial x} [θ x p] + \frac{σ^{2}}{2} \frac{\partial^{2} p}{\partial x^{2}}

Stationary solution:

p_{\infty} (x) = \sqrt{\frac{θ}{π σ^{2}}} \exp (- \frac{θ x^{2}}{σ^{2}}) \sim N (0, \frac{σ^{2}}{2 θ})
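
The stationary variance $\frac{σ^{2}}{2 θ}$ can be checked by simulation. The following is a minimal Euler-Maruyama sketch; the values of `theta`, `sigma`, the starting point, the step size, and the seed are assumptions chosen for illustration, and the time-stepping introduces a small bias of order `dt`.

```python
import numpy as np

theta, sigma = 1.5, 1.0        # illustrative parameters (assumed), theta > 0
dt, n_steps, n_paths = 0.01, 1000, 20_000
rng = np.random.default_rng(1)

# Euler-Maruyama for dX = -theta*X dt + sigma dW, all paths started at x = 3
X = np.full(n_paths, 3.0)
for _ in range(n_steps):
    X += -theta * X * dt + sigma * np.sqrt(dt) * rng.standard_normal(n_paths)

# After t = 10 >> 1/theta the ensemble should be close to N(0, sigma^2 / (2*theta))
empirical_mean = X.mean()
empirical_var = X.var()
stationary_var = sigma**2 / (2 * theta)
print(empirical_mean, empirical_var, stationary_var)
```

Starting every path far from the origin makes the point that the stationary law does not depend on the initial condition.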

3.3 Geometric Brownian Motion

For $μ (x) = μ x$ , $σ (x) = σ x$ (used in finance):

\frac{\partial p}{\partial t} = - \frac{\partial}{\partial x} [μ x p] + \frac{σ^{2}}{2} \frac{\partial^{2}}{\partial x^{2}} [x^{2} p]

Solution for the initial condition $p (x, 0) = δ (x - x_{0})$ with $x_{0} > 0$ (log-normal distribution on $x > 0$ ):

p (x, t) = \frac{1}{x σ \sqrt{2 π t}} \exp (- \frac{(\ln x - \ln x_{0} - (μ - σ^{2} / 2) t)^{2}}{2 σ^{2} t})
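
A short numerical check of this density: the sketch below integrates it on a grid and compares the total mass with 1 and the mean with the known value $x_{0} e^{μ t}$. The parameter values and the integration range are illustrative assumptions.

```python
import numpy as np

mu, sigma, x0, t = 0.05, 0.2, 100.0, 1.0   # illustrative values (assumed)

def gbm_density(x, t):
    """Log-normal transition density of geometric Brownian motion started at x0."""
    z = np.log(x) - np.log(x0) - (mu - 0.5 * sigma**2) * t
    return np.exp(-z**2 / (2 * sigma**2 * t)) / (x * sigma * np.sqrt(2 * np.pi * t))

x = np.linspace(1.0, 500.0, 200_001)
dx = x[1] - x[0]
p = gbm_density(x, t)

mass = np.sum(p) * dx            # should be close to 1
mean = np.sum(x * p) * dx        # should be close to x0 * exp(mu * t)
print(mass, mean, x0 * np.exp(mu * t))
```

Note that the mean grows at rate $μ$ while the median grows at rate $μ - σ^{2} / 2$, which is the shift visible inside the exponent.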

3.4 Summary Table

Process	Drift $μ (x)$	Diffusion $σ (x)$	FPE Type
Wiener Process	$0$	$1$	Heat equation
OU Process	$- θ x$	$σ$	Mean-reverting
Geometric BM	$μ x$	$σ x$	Log-normal
Constant drift	$μ$	$σ$	Convection-diffusion
VP-[[Stochastic Differential Equation (SDE)|SDE]]	$- \frac{1}{2} β (t) x$	$\sqrt{β (t)}$	Variance-preserving
VE-[[Stochastic Differential Equation (SDE)|SDE]]	$0$	$\sqrt{\frac{d σ^{2}}{d t}}$	Variance-exploding

4. Multidimensional Fokker-Planck Equation

4.1 General $d$ -Dimensional Form

For a multivariate SDE:

d X_{t} = μ (X_{t}, t) d t + σ (X_{t}, t) d W_{t}

where $W_{t} \in R^{m}$ is an $m$ -dimensional [[Wiener Process|Wiener process]], the Fokker-Planck equation becomes:

\frac{\partial p}{\partial t} = - \sum_{i = 1}^{d} \frac{\partial}{\partial x_{i}} [μ_{i} p] + \frac{1}{2} \sum_{i = 1}^{d} \sum_{j = 1}^{d} \frac{\partial^{2}}{\partial x_{i} \partial x_{j}} [D_{i j} p]

where the diffusion matrix is $D = σ σ^{⊤} \in R^{d \times d}$ .
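
To make the role of $D$ concrete, the sketch below builds $D = σ σ^{⊤}$ for a hypothetical constant $σ \in R^{2 \times 3}$ and checks that, with zero drift, the sample covariance of $X_{t} = σ W_{t}$ matches $D t$. The matrix entries, time, sample size, and seed are made up for illustration.

```python
import numpy as np

# Hypothetical constant noise matrix sigma in R^{d x m} with d = 2, m = 3
sigma = np.array([[1.0, 0.5, 0.0],
                  [0.0, 0.3, 0.4]])
D = sigma @ sigma.T               # diffusion matrix, shape (d, d)

# With zero drift, X_t = sigma @ W_t, so Cov(X_t) = D * t
rng = np.random.default_rng(2)
t, n_samples = 2.0, 200_000
W_t = np.sqrt(t) * rng.standard_normal((n_samples, 3))
X_t = W_t @ sigma.T
empirical_cov = np.cov(X_t, rowvar=False)

print(D * t)
print(empirical_cov)
```

Only $D$ enters the density evolution, so two different noise matrices with the same $σ σ^{⊤}$ give the same Fokker-Planck equation.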

4.2 Compact Operator Notation

\frac{\partial p}{\partial t} = - \nabla \cdot (μ p) + \frac{1}{2} \nabla \cdot (\nabla \cdot (D p))

or in terms of the probability current $J$ :

\frac{\partial p}{\partial t} + \nabla \cdot J = 0, J = μ p - \frac{1}{2} \nabla \cdot (D p)

4.3 Isotropic Noise Special Case

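When $σ (x, t) = σ I$ with a scalar $σ$ that does not depend on $x$ (independent noise of equal strength in each dimension), the diffusion matrix is $D = σ^{2} I$ and: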

\frac{\partial p}{\partial t} = - \nabla \cdot (μ p) + \frac{σ^{2}}{2} \nabla^{2} p

This is the form most commonly encountered in [[Diffusion Model|diffusion models]].

5. Stationary Distribution

5.1 Equilibrium Condition

For a time-homogeneous [[Stochastic Differential Equation (SDE)|SDE]] ( $μ$ and $σ$ independent of $t$ ), the stationary distribution $p_{\infty} (x)$ satisfies $\frac{\partial p_{\infty}}{\partial t} = 0$ :

0 = - \frac{d}{d x} [μ (x) p_{\infty} (x)] + \frac{1}{2} \frac{d^{2}}{d x^{2}} [σ^{2} (x) p_{\infty} (x)]

5.2 Closed-Form Solution (1D)

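Integrating the stationary equation once shows that the probability current $J$ is constant in $x$ . With natural or reflecting boundaries that constant is zero ( $J = 0$ ), which gives: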

μ (x) p_{\infty} (x) = \frac{1}{2} \frac{d}{d x} [σ^{2} (x) p_{\infty} (x)]

Solution:

p_{\infty} (x) = \frac{C}{σ^{2} (x)} \exp (2 \int_{x_{0}}^{x} \frac{μ (y)}{σ^{2} (y)} d y)

where $C$ is a normalization constant such that $\int p_{\infty} (x) d x = 1$ .
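
The closed-form expression is straightforward to evaluate on a grid. The sketch below is one possible implementation (the function name `stationary_density` and the grid are assumptions); it uses a cumulative trapezoidal rule for the integral and fixes $C$ by normalizing, then compares the result with the Ornstein-Uhlenbeck stationary density from section 3.2.

```python
import numpy as np

def stationary_density(mu, sigma, x):
    """Evaluate p_inf(x) = C / sigma^2 * exp(2 * int mu/sigma^2) on a uniform grid x."""
    s2 = sigma(x) ** 2
    integrand = mu(x) / s2
    dx = x[1] - x[0]
    # Cumulative trapezoidal integral from x[0]; the lower limit only rescales C
    steps = 0.5 * (integrand[1:] + integrand[:-1]) * dx
    integral = np.concatenate(([0.0], np.cumsum(steps)))
    log_p = 2.0 * integral - np.log(s2)
    p = np.exp(log_p - log_p.max())         # subtract the max to avoid overflow
    return p / (np.sum(p) * dx)             # fix C by normalizing to unit mass

# Check against the Ornstein-Uhlenbeck case (theta and sigma0 are illustrative values)
theta, sigma0 = 1.5, 1.0
x = np.linspace(-5.0, 5.0, 2001)
p_num = stationary_density(lambda y: -theta * y, lambda y: sigma0 * np.ones_like(y), x)
p_exact = np.sqrt(theta / (np.pi * sigma0**2)) * np.exp(-theta * x**2 / sigma0**2)
max_error = np.max(np.abs(p_num - p_exact))
print(max_error)
```

Working with the logarithm of the density keeps the evaluation stable when the exponent is large, which matters for steep potentials or small noise.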

5.3 Potential Form

If $μ (x) = - \frac{d U (x)}{d x}$ (the negative gradient of a potential $U$ ) and $σ$ is constant:

p_{\infty} (x) = C \exp (- \frac{2 U (x)}{σ^{2}})

This is the Gibbs-Boltzmann distribution with “temperature” $\frac{σ^{2}}{2}$ .

[!NOTE] Physical Analogy
The stationary distribution corresponds to thermal equilibrium in statistical mechanics. The drift $μ (x) = - \nabla U (x)$ drives the system toward lower potential energy, while diffusion ( $σ$ ) adds thermal fluctuations.
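
As an illustration of the potential form, the sketch below evaluates $p_{\infty} \propto \exp (- 2 U / σ^{2})$ for a hypothetical double-well potential $U (x) = x^{4} / 4 - x^{2} / 2$ and locates its modes. The potential and the noise level are assumptions chosen for the example.

```python
import numpy as np

sigma = 0.7                       # illustrative noise level (assumed)

def U(x):
    """Hypothetical double-well potential with minima at x = -1 and x = +1."""
    return 0.25 * x**4 - 0.5 * x**2

x = np.linspace(-3.0, 3.0, 6001)
dx = x[1] - x[0]
p = np.exp(-2.0 * U(x) / sigma**2)
p /= np.sum(p) * dx               # normalization constant C

# The density peaks at the minima of U and dips at the barrier x = 0
right_mode = x[x > 0][np.argmax(p[x > 0])]
left_mode = x[x < 0][np.argmax(p[x < 0])]
barrier_ratio = p[np.argmin(np.abs(x))] / p.max()
print(left_mode, right_mode, barrier_ratio)
```

The ratio between the density at the barrier and at the wells equals $\exp (- 2 ΔU / σ^{2})$ with barrier height $ΔU = 1 / 4$ here, so lowering $σ$ concentrates the mass in the wells exponentially fast.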

6. Forward and Backward Kolmogorov Equations

Aspect	Forward (Fokker-Planck)	Backward (Kolmogorov)
Variable	Future state $x$ (at time $t$ )	Starting state $x$ (at time $t < T$ )
Operates on	Probability density $p (x, t ∣ x_{0})$	Expectation $u (x, t) = E [f (X_{T}) ∣ X_{t} = x]$
Equation	$\frac{\partial p}{\partial t} = L^{*} p$	$\frac{\partial u}{\partial t} + L u = 0$
Boundary condition	Initial distribution $p (x, 0)$ , solved forward in time	Terminal payoff $u (x, T) = f (x)$ , solved backward in time
Application	Density evolution, diffusion models	Option pricing (Feynman-Kac), hitting probabilities

6.1 Backward Kolmogorov Equation

The backward equation governs the evolution of expectations:

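\frac{\partial u (x, t)}{\partial t} + μ (x) \frac{\partial u}{\partial x} + \frac{1}{2} σ^{2} (x) \frac{\partial^{2} u}{\partial x^{2}} = 0

with terminal condition $u (x, T) = f (x)$ , so that $u (x, t) = E [f (X_{T}) ∣ X_{t} = x]$ and in particular $u (x_{0}, 0) = E [f (X_{T}) ∣ X_{0} = x_{0}]$ . Equivalently, in terms of the time to go $τ = T - t$ , this reads $\frac{\partial u}{\partial τ} = L u$ with initial condition $u = f$ at $τ = 0$ .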
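
The expectation $E [f (X_{T}) ∣ X_{0} = x_{0}]$ can also be estimated directly by sampling. The sketch below assumes standard Brownian motion ( $μ = 0$ , $σ = 1$ ) and the payoff $f (x) = x^{2}$ , for which the exact value is $x_{0}^{2} + T$ ; the numbers and the seed are illustrative.

```python
import numpy as np

# Illustrative setup (assumed): Brownian motion (mu = 0, sigma = 1), payoff f(x) = x^2
x0, T = 0.7, 1.5
rng = np.random.default_rng(3)

def f(x):
    return x**2

# X_T = x0 + W_T exactly, so it can be sampled without time stepping
X_T = x0 + np.sqrt(T) * rng.standard_normal(400_000)
u_mc = f(X_T).mean()            # Monte Carlo estimate of E[f(X_T) | X_0 = x0]
u_exact = x0**2 + T             # closed form for this payoff
print(u_mc, u_exact)
```

The PDE route gives the value for every starting point at once, whereas the sampling route gives one starting point per run but scales to high dimensions.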

6.2 Feynman-Kac Formula

The backward equation extends to include a potential (discount) term:

\frac{\partial u}{\partial t} + μ \frac{\partial u}{\partial x} + \frac{σ^{2}}{2} \frac{\partial^{2} u}{\partial x^{2}} - r (x) u = 0

with solution:

u (x, t) = E [f (X_{T}) \exp (- \int_{t}^{T} r (X_{s}) d s) | X_{t} = x]
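
The expectation on the right-hand side can be approximated by simulating paths and accumulating the discount integral along each one. The following is a sketch under stated assumptions: `feynman_kac_mc` is a hypothetical helper, the coefficients are time-homogeneous callables, and the check uses Brownian motion with constant $r$ and $f (x) = x^{2}$ , where $u (x, t) = e^{- r (T - t)} (x^{2} + T - t)$ .

```python
import numpy as np

def feynman_kac_mc(mu, sigma, r, f, x, t, T, n_paths=100_000, n_steps=200, seed=0):
    """Estimate E[f(X_T) exp(-int_t^T r(X_s) ds) | X_t = x] with Euler-Maruyama paths."""
    rng = np.random.default_rng(seed)
    dt = (T - t) / n_steps
    X = np.full(n_paths, float(x))
    discount_integral = np.zeros(n_paths)
    for _ in range(n_steps):
        discount_integral += r(X) * dt          # left-endpoint rule for int r(X_s) ds
        dW = np.sqrt(dt) * rng.standard_normal(n_paths)
        X += mu(X) * dt + sigma(X) * dW
    return np.mean(f(X) * np.exp(-discount_integral))

# Check with constant r on Brownian motion (all values are illustrative)
r0, x_start, t0, T = 0.3, 0.7, 0.0, 1.5
u_mc = feynman_kac_mc(lambda y: 0.0 * y, lambda y: 1.0 + 0.0 * y,
                      lambda y: r0 + 0.0 * y, lambda y: y**2, x_start, t0, T)
u_exact = np.exp(-r0 * (T - t0)) * (x_start**2 + (T - t0))
print(u_mc, u_exact)
```

For a state-dependent $r (x)$ no closed form is generally available, and the same routine applies unchanged.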

7. Connection to Diffusion Models

7.1 Forward Process as Fokker-Planck Evolution

In [[Diffusion Model|diffusion models]], the forward process follows an [[Stochastic Differential Equation (SDE)|SDE]]:

d x_{t} = f (t) x_{t} d t + g (t) d W_{t}

The Fokker-Planck equation describing $p_{t} (x)$ is:

\frac{\partial p_{t} (x)}{\partial t} = - \nabla_{x} \cdot [f (t) x p_{t} (x)] + \frac{1}{2} g (t)^{2} \nabla_{x}^{2} p_{t} (x)

The initial condition is the data distribution $p_{0} (x) = p_{data} (x)$ , and after sufficient time $T$ , $p_{T} (x) \approx N (0, σ_{T}^{2} I)$ .

7.2 VP-SDE and VE-SDE Specific Forms

Variance-Preserving (VP) [[Stochastic Differential Equation (SDE)|SDE]] ( $f (t) = - \frac{1}{2} β (t)$ , $g (t) = \sqrt{β (t)}$ ):

\frac{\partial p_{t}}{\partial t} = \frac{1}{2} β (t) \nabla_{x} \cdot (x p_{t}) + \frac{1}{2} β (t) \nabla_{x}^{2} p_{t}
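
Taking first and second moments of the VP equation in one dimension gives $\frac{d m}{d t} = - \frac{1}{2} β (t) m$ and $\frac{d v}{d t} = β (t) (1 - v)$ for the mean $m$ and variance $v$ . The sketch below integrates these for a point mass at $x_{0}$ and compares with the closed form $m = x_{0} e^{- B (t) / 2}$ , $v = 1 - e^{- B (t)}$ , where $B (t) = \int_{0}^{t} β (s) d s$ . The linear schedule and its endpoints are assumed values, not taken from this note.

```python
import numpy as np

beta_min, beta_max = 0.1, 20.0            # illustrative linear schedule (assumed values)

def beta(t):
    return beta_min + t * (beta_max - beta_min)

def closed_form(x0, t):
    """Mean and variance of p_t for a point mass at x0 under the VP-SDE."""
    B = beta_min * t + 0.5 * t**2 * (beta_max - beta_min)   # B(t) = int_0^t beta(s) ds
    return x0 * np.exp(-0.5 * B), 1.0 - np.exp(-B)

# Moment equations implied by the VP Fokker-Planck equation:
#   dm/dt = -0.5 * beta(t) * m,    dv/dt = beta(t) * (1 - v)
x0, n_steps = 2.0, 1000
dt = 1.0 / n_steps
m, v = x0, 0.0
for k in range(n_steps):
    b = beta((k + 0.5) * dt)              # beta held constant at its midpoint value per step
    m *= np.exp(-0.5 * b * dt)
    v = 1.0 + (v - 1.0) * np.exp(-b * dt)

m_exact, v_exact = closed_form(x0, 1.0)
print((m, v), (m_exact, v_exact))
```

By $t = 1$ the mean has decayed to nearly zero and the variance is nearly one, which is the sense in which $p_{T}$ approaches a standard Gaussian.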

Variance-Exploding (VE) [[Stochastic Differential Equation (SDE)|SDE]] ( $f (t) = 0$ , $g (t) = \sqrt{\frac{d σ^{2} (t)}{d t}}$ ):

\frac{\partial p_{t}}{\partial t} = \frac{1}{2} \frac{d σ^{2} (t)}{d t} \nabla_{x}^{2} p_{t}

7.3 Connection to [[Probability Flow ODE]]

The Fokker-Planck continuity form:

\frac{\partial p_{t}}{\partial t} = - \nabla_{x} \cdot [v_{t} (x) p_{t} (x)]

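holds for the forward process once the diffusion term is rewritten as a divergence using $\nabla_{x} p_{t} = p_{t} \nabla_{x} \log p_{t}$ . This defines the [[Probability Flow ODE]] velocity field: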

v_{t} (x) = f (t) x - \frac{1}{2} g (t)^{2} \nabla_{x} \log p_{t} (x)

This means the density transported by the [[Probability Flow ODE]] obeys the same equation as the density of the forward [[Stochastic Differential Equation (SDE)|SDE]]. Started from the same initial distribution, the two therefore share the same marginal densities $p_{t} (x)$ at all times.
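
A one-dimensional case where the score is known in closed form makes this concrete. The sketch below assumes constant $β$ with $f (t) = - β / 2$ , $g (t) = \sqrt{β}$ and Gaussian data $p_{0} = N (0, s_{0}^{2})$ , so that $p_{t} = N (0, v (t))$ with $v (t) = s_{0}^{2} e^{- β t} + 1 - e^{- β t}$ and score $- x / v (t)$ . It integrates the deterministic ODE with RK4 and compares the variance growth with the SDE marginal. All constants are illustrative.

```python
import numpy as np

beta, s0 = 2.0, 0.5      # illustrative constants (assumed)

def variance(t):
    """Variance of p_t when p_0 = N(0, s0^2) under dx = -(beta/2) x dt + sqrt(beta) dW."""
    return s0**2 * np.exp(-beta * t) + 1.0 - np.exp(-beta * t)

def velocity(x, t):
    """Probability flow velocity f(t) x - (1/2) g(t)^2 * score, with score = -x / variance(t)."""
    score = -x / variance(t)
    return -0.5 * beta * x - 0.5 * beta * score

# Push samples of p_0 through the deterministic ODE dx/dt = v_t(x) with RK4 steps
rng = np.random.default_rng(4)
x = s0 * rng.standard_normal(10_000)
var_start = x.var()
T, n_steps = 1.0, 500
h = T / n_steps
for k in range(n_steps):
    t = k * h
    k1 = velocity(x, t)
    k2 = velocity(x + 0.5 * h * k1, t + 0.5 * h)
    k3 = velocity(x + 0.5 * h * k2, t + 0.5 * h)
    k4 = velocity(x + h * k3, t + h)
    x = x + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)

ratio_ode = x.var() / var_start             # variance growth along the ODE
ratio_sde = variance(T) / variance(0.0)     # variance growth of the SDE marginals
print(ratio_ode, ratio_sde)
```

No noise is injected during the integration, yet the ensemble variance follows the SDE marginal, because both evolutions push the density with the same probability current.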

[!NOTE] Key Bridge
The Fokker-Planck equation is the mathematical bridge connecting three equivalent descriptions of diffusion models:

SDE (stochastic trajectories)

Probability Flow ODE (deterministic trajectories)

Score matching (density gradients)

7.4 Score Function Role

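In the Fokker-Planck framework, the [[Score Function|score function]] $\nabla_{x} \log p_{t} (x)$ appears when the diffusive part of the probability current is written as an effective drift: $- \frac{1}{2} g (t)^{2} \nabla_{x} p_{t} = - \frac{1}{2} g (t)^{2} p_{t} \nabla_{x} \log p_{t}$ . Knowing the score at every $t$ is therefore enough to transport samples between $p_{0}$ and $p_{T}$ in either direction, which is what the reverse (generative) process exploits to move from noise back toward the data distribution.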

8. Numerical Methods

8.1 Finite Difference Method

Discretize the spatial domain and approximate derivatives:

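# 1D Fokker-Planck solver (finite difference)
import numpy as np

def build_fp_operator(mu, sigma, x_grid, t, dx):
    """Central-difference matrix for p -> -∂/∂x[μp] + (1/2)∂²/∂x²[σ²p], with p = 0 outside the grid."""
    m = mu(x_grid, t) * np.ones_like(x_grid)
    D = 0.5 * (sigma(x_grid, t) * np.ones_like(x_grid)) ** 2
    L = np.diag(-2.0 * D / dx**2)
    L += np.diag(D[1:] / dx**2 - m[1:] / (2 * dx), k=1)     # weight of p[i+1] in row i
    L += np.diag(D[:-1] / dx**2 + m[:-1] / (2 * dx), k=-1)  # weight of p[i-1] in row i
    return L

def fokker_planck_fd(mu, sigma, x_grid, t_grid, p0):
    """
    Solve ∂p/∂t = -∂/∂x[μp] + (1/2)∂²/∂x²[σ²p]
    Using Crank-Nicolson (implicit, second order in time)

    mu(x, t) and sigma(x, t) are callables; x_grid and t_grid are uniform float arrays.
    The grid must be wide enough that p stays negligible at its ends.
    """
    dx = x_grid[1] - x_grid[0]
    dt = t_grid[1] - t_grid[0]
    N = len(x_grid)
    identity = np.eye(N)

    p = p0.copy()
    p_history = [p0.copy()]

    for n in range(1, len(t_grid)):
        # Crank-Nicolson step: (I - dt/2 L) p_new = (I + dt/2 L) p_old,
        # with coefficients evaluated at the midpoint of the time step
        t_mid = 0.5 * (t_grid[n - 1] + t_grid[n])
        L = build_fp_operator(mu, sigma, x_grid, t_mid, dx)
        A = identity - 0.5 * dt * L
        B = identity + 0.5 * dt * L

        p = np.linalg.solve(A, B @ p)  # dense solve; adequate for a few hundred grid points
        p_history.append(p.copy())

    return p_history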

8.2 Monte Carlo Approach

Instead of solving the PDE, simulate many SDE trajectories and estimate the density:

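import numpy as np

def fokker_planck_mc(mu, sigma, x0, T, n_paths=10000, dt=0.001, seed=None):
    """Estimate density at time T via Monte Carlo simulation (Euler-Maruyama)."""
    rng = np.random.default_rng(seed)
    n_steps = int(round(T / dt))   # fixed step count avoids float drift in `t < T`
    X = np.full(n_paths, x0, dtype=float)

    # Advance all paths together; mu and sigma must accept arrays of states
    for k in range(n_steps):
        t = k * dt
        dW = np.sqrt(dt) * rng.standard_normal(n_paths)
        X += mu(X, t) * dt + sigma(X, t) * dW

    # Estimate density from samples; return bin centers so both arrays have equal length
    density, edges = np.histogram(X, bins=100, density=True)
    centers = 0.5 * (edges[:-1] + edges[1:])
    return centers, density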

8.3 Method Comparison

Method	Accuracy	Speed	Dimension	Best For
Finite Difference	High	Slow	$d \leq 3$	Low-dim, high precision
Finite Element	High	Medium	$d \leq 3$	Complex geometries
Monte Carlo	Low-Medium	Fast	Any $d$	High dimensions
Spectral Methods	Very High	Fast	$d \leq 2$	Smooth, periodic problems
Deep Learning	Medium	Medium	Any $d$	High-dim, irregular domains

9. Mathematical Properties

9.1 Conservation and Positivity

Probability conservation: $\frac{d}{d t} \int p (x, t) d x = 0$
Positivity preservation: If $p (x, 0) \geq 0$ , then $p (x, t) \geq 0$ for all $t > 0$
Smoothing property: for any $t > 0$ the solution is smooth, $p (x, t) \in C^{\infty}$ , even for rough initial data, provided the coefficients are smooth and the equation is uniformly elliptic ( $σ^{2} \geq c > 0$ )

9.2 Connection to [[Itô’s Lemma]]

The Fokker-Planck equation can be derived from [[Itô’s Lemma]] by considering the expected time evolution of a test function $φ (x)$ :

\frac{d}{d t} E [φ (X_{t})] = E [L φ (X_{t})]

where $L$ is the infinitesimal generator. Integration by parts yields the FPE.
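
The integration-by-parts step amounts to the adjoint identity $\int (L φ) p d x = \int φ (L^{*} p) d x$ , which can be checked on a grid. The sketch below assumes the drift $μ (x) = - x$ , $σ = 1$ , the test function $φ (x) = x^{2}$ , and a Gaussian density $N (1, 0.25)$ ; all of these are illustrative choices.

```python
import numpy as np

# Illustrative choices (assumed): drift mu = -x, sigma = 1, test function phi(x) = x^2
x = np.linspace(-10.0, 10.0, 8001)
dx = x[1] - x[0]
mu = -x
sigma2 = np.ones_like(x)
p = np.exp(-(x - 1.0) ** 2 / (2 * 0.5**2)) / np.sqrt(2 * np.pi * 0.5**2)   # density N(1, 0.25)

phi = x**2
# Generator acting on phi: L phi = mu * phi' + (1/2) sigma^2 * phi'', with phi' = 2x, phi'' = 2
L_phi = mu * (2 * x) + 0.5 * sigma2 * 2.0
# Adjoint acting on p: L* p = -(mu p)' + (1/2) (sigma^2 p)''
Lstar_p = -np.gradient(mu * p, dx) + 0.5 * np.gradient(np.gradient(sigma2 * p, dx), dx)

lhs = np.sum(L_phi * p) * dx        # E[L phi(X_t)]
rhs = np.sum(phi * Lstar_p) * dx    # int phi * (L* p) dx, i.e. d/dt E[phi(X_t)]
print(lhs, rhs)
```

For this example both sides equal $- 2 (m^{2} + s^{2}) + 1 = - 1.5$ with mean $m = 1$ and variance $s^{2} = 0.25$ . The boundary terms vanish because $p$ decays to zero well inside the grid.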

9.3 Ergodic Behavior

For an ergodic [[Markov Process|Markov process]]:

Unique stationary distribution $p_{\infty}$ exists
$p (x, t) \to p_{\infty} (x)$ as $t \to \infty$
Convergence rate determined by spectral gap of $L^{*}$

9.4 Entropy Production

The Fokker-Planck equation satisfies an $H$ -theorem: the relative entropy (KL divergence) to the stationary distribution decreases monotonically:

\frac{d}{d t} D_{KL} (p_{t} ∥ p_{\infty}) \leq 0

This implies the system irreversibly approaches equilibrium — a manifestation of the Second Law of Thermodynamics for stochastic systems.
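
For the Ornstein-Uhlenbeck process with a Gaussian initial condition the decay can be followed in closed form, since $p_{t}$ stays Gaussian with mean $m_{0} e^{- θ t}$ and variance $v_{\infty} + (v_{0} - v_{\infty}) e^{- 2 θ t}$ , where $v_{\infty} = \frac{σ^{2}}{2 θ}$ . The sketch below evaluates the Gaussian KL divergence along this path; the parameter values are illustrative assumptions.

```python
import numpy as np

theta, sigma = 1.0, 1.0          # illustrative OU parameters (assumed)
m0, v0 = 2.0, 0.05               # hypothetical Gaussian initial condition N(m0, v0)
v_inf = sigma**2 / (2 * theta)   # stationary variance

t = np.linspace(0.0, 5.0, 501)
m_t = m0 * np.exp(-theta * t)                            # mean of p_t
v_t = v_inf + (v0 - v_inf) * np.exp(-2 * theta * t)      # variance of p_t

# KL divergence between the Gaussians p_t = N(m_t, v_t) and p_inf = N(0, v_inf)
kl = 0.5 * (v_t / v_inf + m_t**2 / v_inf - 1.0 - np.log(v_t / v_inf))

is_monotone = bool(np.all(np.diff(kl) <= 0))
print(kl[0], kl[-1], is_monotone)
```

The late-time decay is governed by the slowest relaxing mode, here the mean, in line with the spectral-gap statement in section 9.3.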

10. Core Formula Cards

[!QUOTE] 1D Fokker-Planck Equation
$\frac{\partial p}{\partial t} = - \frac{\partial}{\partial x} [μ (x, t) p] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [σ^{2} (x, t) p]$

[!QUOTE] Probability Current Form
$\frac{\partial p}{\partial t} + \frac{\partial J}{\partial x} = 0, J = μ p - \frac{1}{2} \frac{\partial}{\partial x} [σ^{2} p]$

[!QUOTE] Multidimensional FPE
$\frac{\partial p}{\partial t} = - \sum_{i} \frac{\partial}{\partial x_{i}} [μ_{i} p] + \frac{1}{2} \sum_{i, j} \frac{\partial^{2}}{\partial x_{i} \partial x_{j}} [D_{i j} p]$

[!QUOTE] Stationary Distribution (1D, zero flux)
$p_{\infty} (x) = \frac{C}{σ^{2} (x)} \exp (2 \int_{x_{0}}^{x} \frac{μ (y)}{σ^{2} (y)} d y)$

[!QUOTE] Heat Equation ([[Wiener Process|Wiener Process]] / $μ = 0, σ = 1$ )
$\frac{\partial p}{\partial t} = \frac{1}{2} \frac{\partial^{2} p}{\partial x^{2}}$

[!QUOTE] Infinitesimal Generator
$L f (x) = μ (x) \frac{\partial f}{\partial x} + \frac{1}{2} σ^{2} (x) \frac{\partial^{2} f}{\partial x^{2}}$ , with density evolution $\frac{\partial p}{\partial t} = L^{*} p$

[!QUOTE] Backward Kolmogorov Equation
$\frac{\partial u}{\partial t} + μ (x) \frac{\partial u}{\partial x} + \frac{1}{2} σ^{2} (x) \frac{\partial^{2} u}{\partial x^{2}} = 0, u (x, T) = f (x)$

[!QUOTE] FPE in [[Diffusion Model|Diffusion Models]]
$\frac{\partial p_{t}}{\partial t} = - \nabla \cdot [f (t) x p_{t}] + \frac{1}{2} g (t)^{2} \nabla^{2} p_{t}$

11. Summary

Aspect	Description
What it describes	Time evolution of probability density under an SDE
Type	Second-order parabolic PDE
Input	Drift $μ (x, t)$ , diffusion $σ (x, t)$ , initial density $p (x, 0)$
Output	Density $p (x, t)$ at all future times
Key property	Conservation of total probability
Role in diffusion models	Bridges SDE description to Probability Flow ODE
Physical analog	Continuity equation, heat equation, Fick’s law of diffusion
Named after	Adriaan Fokker (1914) and Max Planck (1917)

[[Stochastic Differential Equation (SDE)]]
[[Wiener Process|Wiener Process]]
[[Probability Flow ODE]]
[[Diffusion Model]]
[[Itô’s Lemma]]
[[Itô Integral]]
[[Markov Process]]
[[Martingale]]
[[Score Function]]
[[Kolmogorov Equations]]
[[Neural ODE]]
[[Langevin Dynamics]]
[[Feynman-Kac Formula]]

Dataview Query

LIST
FROM #fokker_planck OR #pde OR #stochastic_process
SORT file.ctime DESC

References

Paper: Die mittlere Energie rotierender elektrischer Dipole im Strahlungsfeld (Fokker, 1914)
Paper: Über einen Satz der statistischen Dynamik und seine Erweiterung in der Quantentheorie (Planck, 1917)
Book: The Fokker-Planck Equation: Methods of Solution and Applications — Hannes Risken
Book: Stochastic Processes in Physics and Chemistry — Van Kampen
Book: Stochastic Differential Equations: An Introduction with Applications — Bernt Øksendal
Paper: Score-Based Generative Modeling through SDEs (Song et al., 2021)
Paper: Maximum Likelihood Training of Score-Based Diffusion Models (Song et al., 2021)
Blog: Fokker-Planck Equation and Diffusion Models — AI papers summary
Course: MIT 18.S096 Topics in Mathematics with Applications in Finance
Course: CS236 Deep Generative Models (Stanford)