Kolmogorov Equations
The Kolmogorov equations are a family of fundamental equations that govern the time evolution of transition probabilities in [[Markov Process|Markov processes]]. They form the mathematical backbone connecting discrete-state jump processes, continuous-state diffusions, and the PDE descriptions of stochastic dynamics — including the [[Fokker-Planck Equation|Fokker-Planck equation]] and the backward equation used in option pricing and hitting-time problems.
1. Core Concept
1.1 The Kolmogorov Triplet
Kolmogorov’s legacy in stochastic processes crystallizes into three interlocking equations:
1 | Kolmogorov Equations — Three Pillars |
| Equation | Domain | What It Describes | Key Application |
|---|---|---|---|
| Chapman-Kolmogorov | Discrete + Continuous time | Semigroup property of transitions | Foundation of all Markov models |
| Forward (Fokker-Planck) | Continuous time | Evolution of probability density
|
Diffusion models, physics, population dynamics |
| Backward | Continuous time | Evolution of conditional expectations
|
Option pricing, hitting times, Feynman-Kac |
1.2 Why They Matter Together
The three equations are not independent — they are different facets of the same underlying semigroup structure:
- Chapman-Kolmogorov is the algebraic identity — it asserts that transitions compose
- Forward equation is the differential identity — it describes the future of the density
- Backward equation is the adjoint identity — it describes the past-dependence of expectations
In modern [[Diffusion Model|diffusion models]], all three appear:
- Chapman-Kolmogorov: the Markov chain of the forward noising process
- Forward (Fokker-Planck): the evolution of
along the forward SDE - Backward: the foundation for score matching via denoising
2. Chapman-Kolmogorov Equation
2.1 Discrete-Time Markov Chains (DTMC)
For a discrete-time [[Markov Process|Markov chain]] with transition matrix
The Chapman-Kolmogorov equation states that multi-step transitions compose via matrix multiplication:
Or in matrix form:
Interpretation: To go from
2.2 Continuous-Time Markov Chains (CTMC)
For a CTMC with generator matrix
This is the semigroup property: the transition operator
2.3 General State Space
For a Markov process on a general (possibly continuous) state space with transition kernel
This is the most general form: to transition from
2.4 Probabilistic Interpretation
The Chapman-Kolmogorov equation is more than a formula — it’s a consistency condition:
If you know the 1-step transition probabilities, you know everything about the process.
1 | Time: 0 ────── s ────── s+t |
Every path from
3. Kolmogorov Forward Equation
3.1 CTMC Form (Master Equation)
Starting from
In component form:
Interpretation: The rate of change of
3.2 Diffusion Form (Fokker-Planck Equation)
For a [[Stochastic Differential Equation (SDE)|diffusion process]]
Compact operator notation:
where
[!NOTE] Unified View
Both the CTMC master equation and the Fokker-Planck equation are Kolmogorov forward equations — they differ only in the state space (discrete vs continuous) and the form of the generator.
3.3 Forward vs Fokker-Planck Terminology
| Context | Equation Name | Generator |
|---|---|---|
| Discrete state (CTMC) | Kolmogorov Forward / Master Equation |
|
| Continuous state (diffusion) | Fokker-Planck Equation / Forward Kolmogorov |
|
| General Markov process | Kolmogorov Forward Equation |
|
3.4 Role in Diffusion Models
In [[Diffusion Model|diffusion models]], the forward noising process is a Markov diffusion. Its density
This equation determines how the data distribution
4. Kolmogorov Backward Equation
4.1 CTMC Form
Differentiating
In component form:
Key difference from forward: In the forward equation, the sum is over the second index of
4.2 Diffusion Form
For a diffusion, the backward equation governs the conditional expectation:
with terminal condition
Compact operator form:
where
4.3 Forward vs Backward — Side-by-Side
| Aspect | Forward (Fokker-Planck) | Backward |
|---|---|---|
| Variable |
|
|
| Unknown | Density
|
Expectation
|
| Operator |
|
|
| Initial/Boundary |
|
|
| Direction | Forward in time | Backward in time |
| Lineage | From
|
From expectation at
|
| CTMC Form |
|
|
[!WARNING] The Adjoint Distinction
The forward equation uses(adjoint), the backward uses . For self-adjoint generators (e.g., pure Brownian motion ), forward and backward equations coincide — but this is the exception, not the rule.
4.4 Feynman-Kac Extension
The backward equation generalizes to include a potential (discount/killing) term
This has the stochastic representation:
Applications: Option pricing (Black-Scholes), exit problems, reaction-diffusion systems.
5. The Generator and Semigroup Framework
5.1 Transition Semigroup
The transition operators
Properties:
- Identity:
- Semigroup:
(Chapman-Kolmogorov) - Continuity:
(strong continuity)
5.2 Infinitesimal Generator
The generator
This single operator encodes ALL information about the process dynamics.
| Process | Generator
|
|---|---|
| [[Wiener Process|Wiener Process]] |
|
| General diffusion |
|
| CTMC |
|
| Jump-diffusion |
|
5.3 The Unified Kolmogorov Equations
From the semigroup property, BOTH Kolmogorov equations follow:
Forward:
→ Acting on the density:
Backward:
→ Acting on the test function:
1 | Chapman-Kolmogorov |
6. Connection to Diffusion Models
6.1 The Full Kolmogorov Picture
In [[Diffusion Model|diffusion models]], the Kolmogorov equations provide the complete mathematical scaffolding:
| Component | Kolmogorov Equation | Role |
|---|---|---|
| Forward noising process | Chapman-Kolmogorov |
|
| Marginal density evolution | Forward (Fokker-Planck) |
|
| Score matching | Backward | Links
|
| Probability Flow ODE | Forward (continuity form) | Same marginals as SDE |
| Reverse-time SDE | Backward (time-reversed) |
|
6.2 From Chapman-Kolmogorov to DDPM
The DDPM forward process is defined by discrete Markov transitions:
The Chapman-Kolmogorov equation allows compressing multiple steps:
Due to the Gaussian structure and Chapman-Kolmogorov, this simplifies to a single Gaussian:
This is the Chapman-Kolmogorov equation in action: multi-step transitions reduce to a single closed-form expression, making efficient training possible.
6.3 Kolmogorov Equations in Score-Based Models
In score-based generative models ([[Score Function|Score SDE]] framework):
- Forward equation describes how
spreads from data → noise - Backward equation describes the reverse-time dynamics for sampling
- The score function
appears in the backward equation as the drift correction term
The equivalence between SDE sampling and [[Probability Flow ODE]] sampling follows from the fact that both share the same Kolmogorov forward equation — they produce identical marginal distributions at all times.
7. Mathematical Properties
7.1 Uniqueness
Under standard regularity conditions (Lipschitz drift, bounded diffusion, non-degenerate noise):
- The forward equation has a unique solution for a given initial density
- The backward equation has a unique solution for a given terminal condition
- Both solutions are
(continuously differentiable in , twice in )
7.2 Positivity and Conservation
Both the forward and backward equations preserve fundamental properties:
- Forward:
for all (conservation of probability) - Forward:
(positivity preservation) - Backward: Maximum principle —
is bounded by its terminal values
7.3 Self-Adjoint Case
When
- [[Wiener Process|Wiener Process]]:
(the Laplacian is self-adjoint) - Gradient diffusions with symmetric potential:
In general diffusions with non-zero drift, the generator is not self-adjoint — forward and backward equations are genuinely different.
7.4 Spectral Interpretation
The forward and backward equations share the same spectrum (eigenvalues of
- Forward eigenfunctions = left eigenvectors of
- Backward eigenfunctions = right eigenvectors of
The spectral gap
8. Core Formula Cards
[!QUOTE] Chapman-Kolmogorov (General)
[!QUOTE] Chapman-Kolmogorov (DTMC)
[!QUOTE] Kolmogorov Forward (CTMC)
[!QUOTE] Kolmogorov Backward (CTMC)
[!QUOTE] Kolmogorov Forward (Diffusion / Fokker-Planck)
[!QUOTE] Kolmogorov Backward (Diffusion)
[!QUOTE] Infinitesimal Generator
[!QUOTE] Semigroup Property
9. Summary
| Aspect | Description |
|---|---|
| What they describe | Time evolution of transition probabilities in Markov processes |
| Chapman-Kolmogorov | Algebraic consistency: transitions compose via semigroup property |
| Forward (Fokker-Planck) | How probability density flows forward in time |
| Backward | How conditional expectations evolve backward from terminal conditions |
| Unifying framework | All three derive from the semigroup property of Markov transitions |
| Key distinction | Forward uses adjoint generator
|
| Role in diffusion models | Forward = density evolution; Backward = reverse process / score matching foundation |
| Named after | Andrey Kolmogorov (1931) — who also axiomatized probability theory |
Kolmogorov’s equations are the mathematical thread that connects the algebraic (Chapman-Kolmogorov), probabilistic (forward density), and analytic (backward expectations) descriptions of stochastic dynamics — a unification that remains at the heart of modern generative modeling.
Dataview Query
1 | LIST |
Related Concepts
- [[Markov Process]]
- [[Fokker-Planck Equation]]
- [[Stochastic Differential Equation (SDE)]]
- [[Wiener Process|Wiener Process]]
- [[Diffusion Model]]
- [[Score Function]]
- [[Probability Flow ODE]]
- [[Martingale]]
- [[Langevin Dynamics]]
- [[Feynman-Kac Formula]]
References
- Paper: Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung (Kolmogorov, 1931 — foundational paper)
- Book: Continuous Martingales and Brownian Motion (Revuz & Yor, Chapter III: Markov Processes)
- Book: Stochastic Differential Equations (Øksendal, Chapter 8: Diffusions and Kolmogorov Equations)
- Book: Markov Processes: Characterization and Convergence (Ethier & Kurtz)
- Book: Probability Theory and Stochastic Processes (Grimmett & Stirzaker)
- Book: Diffusion Models: A Comprehensive Guide (Yang Song, Chapter on Score SDE)
- Wikipedia: Chapman-Kolmogorov equation, Fokker-Planck equation, C₀-semigroup, Infinitesimal generator
"