Markov Process
A Markov Process is a stochastic process that satisfies the Markov Property: the future state depends only on the current state, not on the past history. This “memoryless” property makes Markov processes fundamental in modeling random systems across physics, finance, biology, and machine learning.
1. Core Concept
1.1 Markov Property
Intuitive Understanding:
“Given the present, the future is independent of the past.”
Formal Definition:
A stochastic process
for all
[!NOTE] Key Insight
The Markov property means that the current state contains all the information needed to predict the future. Past history provides no additional predictive power once we know the present.
1.2 State Space and Time
Markov processes are classified by:
State Space:
- Discrete: Finite or countable states (Markov Chain)
- Continuous: Real-valued or vector-valued states
Time Parameter:
- Discrete-time:
- Continuous-time:
1.3 Classification
| Type | Time | State Space | Example |
|---|---|---|---|
| Discrete-time Markov Chain (DTMC) | Discrete | Discrete | Random walk, board games |
| Continuous-time Markov Chain (CTMC) | Continuous | Discrete | Queueing systems, Poisson process |
| Markov Process (general) | Continuous | Continuous | [[Wiener Process |
| Hidden Markov Model (HMM) | Discrete | Discrete (hidden) | Speech recognition, NLP |
2. Discrete-Time Markov Chains (DTMC)
2.1 Definition
A discrete-time Markov chain is characterized by:
- State space:
- Initial distribution:
- Transition matrix:
, where
Properties of transition matrix:
-
for all -
for all (row stochastic)
2.2 Chapman-Kolmogorov Equation
The
Matrix form:
2.3 State Classification
Reachability:
- State
is reachable from state if for some
Communication:
- States
and communicate ( ) if is reachable from and vice versa
Classification:
| Property | Definition | Meaning |
|---|---|---|
| Recurrent | Return to state with probability 1 | Will visit again |
| Transient | Return probability < 1 | May never return |
| Absorbing |
|
Once entered, never leaves |
| Ergodic | Recurrent + aperiodic + positive recurrent | Converges to stationary distribution |
2.4 Stationary Distribution
A distribution
or equivalently:
Existence and Uniqueness Theorem:
For an irreducible, aperiodic, positive recurrent Markov chain:
- Unique stationary distribution
exists -
for all -
, where is return time to state
2.5 Example: Random Walk
Simple symmetric random walk on integers:
Transition probabilities:
Properties:
- If
: Null recurrent (returns with probability 1, but infinite expected return time) - If
: Transient (drifts to )
3. Continuous-Time Markov Chains (CTMC)
3.1 Definition
A continuous-time Markov chain is characterized by:
- State space:
- Generator matrix (Q-matrix):
Properties:
-
for (transition rates) -
(negative diagonal) -
for all
3.2 Transition Probability Matrix
The transition probability matrix
Forward equation:
Backward equation:
Solution:
3.3 Holding Times
Key property: The time spent in state
Memoryless property:
3.4 Embedded Markov Chain
The jump chain (embedded DTMC) has transition matrix:
3.5 Stationary Distribution
For an irreducible CTMC, stationary distribution
or:
4. General Markov Processes
4.1 Transition Kernel
For continuous state space, the transition probability is described by a kernel:
where
4.2 Chapman-Kolmogorov Equation (General)
4.3 Feller Property
A Markov process is Feller if its transition semigroup maps continuous functions to continuous functions. This ensures:
- Well-behaved sample paths
- Existence of generator
- Connection to PDEs
4.4 Infinitesimal Generator
The generator
Examples:
-
[[Wiener Process|Wiener Process]]:
-
[[Stochastic Differential Equation (SDE)|SDE]]
:
5. Connection to Other Concepts
5.1 Markov Process and [[Wiener Process|Wiener Process]]
The [[Wiener Process|Wiener Process]] is a Markov process with:
- Continuous state space:
- Continuous time:
- Transition density:
Markov property:
5.2 Markov Process and [[Stochastic Differential Equation (SDE)|SDE]]
Solutions to SDEs are Markov processes:
Conditions for Markov property:
- Coefficients
depend only on current state (not history) - [[Wiener Process|Wiener Process]] increments are independent
5.3 Markov Process and [[Martingale]]
Relationship:
- Not all Markov processes are [[Martingale|martingales]]
- Not all martingales are Markov processes
Example: [[Wiener Process|Wiener Process]] is both Markov and [[Martingale|martingale]].
[[Martingale]] condition for Markov process:
A Markov process
5.4 Markov Process and [[Diffusion Model|Diffusion Models]]
Forward process in diffusion models:
This is a Markov chain! The full forward process:
Key implications:
- Efficient sampling: only need current state
- Tractable posterior:
simplifies - Training objective decomposes over time steps
6. Important Theorems
6.1 Strong Markov Property
Definition: A process has the strong Markov property if the Markov property holds at stopping times
Theorem: [[Wiener Process|Wiener Process]] and solutions to SDEs satisfy the strong Markov property.
6.2 Ergodic Theorem
For an ergodic Markov chain with stationary distribution
Interpretation: Time average converges to ensemble average.
6.3 Convergence Rate
For a finite, irreducible, aperiodic Markov chain:
where:
-
is the second-largest eigenvalue modulus -
is a constant -
is total variation distance
Mixing time:
6.4 Reversibility
A Markov chain is reversible if it satisfies detailed balance:
Implications:
- Easier to compute stationary distribution
- Connection to physics (detailed balance in thermodynamics)
- Used in MCMC algorithms ([[Metropolis-Hastings]])
7. Applications
7.1 [[Diffusion Model|Diffusion Models]]
Forward process (Markov chain):
Key properties:
- Each step depends only on previous step
- Enables tractable computation of
- Reverse process also Markov (approximately)
7.2 [[Martingale|Martingale]] Theory
Markov martingales:
- [[Wiener Process|Wiener Process]]
- Geometric [[Wiener Process|Brownian motion]]
- Solutions to certain SDEs
Applications:
- Financial modeling (stock prices)
- Optimal stopping problems
- Stochastic control
7.3 Queueing Theory
M/M/1 queue:
- Arrivals: Poisson process (rate
) - Service: Exponential (rate
) - State: Number of customers in system
Transition rates:
-
(arrival) -
(departure)
Stationary distribution (if
7.4 Population Dynamics
Birth-death process:
- State: Population size
- Transitions: Birth (+1) or death (-1)
Applications:
- Ecology (species population)
- Epidemiology (disease spread)
- Genetics (allele frequencies)
7.5 Reinforcement Learning
Markov Decision Process (MDP):
- State space
- Action space
- Transition:
- Reward:
Bellman equation:
7.6 PageRank Algorithm
Web surfing as Markov chain:
- States: Web pages
- Transitions: Following links
- Stationary distribution: PageRank scores
Transition matrix:
where
8. Simulation and Sampling
8.1 DTMC Simulation
1 | def simulate_markov_chain(P, x0, n_steps): |
8.2 CTMC Simulation (Gillespie Algorithm)
1 | def simulate_ctmc(Q, x0, T_max): |
8.3 MCMC Sampling
[[Metropolis-Hastings]] algorithm:
1 | def metropolis_hastings(target_proposal, x0, n_samples): |
9. Advanced Topics
9.1 Hidden Markov Models (HMM)
Structure:
- Hidden states:
(Markov chain) - Observations:
(depends on )
Components:
- Transition matrix:
- Emission probabilities:
- Initial distribution:
Algorithms:
- Forward algorithm: Compute
- Viterbi algorithm: Find most likely state sequence
- Baum-Welch algorithm: Learn parameters (EM)
9.2 Markov Random Fields (MRF)
Definition: Undirected graphical model with Markov property:
where
Applications:
- Image segmentation
- Spatial statistics
- Physics (Ising model)
9.3 Jump-Diffusion Processes
Combine continuous diffusion with discrete jumps:
where
Applications:
- Financial modeling (asset prices with sudden crashes)
- Neuroscience (neuron spiking)
- Queueing systems
9.4 Feynman-Kac Formula
Connects Markov processes to PDEs:
solves the PDE:
where
10. Core Formula Cards
[!QUOTE] Markov Property
[!QUOTE] Chapman-Kolmogorov Equation (DTMC)
[!QUOTE] Stationary Distribution (DTMC)
[!QUOTE] Kolmogorov Forward Equation (CTMC)
[!QUOTE] Stationary Distribution (CTMC)
[!QUOTE] Infinitesimal Generator
[!QUOTE] Detailed Balance (Reversibility)
Related Concepts
- [[Stochastic Process]]
- [[Wiener Process|Wiener Process]]
- [[Stochastic Differential Equation (SDE)]]
- [[Martingale]]
- [[Poisson Process]]
- [[Diffusion Model]]
- [[Kolmogorov Equations]]
- [[Langevin Dynamics]]
- [[Metropolis-Hastings]]
- [[Hidden Markov Model]]
- [[Markov Decision Process]]
Dataview Query
1 | LIST |
References
- Book: Introduction to Probability Models - Sheldon Ross
- Book: Markov Chains - J.R. Norris
- Book: Stochastic Processes - Sheldon Ross
- Course: STAT 260 High-Dimensional Statistics (Berkeley)
- Course: CS236 Deep Generative Models (Stanford)
- Wikipedia: Markov chain, Markov property, Chapman-Kolmogorov equations