Inferring Markov Chains: Bayesian Estimation,
Model Comparison, Entropy Rate, and Out-of-class Modeling

Chris C. Strelioff
Center for Complex Systems Research and Department of Physics
University of Illinois at Urbana-Champaign
Urbana, IL 61801
and
Center for Computational Science and Engineering
University of California at Davis
Davis, CA 95616


James P. Crutchfield
Computational Science and Engineering Center and Physics Department
University of California, Davis
One Shields Ave, Davis CA 95616

Alfred Hubler
Center for Complex Systems Research and Department of Physics
University of Illinois at Urbana-Champaign
Urbana, IL 61801

ABSTRACT: Markov chains are a natural and well understood tool for describing one-dimensional patterns in time or space. We show how to infer k-th order Markov chains, for arbitrary k, from finite data by applying Bayesian methods to both parameter estimation and model-order selection. Extending existing results for multinomial models of discrete data, we connect inference to statistical mechanics through information-theoretic (type theory) techniques. We establish a direct relationship between Bayesian evidence and the partition function which allows for straightforward calculation of the expectation and variance of the conditional relative entropy and the source entropy rate. Finally, we introduce a novel method that uses finite data-size scaling with model-order comparison to infer the structure of out-of-class processes.


C. C. Strelioff, J. P. Crutchfield, and Alfred Hubler, "Inferring Markov Chains: Bayesian Estimation, Model Comparison, Entropy Rate, and Out-of-class Modeling", Physical Review E 76:1 (2007) 011106.
[pdf] 391 kb
Santa Fe Institute Working Paper 07-04-005. arxiv.org e-print math.ST/0703715.