Nearly Maximally Predictive Features and Their Dimensions

Sarah Marzen

Physics of Living Systems Group
Department of Physics
Massachusetts Institute of Technology
Cambridge, MA 02139

and

James P. Crutchfield

Complexity Sciences Center
Physics Department
University of California at Davis
Davis, CA 95616

ABSTRACT: Scientific explanation often requires inferring maximally predictive features from a given data set. Unfortunately, the collection of minimal maximally predictive features for most stochastic processes is uncountably infinite. In such cases, one compromises and instead seeks nearly maximally predictive features. Here, we derive upper-bounds on the rates at which the number and the coding cost of nearly maximally predictive features scales with desired predictive power. The rates are determined by the fractal dimensions of a process' mixed-state distribution. These results, in turn, show how widely-used finite-order Markov models can fail as predictors and that mixed-state predictive features offer a substantial improvement.


Sarah Marzen and James P. Crutchfield, "Nearly Maximally Predictive Features and Their Dimensions", Phys. Rev. E (Rapid Communication) 95:5 (2017) 051301(R). doi:10.1103/PhysRevE.95.051301.
[pdf] 885 KB
Santa Fe Institute Working Paper 17-02-007.
arxiv.org:1702.08565 [cond-mat.stat-mech].