Generalized maximum entropy estimation

Tobias Sutter; David Sutter; Peyman Mohajerin Esfahani; John Lygeros

Team Bart De Schutter

Research output: Contribution to journal › Article › Scientific › peer-review

5 Citations (Scopus)

125 Downloads (Pure)

Abstract

We consider the problem of estimating a probability distribution that maximizes the entropy while satisfying a finite number of moment constraints, possibly corrupted by noise. Based on duality of convex programming, we present a novel approximation scheme using a smoothed fast gradient method that is equipped with explicit bounds on the approximation error. We further demonstrate how the presented scheme can be used for approximating the chemical master equation through the zero-information moment closure method, and for an approximate dynamic programming approach in the context of constrained Markov decision processes with uncountable state and action spaces.
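The duality idea mentioned in the abstract can be illustrated on a much simpler case than the one the paper treats. The following is a minimal sketch, assuming a finite sample space and noise-free moment constraints: the maximum-entropy distribution then has the Gibbs form p_i ∝ exp(λ·T_i), and λ minimizes the convex dual log Σ_i exp(λ·T_i) − λ·m, whose gradient is E_p[T] − m. The sketch uses plain gradient descent, not the smoothed fast gradient method of the paper, and it carries none of the paper's error bounds. The function name `maxent_finite` and the loaded-die example (faces 1 to 6, prescribed mean 4.5) are illustrative assumptions, not taken from the source.

```python
import numpy as np

def maxent_finite(features, moments, n_iter=2000):
    """Maximum-entropy pmf on a finite set subject to E_p[features] = moments.

    features: array of shape (n_points, n_constraints)
    moments:  array of shape (n_constraints,)
    Hypothetical helper for illustration; plain dual gradient descent.
    """
    features = np.asarray(features, dtype=float)
    moments = np.asarray(moments, dtype=float)
    lam = np.zeros(features.shape[1])

    # The dual Hessian is Cov_p[features], whose largest eigenvalue is
    # bounded by max_i ||features_i||^2, so 1/bound is a safe step size.
    step = 1.0 / np.max(np.sum(features**2, axis=1))

    def gibbs(lam):
        logits = features @ lam
        w = np.exp(logits - logits.max())  # shift for numerical stability
        return w / w.sum()

    for _ in range(n_iter):
        p = gibbs(lam)
        # Gradient of the dual: model moments minus prescribed moments.
        lam -= step * (features.T @ p - moments)

    return gibbs(lam), lam

# Assumed example: a six-sided die whose mean is constrained to 4.5.
faces = np.arange(1, 7)
p, lam = maxent_finite(faces[:, None], [4.5])
print(np.round(p, 4), float(p @ faces))
```

Because the prescribed mean exceeds the uniform mean of 3.5, the multiplier comes out positive and the recovered probabilities increase with the face value. Extending the sketch to several constraints only requires adding columns to `features`, although convergence of plain gradient descent can become slow when the feature columns are strongly correlated.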

Original language	English
Number of pages	29
Journal	Journal of Machine Learning Research
Volume	20
Issue number	138
Publication status	Published - 2019

Keywords

Approximate dynamic programming
Convex optimization
Entropy maximization
Fast gradient method
Relative entropy minimization

Access to Document

17-486 (Final published version, 561 KB), Licence: CC BY

Cite this

@article{f98169ee93fd4a6c881d0a2fb14aea36,
title = "Generalized maximum entropy estimation",
abstract = "We consider the problem of estimating a probability distribution that maximizes the entropy while satisfying a finite number of moment constraints, possibly corrupted by noise. Based on duality of convex programming, we present a novel approximation scheme using a smoothed fast gradient method that is equipped with explicit bounds on the approximation error. We further demonstrate how the presented scheme can be used for approximating the chemical master equation through the zero-information moment closure method, and for an approximate dynamic programming approach in the context of constrained Markov decision processes with uncountable state and action spaces.",
keywords = "Approximate dynamic programming, Convex optimization, Entropy maximization, Fast gradient method, Relative entropy minimization",
author = "Tobias Sutter and David Sutter and Esfahani, {Peyman Mohajerin} and John Lygeros",
year = "2019",
language = "English",
volume = "20",
journal = "Journal of Machine Learning Research",
issn = "1532-4435",
publisher = "Microtome Publishing",
number = "138",
}

TY - JOUR
T1 - Generalized maximum entropy estimation
AU - Sutter, Tobias
AU - Sutter, David
AU - Esfahani, Peyman Mohajerin
AU - Lygeros, John
PY - 2019
Y1 - 2019
N2 - We consider the problem of estimating a probability distribution that maximizes the entropy while satisfying a finite number of moment constraints, possibly corrupted by noise. Based on duality of convex programming, we present a novel approximation scheme using a smoothed fast gradient method that is equipped with explicit bounds on the approximation error. We further demonstrate how the presented scheme can be used for approximating the chemical master equation through the zero-information moment closure method, and for an approximate dynamic programming approach in the context of constrained Markov decision processes with uncountable state and action spaces.

KW - Approximate dynamic programming
KW - Convex optimization
KW - Entropy maximization
KW - Fast gradient method
KW - Relative entropy minimization
UR - http://www.scopus.com/inward/record.url?scp=85077514838&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:85077514838
SN - 1532-4435
VL - 20
JO - Journal of Machine Learning Research
JF - Journal of Machine Learning Research
IS - 138
ER -
