Markov Decision Processes
Discrete Stochastic Dynamic Programming
M.L. Puterman(Author)
Wiley (Publisher)
Published on 29. April 1994
Book
Hardback
672 pages
978-0-471-61977-2 (ISBN)
Article exhausted; check different version
Description
A Markov chain is a sequence of events where the probability of each event is dependent on the event immediately preceding it, but independent of earlier events. Models and numerical equations are used to describe the patterns. This process is particularly useful in operations research and decision science for plotting the sequence of actions which will cause a system to perform optimally. This study provides a unified treatment of the theory, applications and computational methods for Markov decision processes. Important topics featured include action elimination methods, value iteration in the average reward case and sensitive discount optimality.
More details
Series
Language
English
Place of publication
New York
United States
Publishing group
John Wiley and Sons Ltd
Target group
College/higher education
Professional and scholarly
Dimensions
Height: 242 mm
Width: 162 mm
Weight
1049 gr
ISBN-13
978-0-471-61977-2 (9780471619772)
Copyright in bibliographic data is held by Nielsen Book Services Limited or its licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

E-Book
08/2014
Wiley
€120.99
Available for download

E-Book
11/2009
Wiley
€120.99
Available for download

Book
02/2005
Wiley
€160.50
Shipment within 10-20 days
Content
Model Formulation. Examples. Finite--Horizon Markov Decision Processes. Infinite--Horizon Models: Foundations. Discounted Markov Decision Problems. The Expected Total--Reward Criterion. Average Reward and Related Criteria. The Average Reward Criterion--Multichain and Communicating Models. Sensitive Discount Optimality. Continuous--Time Models. Afterword. Notation. Appendices. Bibliography. Index.