Operations & Supply Chain Management

See the latest research, articles and faculty on the Operations & Supply Chain Management Area of Expertise at Columbia Business School.

Latest on Operations & Supply Chain Management

No articles have been found by those filters.

Operations & Supply Chain Management Faculty

CBS Faculty Research on Operations & Supply Chain Management

Discounted and undiscounted value-iteration in Markov decision processes: A survey

Authors: Awi Federgruen and Paul Schweitzer

Date: January 1, 1979

Format: Chapter

Book: Dynamic Programming and its Applications

A survey is given of the present state of the art of value-iteration and related successive approximation methods, as well as of resulting turnpike properties, in both the discounted and undiscounted version of finite state and action Markov Decision Problems.

Authors: Awi Federgruen, A. Hordijk, and H. C. Tijms

Date: December 1, 1978

Format: Journal Article

Journal: Journal of Applied Probability

In this paper we consider a set of denumerable stochastic matrices where the paramter set is a compact metric space. We give a number of simultaneous recurrence conditions on the stochastic matrices and establish equivalences between these conditions. The results obtained generalize corresponding results in Markov chain theory to a considerable extent and have applications in stochastic control problems.

Authors: Paul Schweitzer and Awi Federgruen

Date: November 1, 1978

Format: Journal Article

Journal: Mathematics of Operations Research

This paper investigates the solutions to the functional equations that arise inter alia in Undiscounted Markov Renewal Programming. We show that the solution set is a connected, though possibly nonconvex set whose members are unique up to the n* constants, characterize n* and show that some of these n* degrees of freedom are locally rather than globally independent.

Authors: Awi Federgruen, Paul Schweitzer, and H. C. Tijms

Date: October 1, 1978

Format: Journal Article

Journal: Journal of Mathematical Analysis and Applications

This paper is concerned with the properties of the value-iteration operator which arises in undiscounted Markov decision problems. We give both necessary and sufficient conditions for this operator to reduce to a contraction operator, in which case it is easy to show that the value-iteration method exhibits a uniform geometric convergence rate.

Authors: Paul Schweitzer and Awi Federgruen

Date: June 15, 1978

Format: Journal Article

Journal: Journal of Mathematical Analysis and Applications

An example for undiscounted multichain Markov Renewal Programming shows that policies may exist such that the Policy Iteration Algorithm (PIA) can converge to these policies for some (but not all) choices of the additive constants in the relative values, and as a consequence that the PIA may cycle if the relative values are improperly determined.

Authors: Awi Federgruen and H. C. Tijms

Date: June 1, 1978

Format: Journal Article

Journal: Journal of Applied Probability

This paper is concerned with the optimality equation for the average costs in a denumerable state semi-Markov decision model. It will be shown that under each of a number of recurrency conditions on the transition probability matrices associated with the stationary policies, the optimality equation has a bounded solution. This solution indeed yields a stationary policy which is optimal for a strong version of the average cost optimality criterion.

Authors: Awi Federgruen

Date: January 1, 1978

Format: Journal Article

Journal: Advances in Applied Probability

This paper considers non-cooperative N-person stochastic games with a countable state space and compact metric action spaces. We concentrate upon the average return per unit time criterion for which the existence of an equilibrium policy is established under a number of recurrency conditions with respect to the transition probability matrices associated with the stationary policies.

Authors: Awi Federgruen, A. Hordijk, and H. C. Tijms

Date: January 1, 1978

Format: Chapter

Book: Dynamic Programming and Its Applications

This paper considers an undiscounted semi-Markov decision problem with denumerable state space and compact metric action spaces. Recurrence conditions on the transition probability matrices associated with the stationary policies are considered and relations between these conditions are established. Also it is shown that under each of these conditions the optimality equation for the average costs has a bounded solution.

Authors: Paul Schweitzer and Awi Federgruen

Date: November 1, 1977

Format: Journal Article

Journal: Mathematics of Operations Research

This paper considers undiscounted Markov Decision Problems. For the general multichain case, we obtain necessary and sufficient conditions which guarantee that the maximal total expected reward for a planning horizon of n epochs minus n times the long run average expected reward has a finite limit as n approaches infinity for each initial state and each final reward vector. In addition, we obtain a characterization of the chain and periodicity structure of the set of one-step and J-step maximal gain policies.

Operations & Supply Chain Management

Latest on Operations & Supply Chain Management

Operations & Supply Chain Management Faculty

CBS Faculty Research on Operations & Supply Chain Management

Discounted and undiscounted value-iteration in Markov decision processes: A survey

A note on simultaneous recurrence conditions on a set of denumerable stochastic matrices

The functional equations of undiscounted Markov renewal programming

Contraction mappings underlying undiscounted Markov decision problems

Foolproof convergence in multichain policy iteration

The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms

On N person stochastic games with denumerable state space

Recurrence Conditions in Denumerable State Markov Decision Processes

The asymptotic behavior of undiscounted value iteration in Markov decision problems

Latest on Operations & Supply Chain Management

Operations & Supply Chain Management Faculty

CBS Faculty Research on Operations & Supply Chain Management

Discounted and undiscounted value-iteration in Markov decision processes: A survey

A note on simultaneous recurrence conditions on a set of denumerable stochastic matrices

The functional equations of undiscounted Markov renewal programming

Contraction mappings underlying undiscounted Markov decision problems

Foolproof convergence in multichain policy iteration

The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms

On N person stochastic games with denumerable state space

Recurrence Conditions in Denumerable State Markov Decision Processes

The asymptotic behavior of undiscounted value iteration in Markov decision problems

External CSS

Homepage Breadcrumb Block