Informal Systems Seminar (ISS)

Private and Common Information States for Dynamic Programming of POMDPs With Delayed Sharing Patterns

May 8, 2026, 10:00 to 11:00

Charalambos D. Charalambous, University of Cyprus, Cyprus

Hybrid seminar at McGill University or on Zoom.

Interest in developing a dynamic programming (DP) approach for multiagent decentralized stochastic optimal control with delayed sharing information patterns dates to the early 1970s, with the appearance of Witsenhausen's seminal 1971 paper on the separation of estimation and control. Most previous studies focused on a single value function (and the corresponding DP equation), conditioned on the shared or common information of all controllers or agents.

In this talk, I will present a new generalized DP framework based on a decentralized team equilibrium concept, called Person-by-Person (PbP) optimality in static team theory. Each agent is assigned an individual value function conditioned on that agent's delayed sharing information pattern, while the strategies of all other agents are held fixed.

I will introduce several new DP equations that characterize decentralized team equilibrium, with emphasis on the role of the private and common information components of each agent's information pattern in reducing complexity and retaining the key properties of the centralized DP equations of partially observable Markov decision problems (POMDPs):

1) the optimization is over the agents' action spaces rather than their strategy spaces,
2) each agent compresses its data into a private information state, and
3) there is a centralized information state that is common to all agents.

The new DP framework quantifies a conceptual property of optimal strategies, namely that they compress their data, which was initially envisioned by H. Witsenhausen in his paper "Separation of estimation and control for discrete time systems," Proceedings of the IEEE, vol. 59, no. 11, pp. 1557-1566, 1971.


Biography: Prof. Charalambos D. Charalambous is a faculty member in the ECE Department of the University of Cyprus. He was an Associate Professor at the University of Ottawa from 1999 to 2003, and served on the faculty of McGill University, Department of Electrical and Computer Engineering, as a non-tenure faculty member from 1995 to 1999. He is currently editor at large of Mathematics of Control, Signals and Systems, and has served on several editorial boards. His research focuses on stochastic control, estimation, decision, information theory, optimization of stochastic systems subject to ambiguity, decentralized stochastic games of control with asymmetry of information, and their applications to networks.

Peter E. Caines, organizer
Aditya Mahajan, organizer
Shuang Gao, organizer
Borna Sayedana, organizer

Location

Room MC 437
CIM
McConnell Building
McGill University
3480, rue University
Montréal QC H3A 0E9
Canada

Associated organization

Centre for Intelligent Machines (CIM)