Cahiers : GERAD

Jan 2025

G-2025-07
Asymptotic normality of cumulative cost in linear quadratic regulators

Borna Sayedana, Peter E. Caines, and Aditya Mahajan

The central limit theorem is a fundamental result in probability theory that characterizes the distribution of deviation from the mean in the law of large nu...

BibTeX reference

Nov 2023

G-2023-57
Asymmetric actor-critic with approximate information state

Amit Sinha and Aditya Mahajan

Reinforcement learning (RL) for partially observable Markov decision processes (POMDPs) is a challenging problem because decisions need to be made based on t...

BibTeX reference

Feb 2023

G-2023-05
Strong consistency and rate of convergence of switched least squares system identification for autonomous Markov jump linear systems

Borna Sayedana, Mohammad Afshari, Peter E. Caines, and Aditya Mahajan

In this paper, we investigate the problem of system identification for autonomous Markov jump linear systems (MJS) with complete state observations. We prop...

BibTeX reference

Apr 2021

G-2021-26
Maintenance of a collection of machines under partial observability: Indexability and computation of Whittle index

Nima Akbarzadeh and Aditya Mahajan

We consider the problem of scheduling maintenance for a collection of machines under partial observations when the state of each machine deteriorates stochas...

BibTeX reference

Jun 2020

G-2020-34
Restless bandits: Indexability and computation of Whittle index

Nima Akbarzadeh and Aditya Mahajan

Restless bandits are a class of sequential resource allocation problems concerned with allocating one or more resources among several alternative processes...

BibTeX reference

Mar 2019

G-2019-18
Reinforcement learning in stationary mean-field games

Jayakumar Subramanian and Aditya Mahajan

Multi-agent reinforcement learning has made significant progress in recent years, but it remains a hard problem. Hence, one often resorts to developing lea...

BibTeX reference

Apr 2018

G-2018-28
Renewal Monte Carlo: Renewal theory based reinforcement learning

Jayakumar Subramanian and Aditya Mahajan

In this paper, we present an online reinforcement learning algorithm, called Renewal Monte Carlo (RMC), for infinite horizon Markov decision processes with ...

BibTeX reference

Apr 2017

G-2017-29
Static teams with common information

Mohammad Afshari and Aditya Mahajan

We consider a static team problem in which agents observe correlated Gaussian observations and seek to minimize a quadratic cost. It is assumed that the ob...

BibTeX reference

Jun 2016

G-2016-40
Structural results for two-user interactive communication

Jhelum Chakravorty and Aditya Mahajan

In this paper we consider an interactive communication system with two users, who sequentially observe two correlated sources, and send the quantized observa...

BibTeX reference

Dec 2015

G-2015-132
Privacy-optimal strategies for smart metering systems with a rechargeable battery

Simon Li, Ashish Khisti, and Aditya Mahajan

In smart-metered systems, fine-grained power demand data (load profile) is communicated from a user to the utility provider. The correlation of the load pr...

BibTeX reference

Nov 2015

G-2015-121
Mean field linear quadratic teams

Jalal Arabneydi and Aditya Mahajan

In this paper, we investigate team optimal control of a population of heterogeneous LQ (Linear Quadratic) agents. The population consists of finite distinct...

BibTeX reference

Jul 2015

G-2015-67
On computing optimal thresholds in decentralized sequential hypothesis testing

Can Cui and Aditya Mahajan

Decentralized sequential hypothesis testing refers to a generalization of Wald's sequential hypothesis testing setup in which multiple decision makers make ...

BibTeX reference

May 2015

G-2015-53
Fundamental limits of remote estimation of Markov processes under communication constraints

Jhelum Chakravorty and Aditya Mahajan

The fundamental limits of remote estimation of Markov processes under communication constraints are presented. The remote estimation system consists of a sen...

BibTeX reference

Dec 2014

G-2014-104
Distortion-transmission trade-off in real-time transmission of Markov sources

Jhelum Chakravorty and Aditya Mahajan

The problem of optimal real-time transmission of a Markov source under constraints on the expected number of transmissions is considered, both for the discou...

BibTeX reference

Nov 2014

G-2014-87
Decentralized stochastic control

Aditya Mahajan and Mehnaz Mannan

Decentralized stochastic control refers to the multi-stage optimization of a dynamical system by multiple controllers that have access to different informati...

BibTeX reference

Nov 2014

G-2014-86
Sufficient statistics for linear control strategies in decentralized systems with partial history sharing

Aditya Mahajan and Ashutosh Nayyar

In decentralized control systems with linear dynamics, quadratic cost, and Gaussian disturbance (also called decentralized LQG systems) linear control strate...

BibTeX reference

GERAD

Aditya Mahajan

Cahiers du GERAD

16 results — page 1 of 1