Retour aux activités
Séminaire informel de théorie des systèmes (ISS)

Thompson sampling in online decision-making


3 déc. 2018   14h00 — 15h00

Yi Ouyang Chercheur, Preferred Networks, Inc., États-Unis

In an online decision-making system, historical data is used to determine the current decision. But at the same time, the results of online decisions are also collected and fed back into the system for future utilization. As a result, the design of efficient online decision-making algorithms should not only optimize according to past information but also aim to generate useful data. The key challenge lies in balancing between exploiting what is known to maximize the immediate outcome and investing to explore new information that may improve future performance. Thompson sampling is a systematic method that balances the exploration-exploitation tradeoff. It has shown strong empirical performance in certain domains and also achieved provable optimal performance in some decision-making problems. In this talk, I will describe how Thompson sampling is used in various applications, and I will also discuss its limitations and potential ways for improvements.

Entrée gratuite.
Bienvenue à tous!

Peter E. Caines responsable
Aditya Mahajan responsable
Dena Firoozi responsable


Salle MC 437
Pavillon McConnell
Université McGill
3480, rue University Montréal QC H3A 0E9 Canada

Axes de recherche

Applications de recherche