Department of Mathematics - Seminar on Statistics - Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning | School of Science - The Hong Kong University of Science and Technology

18 Jul 2023

4pm - 5pm

Seminar, Lecture, Talk

Science & Technology

In this paper, we propose a robust policy evaluation algorithm in reinforcement learning, to feature outlier contamination and heavy-tailed reward distributions. We further develop a fully-online method to conduct statistical inference for the modeling parameters. Our method converges faster to the minimum asymptotic variance than the classical temporal difference (TD) learning and avoids the selection of the step sizes. Numerical experiments are provided on the effectiveness of the proposed algorithm in real-world reinforcement learning experiments, which highlight the efficiency and robustness of our approach when compared to the existing online bootstrap method. This work is joint with Jiyuan Tu (SUFE), Xi Chen (NYU), and Weidong Liu (SJTU).

18 Jul 2023

4pm - 5pm

Where

Room 2303 (Lifts 17/18)

Speakers/Performers

Prof. Yichen ZHANG
Purdue University

Organizer(S)

Department of Mathematics

Contact/Enquiries

Payment Details

Audience

Alumni, Faculty and staff, PG students, UG students

Language(s)

English

Other Events

24 May 2024

Seminar, Lecture, Talk

IAS / School of Science Joint Lecture - Confinement Controlled Electrochemistry: Nanopore beyond Sequencing

Abstract Nanopore electrochemistry refers to the promising measurement science based on elaborate pore structures, which offers a well-defined geometric confined space to adopt and characterize sin...

13 May 2024

Seminar, Lecture, Talk

IAS / School of Science Joint Lecture – Expanding the Borders of Chemical Reactivity

Abstract The lecture will demonstrate how it has been possible to expand the borders of cycloadditions beyond the “classical types of cycloadditions” applying organocatalytic activation principles....