*This presentation will be recorded.
Data-driven methodology is a pillar of real-world decision-making. When applying statistical learning methods, puzzling phenomena arise in choosing estimators, tuning their parameters, and characterizing bias-variance trade-offs. In many settings, asymptotic and/or worst-case theory fails to provide the relevant guidance, so a more refined approach, both non-asymptotic and instance-optimal, is required.
In this talk, I present some recent advances in optimal procedures for statistical decision-making. The bulk of the talk focuses on function approximation methods for policy evaluation in reinforcement learning. I describe a novel class of optimal, instance-dependent oracle inequalities for projected Bellman equations, as well as efficient Markovian stochastic approximation procedures that achieve these guarantees. These instance-dependent results can guide parameter tuning for temporal difference learning. Drawing on this perspective, I then discuss instance-dependent optimal methods for off-policy estimation in contextual bandits, and illustrate how the bias-variance trade-off for decision-making can differ substantially from that in statistical learning.
Wenlong Mou is a Ph.D. student in the Department of EECS at UC Berkeley, advised by Martin Wainwright and Peter Bartlett. Prior to Berkeley, he received his B.Sc. degree in Computer Science from Peking University. Wenlong's research interests include statistics, machine learning theory, dynamic programming and optimization, and applied probability. He is particularly interested in designing optimal statistical methods for data-driven decision-making, powered by efficient computational algorithms.