CVSL Logo
FrancaisHome
AboutPeopleResearchPublicationsEventsProfile
About
REPARTI Seminars


The REPARTI Seminars at Université Laval are held on Fridays at 11:30 a.m.
Please see the program for more details.
Dec 14 2017 1:30PM
Seminar
Deep 6-DOF Tracking
Dec 15 2017 11:00AM
Seminar
Building and Evaluating Data-Driven Neural Dialogue Systems

 

 

 

REPARTI

MIVIM

Oct 8 2010 11:30AM

Hamid Chinaei
Laboratoire DAMAS
Département d'informatique et de génie logiciel, U. Laval


Inverse Reinforcement Learning for Dialogue Management



Abstract:

Reinforcement Learning (RL) is a learning and planning method which can be used as a formal optimization framework for minimizing a cost function. In particular, it can be used in dialogue management for learning dialogue plans where we deal with the problem of curse of dimensionality in dialog situations. As a result, also the learned dialogue plans are more detailed compared to the hand-crafted ones.

However, the numerical assignment of the costs in RL framework is itself hand-crafted. As such, Inverse Reinforcement Learning (IRL) tries to first learn the cost function based on expert behaviour. That is, the cost function which the expert designers implicitly tried to optimize, and is inherent in the dialog trajectories. In this presentation, I briefly go through reinforcement learning for dialogue management followed by what Inverse Reinforcement Learning is. Then, I describe two IRL algorithms: the first one uses a complete 'oracle' model of the user and system, and the second is designed to use only logs of users interacting with a deployed system.. Finally, I explain the experiments we did on toy problems followed by discussion at the end.


The CVSL seminars are held on Fridays in room PLT-1120.




     
   
   

©2002-. Computer Vision and Systems Laboratory. All rights reserved