Recent Advances in Reinforcement Learning Workshop 2015

March 23-28, 2015


Explaining intelligent behavior in biological organisms has been one of holy grails of artificial intelligence (AI) research. Reinforcement learning (RL) started out as a model of learning in biological systems and today has grown to be one of the important paradigms of intelligent system design drawing ideas from varied fields such as neuroscience, psychology, control theory, operations research and AI. In turn reinforcement learning has had significant impact in many domains – it is the most popular model of learning in computational neuroscience and is used to explain different phenomena observed in the brain; reinforcement learning methods have been used to build AI agents in domains which were traditionally regarded as very hard, such as the game of Go; RL has changed the traditional approach to adaptive optimal control theory by introducing newer ways of modeling system dynamics; and in robotics, RL is the primary learning paradigm used for training autonomous agents. From the early beginnings as a theory of behavioral psychology, over three decades RL has grown into a mathematically sophisticated field with rigorous underpinnings drawn from different disciplines. This workshop will introduce the participants to the basic concepts of reinforcement learning as well as more recent exciting results in the field from the leaders in the community.

Venue : ICSR Building, Hall-2

RARL Workshop 2015

Indian Institute of Technology, Madras, India
March 23-28, 2015

Schedule

Monday, March 23
09:00 - 10:30 Richard Sutton
University of Alberta
The Grand Ambition of Reinforcement Learning and Artificial Intelligence
Video
10:30 - 11:00 Coffee Break
11:00 - 12:30 Csaba Szepesvári
University of Alberta
Online Learning
Video
12:30 - 14:00 Lunch
14:00 - 15:30 Sridhar Mahadevan
University of Massachusetts Amherst
Representation Discovery in Reinforcement Learning: A 60 year adventure from IBM's Arthur Samuel to Google's Deep Mind
15:30 - 16:00 Coffee Break
16:00 - 17:30 Vivek Borkar
Indian Institute of Technology Bombay
Q-learning without Stochastic Approximation
slides Video
Tuesday, March 24
09:00 - 10:30 Richard Sutton
University of Alberta
New Algorithms for Off-policy Temporal-difference Learning                                                                                                                  
Video
10:30 - 11:00 Coffee Break
11:00 - 12:30 Csaba Szepesvári
University of Alberta
Online Learning
Video
12:30 - 14:00 Lunch
14:00 - 15:30 Sridhar Mahadevan
University of Massachusetts Amherst
Mirror Mirror on the Wall: Which is the ``Best" Reinforcement Learning Algorithm of them All?
Video
15:30 - 16:00 Coffee Break
16:00 - 17:30 Shivaram Kalyanakrishnan
Indian Institute of Technology Bombay
An Improved Bound for MDP Planning
Wednesday, March 25
09:00 - 10:30 Richard Sutton
University of Alberta
Learning to Predict with Independent-of-span Computational Complexity                                                                                                                  
Video
10:30 - 11:00 Coffee Break
11:00 - 12:30 Csaba Szepesvári
University of Alberta
Online Learning
Video
12:30 - 14:00 Lunch
14:00 - 15:30 Sridhar Mahadevan
University of Massachusetts Amherst
From Optimization to Equilibration: Artificial Intelligence in the 21st Century
Video
15:30 - 16:00 Coffee Break
16:00 - 17:30 Mausam
Indian Institute of Technology Delhi
Goal-Directed MDPs : Models and Algorithms
slides Video
Thursday, March 26
09:00 - 10:30 Satinder Singh
University of Michigan Ann Arbor
Spectral Learning of Predictive State Representations                                                                                                                                                                                                                                   
Video
10:30 - 11:00 Coffee Break
11:00 - 12:30 Shivaram Kalyanakrishnan
Indian Institute of Technology Bombay
Learning with Imperfect Representations
12:30 - 14:00 Lunch
14:00 - 15:30 Aditya Gopalan
Indian Institute of Science Bangalore
Complex Bandits
Video
15:30 - 16:00 Coffee Break
16:00 - 17:30 Satinder Singh
University of Michigan Ann Arbor
Some applications of RL
Video
Friday, March 27
09:00 - 10:30 Satinder Singh
University of Michigan Ann Arbor
Optimal Rewards                                                                                                                  
Video
10:30 - 11:00 Coffee Break
11:00 - 12:30 Shalabh Bhatnagar
Indian Institute of Science Bangalore
Feature Updation Schemes for Reinforcement Learning
Video
12:30 - 14:00 Lunch
14:00 - 15:30 Aditya Gopalan
Indian Institute of Science Bangalore
Online Learning in Parameterized Markov Decision Processes
Video
15:30 - 16:00 Coffee Break
16:00 - 17:30 Prashanth L.A
On the convergence rate of TD(0) with function approximation: Non-asymptotic bounds in online and batch settings
slides Video
Saturday, March 28
09:00 - 10:30 Balaraman Ravindran
Indian Institute of Technology Madras
Exploiting Spatial Structure in Reinforcement Learning                                                                                                                                                                                                                                   
Video
10:30 - 11:00 Coffee Break
11:00 - 12:30 Shalabh Bhatnagar
Indian Institute of Science Bangalore
The Multi-timescale Q-learning Algorithm
Video
12:30 - 14:00 Lunch

Confirmed Speakers

Vivek Borkar

Sridhar Mahadevan

Satinder Singh

Richard Sutton

Csaba Szepesvari

Shalabh Bhatnagar

Aditya Gopalan

Shivaram Kalyanakrishnan

Balaraman Ravindran

Organizers

Shalabh Bhatnagar

Indian Institute of Science

Balaraman Ravindran

Indian Institute of Technology, Madras

Partners

Location

  • Indian Institute Of Technology, Madras, India