Skip to Content.
Sympa Menu

nl-uiuc - [nl-uiuc] AIIS talk: Satinder Singh ( Dec 3 )

nl-uiuc AT lists.cs.illinois.edu

Subject: Natural language research announcements

List archive

[nl-uiuc] AIIS talk: Satinder Singh ( Dec 3 )


Chronological Thread 
  • From: Yonatan Bisk <bisk1 AT illinois.edu>
  • To: nl-uiuc AT cs.uiuc.edu, aivr AT cs.uiuc.edu, dais AT cs.uiuc.edu, cogcomp AT cs.uiuc.edu, vision AT cs.uiuc.edu, aiis AT cs.uiuc.edu, aistudents AT cs.uiuc.edu, "Girju, Corina R" <girju AT illinois.edu>, Eyal Amir <eyal AT cs.uiuc.edu>
  • Subject: [nl-uiuc] AIIS talk: Satinder Singh ( Dec 3 )
  • Date: Mon, 29 Nov 2010 15:01:54 -0600
  • List-archive: <http://lists.cs.uiuc.edu/pipermail/nl-uiuc>
  • List-id: Natural language research announcements <nl-uiuc.cs.uiuc.edu>

When: Friday December 3 @ 2pm

Where:     3405 SC

Speaker:   Satinder Singh ( http://www.eecs.umich.edu/~baveja/ )

Title:      The Optimal Reward Problem

Abstract:   Impressive results have been obtained by research
approaches to autonomous agents that start with a given reward
function and then focus on developing theory and algorithms for
learning or planning policies that lead to high cumulative reward. In
a departure from this work, we recognize that in many situations the
starting point is an agent designer with a reward function seeking to
build an autonomous agent to act on its behalf.  What reward function
should the designer build into the autonomous agent? In this new view,
setting the parameters (agent's reward function) equal to the given
preferences (designer's reward function) implements a
preferences-parameters confound. If an agent is bounded, as most
agents are in practice, we expect that breaking the
preferences-parameters confound would be beneficial. We define the
optimal reward problem, that of designing the agent's reward function
given a designer's reward function, an agent architecture, and a
distribution over environments. The main focus of the talk will be on
a discussion of some empirical and theoretical insights obtained by
solving the optimal reward problem
** This is joint work with Jonathan Sorg and Richard Lewis at the
University of Michigan.


Bio: Satinder Singh is a Professor of Computer Science &
Engineering at the University of Michigan and is presently also
serving as AI Lab Director. His research interests include
reinforcement learning, computational game theory and mechanism
design, and artificial intelligence.




  • [nl-uiuc] AIIS talk: Satinder Singh ( Dec 3 ), Yonatan Bisk, 11/29/2010

Archive powered by MHonArc 2.6.16.

Top of Page