Anytime optimal MDP planning with trial-based heuristic tree search

Thomas Keller
Planning and acting in a dynamic environment is a challenging task for an autonomous agent, especially in the presence of uncertain and exogenous effects, a large number of states, and a long-term planning horizon. In this thesis, we approach the problem by considering algorithms that interleave planning for the current state and execution of the taken decision. The main challenge of the agent is to use its tight deliberation time wisely. One solution are determinizations,...
This data repository is not currently reporting usage information. For information on how your repository can submit usage information, please see our documentation.