(주)노바테크 로봇 엔지니어 경력/신입 채용(울산/부산)
MERRIC인
Monte Carlo Tree Search in continuous spaces using Voronoi optimistic optimization with regret bounds
Beomjoon Kim(MIT)
USA | AAAI Technical Track: Planning, Routing, and Sched

■  View full text

Vol. 34 No. 06: AAAI-20 Technical Tracks 6 / AAAI Technical Track: Planning, Routing, and Scheduling

https://ojs.aaai.org/index.php/AAAI/article/view/6546

 

■ Researchers

Beomjoon Kim [MIT], Kyungjae Lee[Seoul National University], Sungbin Lim[Kakao Brain], Leslie Kaelbling[MIT], Tomas Lozano-Perez[MIT]

 

 

■ Abstract

Many important applications, including robotics, data-center management, and process control, require planning action sequences in domains with continuous state and action spaces and discontinuous objective functions. Monte Carlo tree search (MCTS) is an effective strategy for planning in discrete action spaces. We provide a novel MCTS algorithm (voot) for deterministic environments with continuous action spaces, which, in turn, is based on a novel black-box function-optimization algorithm (voo) to efficiently sample actions. The voo algorithm uses Voronoi partitioning to guide sampling, and is particularly efficient in high-dimensional spaces. The voot algorithm has an instance of voo at each node in the tree. We provide regret bounds for both algorithms and demonstrate their empirical effectiveness in several high-dimensional problems including two difficult robotics planning problems.

 

인쇄 Facebook Twitter 스크랩

  전체댓글 0

[로그인]

댓글 입력란
프로필 이미지
0/500자