기계·로봇 연구정보센터

Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior

Beomjoon Kim(MIT)

USA | Proceedings of the Thirty-second Conference on Neu

2018.11.23 | 바로가기

GP-UCB

Cited by 13

■ View full text

Proceedings of the Thirty-second Conference on Neural Information Processing Systems, 2018.11.23

https://arxiv.org/abs/1811.09558

■ Researchers

Beomjoon Kim [MIT], Zi Wang, Beomjoon Kim, Leslie Pack Kaelbling

■ Abstract

Bayesian optimization usually assumes that a Bayesian prior is given. However, the strong theoretical guarantees in Bayesian optimization are often regrettably compromised in practice because of unknown parameters in the prior. In this paper, we adopt a variant of empirical Bayes and show that, by estimating the Gaussian process prior from offline data sampled from the same prior and constructing unbiased estimators of the posterior, variants of both GP-UCB and probability of improvement achieve a near-zero regret bound, which decreases to a constant proportional to the observational noise as the number of offline data and the number of online evaluations increase. Empirically, we have verified our approach on challenging simulated robotic problems featuring task and motion planning.

전체댓글 0

[로그인]

댓글 입력란

0/500자