Model-SelectionforNon-ParametricFunction Approximation in Continuous ControlProblems: A Case Study in aSmartEnergy System

阅读量:

32

作者:

PS Daniel Urieliand

展开

摘要:

This paper investigates the application of value-function-based reinforcement learning to a smart energy control system, specifically the task of controlling an HVAC system to minimize energy while satisfying residents' comfort requirements. In theory, value-function-based reinforcement learning methods can solve control problems such as this one optimally. However, since choosing an appropriate parametric representation of the value function turns out to be difficult, we develop an alternative method, which results in a practical algorithm for value function approximation in continuous state-spaces. To avoid the need to carefully design a parametric representation for the value function, we use a smooth non-parametric function approximator, specifically Locally Weighted Linear Regression (LWR). LWR is used within Fitted Value Iteration (FVI), which has met with several practical successes. However, for efficiency reasons, LWR is used with a limited sample-size, which leads to poor performance without careful tuning of LWR's parameters. We therefore develop an efficient meta-learning procedure that performs online model-selection and tunes LWR's parameters based on the Bellman error. Our algorithm is fully implemented and tested in a realistic simulation of the HVAC control domain, and results in significant energy savings.

展开

会议时间:

2013/09/23

被引量:

2

通过文献互助平台发起求助,成功后即可免费获取论文全文。

相似文献

参考文献

引证文献

辅助模式

0

引用

文献可以批量引用啦~
欢迎点我试用!

引用