Meta-learning strategy based on user preferences and a machine recommendation system for real-time cooling load and COP forecasting 基于用户偏好的元学习策略和用于实时冷却负荷和 COP 预测的机器推荐系统
Wenqiang , Guangcai Gong , Houhua Fan , Pei Peng , Liang Chun Wenqiang , Guangcai Gong , Houhua Fan , Pei Peng , Liang Chun a School of Civil Engineering, Hunan Univ., Changsha 410082, China a 湖南大学土木工程学院,中国长沙 410082 Research and Design Center, Hubei Huayu Hi-Tech Architectural Design Consulting Co., Ltd., Yichang Three Gorges Branch, Yichang 443000, China 湖北华宇高科建筑设计咨询有限公司宜昌三峡分公司研究设计中心,中国宜昌 443000 Hunan Tianyu Energy Technology Co., Ltd., Changsha 410082, China 湖南天宇能源科技有限公司,中国长沙 410082
H I G H L I G H T S
A new meta-learning recommendation system is proposed. 提出了一种新的元学习推荐系统。
The new system concerned two-stage (subjective and objective) user preferences. 新系统涉及两个阶段(主观和客观)的用户偏好。
Multi-objective decision making algorithms (MODMA) are used in option optimization. 多目标决策算法(MODMA)用于期权优化。
A new "walking slide method" aimed at some extremely special cases is proposed. 针对一些极为特殊的情况,提出了一种新的 "行走滑动法"。
The new system is validated on real buildings, and its generalizability is proved. 新系统在真实建筑上得到了验证,其通用性也得到了证明。
A R T I C L E I N F O
Keywords: 关键词:
Cooling load prediction 冷却负荷预测
Meta-learning 元学习
Artificial neural network (ANN) 人工神经网络(ANN)
Recommendation system 推荐系统
User preferences 用户偏好
Abstract 摘要
A B S T R A C T Building data forecasting plays an increasingly important role in building energy savings. However, the one-fitsall model cannot satisfy all the requirements of multiple application scenarios and user preferences. Motivated by the need to bridge the research gap between different user preferences (application scenarios) and energy prediction model recommendation systems, this paper proposes a novel meta-learning strategy based on an artificial neural network recommendation system. This strategy is employed for real-time cooling loads, coefficients of performance prediction and optimal prediction model recommendations. The data set is composed of 40 cases from five factory buildings. After the predictions and recommendations are obtained for all cases, the two-stage user preferences are considered based on multi-objective decision-making algorithms. Then, a new model termed the "walking slide method", is proposed to predict some special cases. This study shows that the seasonal autoregressive integrated moving average model and random forest model achieve the best prediction accuracy and the minimum computation cost separately for most cases, while the long short-term memory is the best model when considering the two criteria. The variances between the different cases lead to a lower crossvalidation score (approximately 65%), but a higher success rate (over 99%) for the recommendation performance. In addition, in the more complex application scenarios, a lower prediction accuracy and recommendation success rate will be obtained. In most cases, the use of a prediction combined with a monitoring system is the best choice. Last, the reliability of the results is verified by application studies. This work provides a scientific basis for energy prediction applications based on user preferences. 建筑数据预测在建筑节能中发挥着越来越重要的作用。然而,千篇一律的模型无法满足多种应用场景和用户偏好的所有要求。为了弥补不同用户偏好(应用场景)和能源预测模型推荐系统之间的研究差距,本文提出了一种基于人工神经网络推荐系统的新型元学习策略。该策略用于实时冷却负荷、性能系数预测和最优预测模型推荐。数据集由来自五座工厂大楼的 40 个案例组成。在获得所有案例的预测和建议后,基于多目标决策算法考虑了两个阶段的用户偏好。然后,提出了一种名为 "行走滑动法 "的新模型,用于预测一些特殊情况。研究表明,在大多数情况下,季节自回归综合移动平均模型和随机森林模型分别达到了最好的预测精度和最小的计算成本,而在考虑这两个标准时,长短期记忆是最好的模型。不同案例之间的差异导致交叉验证得分较低(约 65%),但推荐性能的成功率较高(超过 99%)。此外,在较为复杂的应用场景中,预测准确率和推荐成功率都会较低。在大多数情况下,将预测与监控系统结合使用是最佳选择。最后,应用研究验证了结果的可靠性。这项工作为基于用户偏好的能源预测应用提供了科学依据。
1. Introduction 1.导言
The opposing demands of increasing energy consumption and decreasing total energy storage require more detailed management and use of energy [1]. Energy usage predictions are of great significance for energy savings in buildings which have become one of the largest energy consumers in the world [2], especially large energy consumption buildings (e.g., factories). In heating, ventilation, and air conditioning 日益增长的能源消耗和日益减少的能源储存总量这对矛盾的需求,要求对能源进行更细致的管理和使用[1]。建筑已成为世界上最大的能源消耗之一[2],尤其是高能耗建筑(如工厂),能源使用预测对于建筑节能意义重大。在供暖、通风和空调领域
(HVAC) systems, energy consumption prediction can address the time delay between air-conditioning cooling (heating) load and heat extraction by allowing HVAC equipment to respond in advance to reduce the maximum air-conditioning load demand. Additionally, energy consumption prediction is an important tool for fault detection, diagnosis, and energy control optimization. As a result, many prediction models have been proposed, including meta-learning strategies that automatically learn to select optimal prediction methods. In these (暖通空调(HVAC)系统中,能耗预测可以解决空调制冷(制热)负荷与热量提取之间的时间延迟问题,让暖通空调设备提前做出响应,降低最大空调负荷需求。此外,能耗预测还是故障检测、诊断和能源控制优化的重要工具。因此,人们提出了许多预测模型,包括自动学习选择最佳预测方法的元学习策略。在这些
applications, user preferences are expected to be very important, because a good optimization strategy is based on the actual situations of the users. This section introduces some load prediction models and user preferences as follows. 由于一个好的优化策略要以用户的实际情况为基础,因此用户的偏好就显得非常重要。本节将介绍一些负载预测模型和用户偏好,具体如下。
In general, the modeling methods for short-term building energy prediction can be broadly categorized into three types: physical, statistical, and data-driven models. Physical models (e.g., EnergyPlus [3], TRNSYS [4], ESP-r [5]) can also be thought of as "white-box" models, because they require clear physical principles and a large amount of detailed building information (such as the thermophysical parameters of envelopes and building dimensions). Physical models have the advantage of having a clear calculation process but often require too much relevant information; hence, it is very difficult to use them in building energy management (BEM) systems. In addition, there is another type of physically based "variant" models [6]. The parameters utilized in the "variant" models are available adjusted measured (monitored) data. Statistical models have been mainly based on statistical principles and include multiple linear regression [7], Kalman filtering [8], Box-Jenkins [9], autoregressive integrated moving average (ARIMA) [10], wavelet and other relevant models. Those similar models include the seasonal autoregressive integrated moving average (SARIMA) [11] and wave recurrent neural network (WaveRNN) [12] models. In [13], an ARIMA and a support vector machine (SVM) model were combined. 一般来说,短期建筑能耗预测的建模方法可大致分为三类:物理模型、统计模型和数据驱动模型。物理模型(如 EnergyPlus [3]、TRNSYS [4]、ESP-r [5])也可视为 "白盒 "模型,因为它们需要明确的物理原理和大量详细的建筑信息(如围护结构的热物理参数和建筑尺寸)。物理模型的优点是计算过程清晰,但往往需要过多的相关信息,因此很难在建筑能源管理系统(BEM)中使用。此外,还有另一种基于物理的 "变体 "模型[6]。变体 "模型中使用的参数是经过调整的测量(监测)数据。统计模型主要基于统计原理,包括多元线性回归模型[7]、卡尔曼滤波模型[8]、盒状-詹金斯模型[9]、自回归综合移动平均模型(ARIMA)[10]、小波模型和其他相关模型。类似的模型还包括季节自回归积分移动平均(SARIMA)[11] 和波浪循环神经网络(WaveRNN)[12] 模型。在文献[13]中,ARIMA 模型与支持向量机(SVM)模型相结合。
This hybrid model is also an important part of the statistical models. Commonly, HVAC system data can be regarded as a type of time series [14]. Box and Jenkins discussed this concept in detail in [15] as a basic idea for statistical models. However, statistical models are very sensitive to the intrinsic linkages and laws in data. 这种混合模型也是统计模型的重要组成部分。通常,暖通空调系统数据可被视为一种时间序列[14]。Box 和 Jenkins 在文献[15]中详细讨论了这一概念,并将其作为统计模型的基本思想。然而,统计模型对数据的内在联系和规律非常敏感。
The final (and very important) type of prediction model is the datadriven model, which mainly relies on the operation data of the building HVAC system. The correlations between the inputs (e.g., outdoor meteorological parameters) and outputs (e.g., cooling load and coefficient of performance (COP)) can be determined without studying the internal physical laws affecting the various elements. Thus, data-driven models are also known as "black-box" models. Furthermore, the models can achieve higher accuracy, are more flexible to use, and have higher generalization ability for complex nonlinear correlations than the physical and statistical models. Through literature reading, there are three main aspects of data-driven prediction models for HVAC systems. The first aspect regards the application of some specific data-driven models. The extreme gradient boosting (XGB) method showed superiority for prediction, as described in [16]. Xu [17] proposed a novel long short-term memory (LSTM) model, and the results showed that this model had slight advantages in terms of the level of indoor temperature prediction performance as compared with the SVM model, decision tree model, back-propagation neural network (BPNN) model, 最后一种(也是非常重要的一种)预测模型是数据驱动模型,它主要依赖于建筑暖通空调系统的运行数据。输入(如室外气象参数)和输出(如制冷负荷和性能系数 (COP))之间的相关性可以在不研究影响各要素的内部物理规律的情况下确定。因此,数据驱动模型也被称为 "黑箱 "模型。此外,与物理和统计模型相比,数据驱动模型可以获得更高的精度,使用更灵活,对复杂的非线性关联具有更高的泛化能力。通过文献阅读,暖通空调系统的数据驱动预测模型主要有三个方面。第一个方面是一些特定数据驱动模型的应用。如文献[16]所述,极端梯度提升(XGB)方法在预测方面表现出优越性。Xu[17]提出了一种新型的长短期记忆(LSTM)模型,结果表明,与 SVM 模型、决策树模型、反向传播神经网络(BPNN)模型相比,该模型在室内温度预测性能水平方面略有优势、
and original LSTM model. Fan et al. [18] presented some one-stepahead prediction models (i.e., a direct approach based on recurrent models and RNN, LSTM, gated recurrent unit (GRU), and multi-input and multi-output (MIMO) approaches). This research indicated that the direct approach was the most accurate model, while the GRU was the most cost-effective. Naji et al. [19] developed a method for predicting building energy consumption based on an extreme learning machine 和原始 LSTM 模型。Fan 等人[18]提出了一些领先一步的预测模型(即基于递归模型的直接方法和 RNN、LSTM、门控递归单元(GRU)以及多输入多输出(MIMO)方法)。这项研究表明,直接方法是最精确的模型,而 GRU 则是最具成本效益的方法。Naji 等人[19] 开发了一种基于极端学习机的建筑能耗预测方法。
(ELM) and found that ELM predictions were superior to those of genetic programming (GP) and artificial neural networks (ANNs) by evaluating the root mean square error (RMSE), and values. Some new and improved support vector regression (SVR) methods have also performed well . As discussed above, all of the specific models achieved ideal performances in specific energy prediction studies. The second aspect of data-driven models regards ANN models [22], some (ELM) 的预测结果,通过评估均方根误差 (RMSE)、 和 值,发现 ELM 的预测结果优于遗传编程 (GP) 和人工神经网络 (ANN) 的预测结果。一些新的改进型支持向量回归(SVR)方法也表现出色, 。如上所述,所有具体模型在具体的能源预测研究中都取得了理想的成绩。数据驱动模型的第二个方面涉及到 ANN 模型[22],其中一些模型可以用于能源预测。
Fig. 1. Framework of the proposed strategy. 图 1.拟议战略的框架。
revised ANN models, and ANN hybrid models. Fu [23] investigated a deep neural network for real-time cooling load forecasting (CLF). This model not only exhibited high stability and robustness but also performed better than the BPNN and SVM models. An ANN model and a case-based reasoning (CBR) model were compared in [24], and the results indicated that the ANN model obtained high levels of accuracy. For more accurate load forecasting, various ANN hybrid models were proposed. Among them, the model-based method was one of the most commonly used methods. Similar to the physically-based "variant" models discussed above, this method has two parts: a mathematical part and an optimization algorithm. Thus, various optimization algorithms were combined with ANNs, including the clustering-based ANN [25] and BPNN [26]. These optimization algorithms also included the Bayesian optimization algorithm [27,28], uncertainty algorithms [29,30,31], probability density optimization algorithms [32], and supervisory optimization control algorithms [33], all of which obtained good performance in various HVAC system predictions. Many researchers have concentrated on studying the inputs extraction [34] of the ANN model, which is used to obtain more satisfactory evaluation scores. Principal component analysis (PCA) [35], t-distributed stochastic neighbor embedding (t-SNE) [36], and some other tools can provide substantial help for feature extraction. 修正后的 ANN 模型和 ANN 混合模型。Fu [23] 研究了一种用于实时冷却负荷预测(CLF)的深度神经网络。该模型不仅表现出很高的稳定性和鲁棒性,而且性能优于 BPNN 和 SVM 模型。文献[24]对 ANN 模型和基于案例的推理(CBR)模型进行了比较,结果表明 ANN 模型的准确度较高。为了更准确地进行负荷预测,人们提出了各种 ANN 混合模型。其中,基于模型的方法是最常用的方法之一。与上文讨论的基于物理的 "变体 "模型类似,这种方法包括两个部分:数学部分和优化算法。因此,各种优化算法与 ANN 相结合,包括基于聚类的 ANN [25] 和 BPNN [26]。这些优化算法还包括贝叶斯优化算法[27,28]、不确定性算法[29,30,31]、概率密度优化算法[32]和监督优化控制算法[33],它们都在各种暖通空调系统预测中取得了良好的性能。许多研究人员集中研究了 ANN 模型的输入提取[34],用于获得更令人满意的评估分数。主成分分析(PCA)[35]、t 分布随机邻域嵌入(t-SNE)[36]和其他一些工具可为特征提取提供实质性帮助。
The third aspect of data-driven models concerns meta-learning. The phrase "meta-learning" was first used in [37] in the context of a time series, and it was used to express the process of automatic learning to select prediction models for different tasks and purposes [38,39]. Lemke and Gabrys [40] studied meta-learning for time series prediction, and found that the decision tree was an important tool for metalearning. Cui [41] built a meta-learning model in terms of a building energy model recommendation (BEMR) system, and the results showed that the BEMR could identify the best performance model for of the cases. 数据驱动模型的第三个方面涉及元学习。元学习"(meta-learning)一词最早出现在时间序列的语境中[37],用来表达为不同任务和目的选择预测模型的自动学习过程[38,39]。Lemke 和 Gabrys [40] 研究了时间序列预测的元学习,发现决策树是金属学习的重要工具。Cui[41]在建筑节能模型推荐(BEMR)系统方面建立了元学习模型,结果表明 BEMR 可以为 的案例确定最佳性能模型。
Note that the current investigations largely focus on real case predictions by various models to find the most appropriate model. This finding indicated that in the machine learning field, the one-fits-all modeling approach was limited [42] in fulfilling all of the user requirements, and each model (even traditional SARIMAX [43]) could play an irreplaceable role for a certain case. On the one hand, with the development of machine learning techniques, more new models will be invented, and the rapid developments in big data can offer opportunities for the effective use of these prediction models in HVAC systems. On the other hand, in building energy management (BEM) systems, building data is usually generated once every five minutes to one hour. The trial-and-error process of inputting data into various models is costly, especially when the amount of data is large. Thus, it is better to predict the "best performance model" in advance using the "metalearning" strategy. The perspective of users, the user requirements, the maximum accuracy that meter equipment can provide and the relationships between these factors have a high impact on the function of load prediction and, thus, the user preferences [44,45] regarding load predictions should be considered. Two-stage user preferences are investigated in this study, which includes two aspects and one restriction; the two aspects are the subjective requirements of the approach for the prediction or monitoring systems and the accuracy requirements for the prediction system; the restriction is an investment. To obtain the subjective requirements under the investment restriction, a questionnaire survey will be made for users before the monitoring system installation. Some users could pay more attention to the whole cooling load provided by the HVAC system, and in this case, a prediction system with suitable accuracy can meet the demand. If energy-saving is also expected by the users, the COP value and a high-precision prediction system should be considered. Moreover, the budgets of the users will impose some limits on these demands, even if most of the users wish to pay attention to the cooling load, COP and monitoring systems. Therefore, five options that combine the forecasting and monitoring systems can be proposed based on practical experience: "cooling load prediction"; "COP prediction"; "cooling load/COP prediction"; "cooling load prediction and monitoring"; and "cooling load/COP prediction and monitoring". Based on the above, the main objective of this study can be summarized as the proposal and resolution of two questions: (1) what type of model can achieve the best prediction performance, and does that model meet the requirements of the users?; (2) which option mentioned above is the comprehensive optimal for users under an investment restriction? Naturally, the meta-learning approach and multiobjective decision-making algorithms (MODMA) [46,47] were chosen to resolve these questions in this paper. 需要注意的是,目前的研究主要集中于各种模型对实际案例的预测,以找到最合适的模型。这一发现表明,在机器学习领域,"一刀切 "的建模方法[42]在满足用户所有要求方面是有限的,每个模型(即使是传统的 SARIMAX [43])都可能在特定情况下发挥不可替代的作用。一方面,随着机器学习技术的发展,将会有更多新模型被发明出来,而大数据的快速发展也为在暖通空调系统中有效使用这些预测模型提供了机会。另一方面,在楼宇能源管理系统(BEM)中,楼宇数据通常每五分钟到一小时产生一次。将数据输入各种模型的试错过程成本高昂,尤其是当数据量较大时。因此,使用 "金属学习 "策略提前预测 "最佳性能模型 "是更好的选择。用户的视角、用户的要求、仪表设备所能提供的最大精度以及这些因素之间的关系对负荷预测的功能有很大影响,因此应考虑用户对负荷预测的偏好[44,45]。本研究调查了两个阶段的用户偏好,其中包括两个方面和一个限制条件;两个方面是对预测或监测系统方法的主观要求和对预测系统精度的要求;限制条件是投资。为了获得投资限制下的主观要求,将在监测系统安装前对用户进行问卷调查。有些用户可能更关注暖通空调系统提供的整个冷负荷,在这种情况下,精度合适的预测系统就能满足需求。如果用户还希望节能,则应考虑 COP 值和高精度的预测系统。此外,即使大多数用户希望关注制冷负荷、COP 值和监控系统,用户的预算也会对这些需求造成一定的限制。因此,可以根据实践经验提出五种将预测和监控系统相结合的方案:"冷负荷预测"、"COP 预测"、"冷负荷/COP 预测"、"冷负荷预测与监测 "和 "冷负荷/COP 预测与监测"。基于以上所述,本研究的主要目标可概括为提出并解决两个问题:(1) 什么样的模型能达到最佳预测性能,以及该模型是否满足用户的要求?(2) 在投资限制条件下,上述哪种方案对用户而言是综合最优的?当然,本文选择元学习方法和多目标决策算法(MODMA)[46,47] 来解决这些问题。
The contributions of this article include the following: (1) the research object (i.e., factories) and one of the prediction objects (i.e., COP) have rarely been analyzed in previous studies; most of the investigated building types are office buildings [16,17,20,24,27,36], and most of the prediction objects are cooling loads and electric loads. At the same time, the electricity price is included in the feature pool for the data-driven models. (2) This study proposes a new meta-learning strategy based on a machine recommendation system, which accounts for the two-stage user preferences. (3) A new "walking slide method" aimed at predicting some special cases is proposed based on the recommendation system. (4) Five options that combine the forecasting and monitoring systems are proposed and optimized here by key attributes (i.e., energy-saving potential, prediction accuracy, initial investment, and prediction of computational cost and measurement cost) which has high practicability and provides valuable suggestions for the users. 本文的贡献包括以下几点:(1)研究对象(即工厂)和预测对象之一(即 COP)在以往的研究中鲜有分析;所研究的建筑类型多为办公建筑[16,17,20,24,27,36],预测对象多为冷负荷和电负荷。同时,电价也被纳入数据驱动模型的特征库中。(2)本研究在机器推荐系统的基础上提出了一种新的元学习策略,该策略考虑了用户的两阶段偏好。(3) 基于推荐系统提出了一种新的 "行走滑动法",旨在预测一些特殊情况。(4) 提出了预测与监测系统相结合的五种方案,并根据关键属性(即节能潜力、预测精度、初始投资、预测计算成本和测量成本)进行了优化,具有很高的实用性,可为用户提供有价值的建议。
This paper is organized as follows. Section 2 presents the detailed meta-learning strategy with two-stage user preferences. Section 3 includes five parts: (1) a brief introduction of 40 cases; (2) the performances of the forecasting; (3) the performance of the meta-level learning; (4) the performance when considering the user preferences; and (5) a discussion. Section 4 describes the conclusions. The Appendix A gives a brief introduction to the data-driven forecasting models. 本文的组织结构如下。第 2 节详细介绍了具有两阶段用户偏好的元学习策略。第 3 节包括五个部分:(1) 40 个案例的简要介绍;(2) 预测的性能;(3) 元学习的性能;(4) 考虑用户偏好时的性能;(5) 讨论。第 4 节阐述了结论。附录 A 简要介绍了数据驱动预测模型。
2. Methodology 2.方法论
Following the existing research procedure on meta-learning [41], the meta-learning system proposed in this paper was based on an ANN recommendation system. Analysis of the user preferences, which were investigated rarely before, was based on MODMA algorithms. There were three steps in this system, as shown in Fig. 1. The first step was inputting the data (the COP and cooling load) from cases one to 40 into the six selected common forecasting models; each case was brought into the six models, and a total of 240 regression predictions were made. This step contained data pre-processing, data division, training, prediction, and most importantly, the "best" performance model determination among the six prediction models. The second step was recommendation system modeling, which included the derivation of the meta-features (inputs), data division, training, and prediction. The outputs of the recommendation system were the "best" prediction performance models in step one. The third step was the comprehensive optional process, and the two-stage user preferences was considered, while a new "walking slide method" was proposed to make predictions for those special cases that could not meet the user's accuracy requirements. Last, by both taking objective and subjective properties into account when using MODMA, the optimal options of the comprehensive attributes could be selected, and the whole strategy could be built. This section presents two parts: 1) a description of a metalearning strategy combined with user preferences, 2) the "walking slide method". 根据现有的元学习研究程序[41],本文提出的元学习系统基于 ANN 推荐系统。对用户偏好的分析是基于 MODMA 算法的,这在以前的研究中很少见。如图 1 所示,该系统分为三个步骤。第一步是将案例 1 至 40 的数据(COP 和冷负荷)输入到所选的六个常见预测模型中;每个案例都被输入到六个模型中,总共进行了 240 次回归预测。这一步包括数据预处理、数据分割、训练、预测,最重要的是在六个预测模型中确定性能 "最佳 "的模型。第二步是推荐系统建模,包括元特征(输入)的推导、数据分割、训练和预测。推荐系统的输出是第一步中的 "最佳 "预测性能模型。第三步是综合可选过程,考虑了两阶段的用户偏好,同时提出了一种新的 "行走滑动法",对无法满足用户准确性要求的特殊情况进行预测。最后,在使用 MODMA 时兼顾客观和主观属性,选择综合属性的最优选项,构建整体策略。本节将介绍两个部分:1)结合用户偏好的金属学习策略描述;2)"行走滑动法"。
2.1. Meta-learning strategy with user preferences 2.1.带用户偏好的元学习策略
2.1.1. Forecasting models 2.1.1.预测模型
(1) Statistical models (1) 统计模型
In some less-complicated conditions, statistical forecasting models can play a suitable role in some simple cooling load forecasting tasks. Usually, HVAC data have strong regularity, and, possible simple application scenarios should be considered in step one. Thus, two traditional methods are introduced in this paper: The time series method [10] and the wavelet analysis method [48]. These two methods are introduced in the Appendix A. 在一些不太复杂的条件下,统计预测模型可以在一些简单的冷负荷预测任务中发挥适当的作用。通常情况下,暖通空调数据具有很强的规律性,而且在第一步中就应考虑到可能的简单应用场景。因此,本文介绍了两种传统方法:时间序列法 [10] 和小波分析法 [48]。附录 A 介绍了这两种方法。
(2) Machine learning models (2) 机器学习模型
Machine learning could be very useful under conditions in which the connections between the inputs and outputs are highly nonlinear. Considering the possible complex application scenarios in terms of the user preferences, six common models were explored: WNN, SARIMAX, Elman recurrent neural network (Elman RNN) [49], random forest (RF), long short-term memory recurrent neural network (LSTM-RNN), and SVR, some of which are introduced in Appendix A. 在输入和输出之间的联系高度非线性的情况下,机器学习可能非常有用。考虑到用户偏好方面可能存在的复杂应用场景,我们探索了六种常见模型:WNN、SARIMAX、Elman 循环神经网络(Elman RNN)[49]、随机森林(RF)、长短期记忆循环神经网络(LSTM-RNN)和 SVR,附录 A 介绍了其中一些模型。
Machine learning can be summarized as using the right features to build the right model to complete the given tasks. The first and important part of a machine learning process is the feature selection. In this work, PCA was commonly used to extract the main inputs. By calculating the Pearson correlation coefficient ( ) (as shown in Eq. (1) below and [50]) between the principal component and the original inputs, a coefficient color map was generated. Then, the corresponding principal components could be found. For example, if the top seven components explained at least of the entire original data set, the top seven components were deemed to be principal components. A detailed presentation of PCA could be found in [51]. 机器学习可以概括为使用正确的特征来建立正确的模型,从而完成给定的任务。机器学习过程的第一个重要部分是特征选择。在这项工作中,通常使用 PCA 来提取主要输入。通过计算主成分与原始输入之间的皮尔逊相关系数( )(如下式(1)和文献[50]所示),生成系数色图。然后,就可以找到相应的主成分。例如,如果前七个分量至少解释了整个原始数据集的 ,那么前七个分量就被认为是主分量。关于 PCA 的详细介绍,请参阅 [51]。
In the above equation, is the true value, is the predicted value, and is the number of data points. 在上式中, 是真实值, 是预测值, 是数据点数。
The second part of machine learning is data training. In this study, a cross-validation method [52] was utilized to randomly split the training data and testing data to examine the performance of the data using different prediction models. 机器学习的第二部分是数据训练。本研究采用交叉验证法[52],随机分割训练数据和测试数据,以检验数据在不同预测模型下的表现。
The final part of machine learning is testing (predicting). To evaluate the performance, the normalized root mean square error (NRMSE) and computation time (e) of each case are used as the evaluation criteria. NRMSE is defined in Eq. (2), and the computation time (e) includes both the training and prediction processes. 机器学习的最后一部分是测试(预测)。为了评估性能,我们使用每个案例的归一化均方根误差(NRMSE)和计算时间(e)作为评估标准。NRMSE 在公式 (2) 中定义,计算时间 (e) 包括训练和预测过程。
where is the true value, is the predicted value, and is the number of data points or instances. Here, and are the maximum and minimum value of . 其中, 是真实值, 是预测值, 是数据点或实例数。这里, 和 是 的最大值和最小值。
2.1.2. Recommendation system 2.1.2.建议系统
A recommendation system is the core component of a meta-learning strategy. The decision tree [40], XGB [16] and ANN [41] (called "metalearners") could be utilized for meta-learning systems. The ANN-based meta-learning system is more common in HVAC systems. 推荐系统是元学习策略的核心组成部分。决策树 [40]、XGB [16] 和 ANN [41](称为 "金属学习器")可用于元学习系统。基于 ANN 的元学习系统在暖通空调系统中更为常见。
(1) Modeling of the artificial neural network (ANN) (1) 人工神经网络(ANN)建模
The parameter settings of the ANN were as follows: the number of hidden neurons: inputs + outputs) [53]; inputs was the number of input layer units, outputs was the number of output layer units, and the number of training patterns was the number of training samples. In this study, the number of hidden layers was 8 . The hidden layer activation function was "Relu", and the output layer activation function was "Softmax". There were six output neurons. The loss function was "categorical cross-entropy", and the optimizer was stochastic gradient descent (SGD) [54]. In that regard, the SGD trained with a single sample as a training unit can calculate faster. The ANN model was built in a Python environment using the TensorFlow library. The inputs of the proposed ANN were the meta-features selected by the PCA method, and the output was the best prediction performance model of the six regression models in this work. The "Label Encoder" tool was used to code the prediction models with the best performance. ANN 的参数设置如下:隐神经元数: inputs + outputs) [53];输入为输入层单元数,输出为输出层单元数,训练模式数为训练样本数。在本研究中,隐层数为 8。隐层激活函数为 "Relu",输出层激活函数为 "Softmax"。共有 6 个输出神经元。损失函数为 "分类交叉熵",优化器为随机梯度下降(SGD)[54]。在这方面,以单个样本为训练单元进行训练的 SGD 计算速度更快。ANN 模型是在 Python 环境中使用 TensorFlow 库构建的。拟建 ANN 的输入为 PCA 方法选出的元特征,输出为本研究中六个回归模型中预测性能最好的模型。使用 "标签编码器 "工具对性能最佳的预测模型进行编码。
(2) The first stage of the user preferences (2) 用户偏好的第一阶段
As discussed in Section 1, the user preferences in different application scenarios cannot be ignored before a single recommendation system is introduced to the users. For example, when the total cooling capacity is not too large in some small office buildings, the users might only be concerned with whether the cooling load of the HVAC system is stable. However, in some larger factories with large cooling load demands, the users must not only be concerned with the total cooling load but also with the COP of the HVAC system. In some rigorous environments (such as hospitals), users must also ensure the normal operation of the HVAC system; hence, a monitoring system is necessary. At the same time, the budget of the users will impose some limits on the demands. Based on these observations, five different options were provided for the first stage of the user preferences: Option 1, cooling load prediction; Option 2, COP prediction; Option 3, Cooling load and COP prediction; Option 4, cooling load prediction and monitoring; and Option 5, cooling load/COP prediction and monitoring. These five options are similar to those of the five packages for the users, where the benefit and cost (calculated in Section 3.4.1) of each package are different. The most appropriate option for users cannot be determined in a simple manner. 如第 1 节所述,在向用户推出单一推荐系统之前,不能忽视不同应用场景下的用户偏好。例如,在一些小型办公楼中,当总制冷量不太大时,用户可能只关心暖通空调系统的冷负荷是否稳定。但在一些冷负荷需求较大的大型工厂,用户不仅要关注总冷负荷,还要关注暖通空调系统的 COP。在一些严格的环境中(如医院),用户还必须确保暖通空调系统的正常运行,因此,监控系统是必要的。同时,用户的预算也会对需求造成一定的限制。根据上述观察结果,在用户偏好的第一阶段提供了五个不同的方案:方案 1,冷负荷预测;方案 2,COP 预测;方案 3,冷负荷和 COP 预测;方案 4,冷负荷预测和监控;方案 5,冷负荷/COP 预测和监控。这五个方案与用户的五个套餐相似,但每个套餐的效益和成本(在第 3.4.1 节中计算)不同。无法简单地确定最适合用户的方案。
MODMA was utilized to connect the objective performances (NRMSE and e) of the models with the subjective user preferences, and was also combined with the recommendation system. A theoretical algorithm for MODMA with unknown weights is expressed briefly below: 利用 MODMA 将模型的客观性能(NRMSE 和 e)与用户的主观偏好联系起来,并与推荐系统相结合。下文简要介绍了权重未知的 MODMA 理论算法:
Some assumptions were made, as presented in Table 1. After the scheme (option) set, properties set, attribute weights vector, and subjective utility preference information vector were built, a primitive decision matrix could be made. With different attribute evaluation types for different properties, a normalized decision matrix based on the original decision information could be established; the normalizing process is expressed in Eqs. (3) and (4). Last, the fuzzy comprehensive attribute value for every scheme (option) could also be obtained by the additive weighting method. 表 1 列出了一些假设。在建立了方案(选项)集、属性集、属性权重向量和主观效用偏好信息向量之后,就可以建立原始决策矩阵 。针对不同属性的不同属性评价类型,可以建立基于原始决策信息的归一化决策矩阵 ;归一化过程用公式(3)和(4)表示。最后,还可以通过加权法得到每个方案(选项)的模糊综合属性值。
Table 1 表 1
Introduction of parameters in MODMA. 在 MODMA 中引入参数。
Parameters
Expression
Remarks
set
set
-
scheme (option) set
properties set
attribute weights vector
subjective utility preference
information vector
in terms of and
evaluation value
-
primitive decision matrix
-
normalized decision matrix
, in terms of
fuzzy comprehensive attribute value
attribute evaluation type
benefit type
attribute evaluation type
cost type
The establishment and solution of the decision model accounted for one principle: the preference values based on subjective and objective decision-making should conform to the consistency principle, and the selection of the weights should minimize the sum of the distances squared of the subjective and objective preference values. The following optimization models can then be given: 决策模型的建立和求解遵循一个原则:基于主客观决策的偏好值应符合一致性原则,权重的选择应使主客观偏好值的距离平方和最小。这样就可以给出以下优化模型:
where is the subjective preference value for scheme . Eq. (5) can be solved by the Lagrange multiplier method. 其中 是方案 的主观偏好值。式 (5) 可用拉格朗日乘数法求解。
Some key properties (i.e., NRMSE, e, N, initial investment, and energy-saving) were investigated to evaluate the advantages and disadvantages of the five options, and the attribute evaluation types for the key properties are presented in Table 2. 对一些关键属性(即 NRMSE、e、N、初始投资和节能)进行了调查,以评价五种方案的优劣,关键属性的属性评价类型见表 2。
The energy savings for Options 1 to 5 are given by Eqs. (6)-(10), respectively: 方案 1 至 5 的节能效果分别由公式 (6)-(10) 得出:
In the above, and were the real (tested) cooling load and COP for the HVAC system, while and were the predicted cooling load and COP for the HVAC system. and were the revised COP values based on the monitoring system defined in Eq. (11). In Eqs. (6)-(10), the summation lower limit of 0 indicated the moment of the starting measurement, and the summation upper limit indicated the moment of the end measurement. 在上述公式中, 和 是暖通空调系统的实际(测试)冷却负荷和 COP,而 和 是暖通空调系统的预测冷却负荷和 COP。 和 是根据公式 (11) 中定义的监控系统修正的 COP 值。在公式 (6)-(10) 中,求和下限 0 表示起始测量时刻,求和上限 表示结束测量时刻。
where is the lower-limit value for the COP, which means that the two types of revised COP will be larger than at any time owing to the monitoring system. 其中 是 COP 的下限值,这意味着由于监控系统的存在,两种类型的修正 COP 在任何时候都会大于 。
It should be noted that the energy savings calculated by Eqs. (6)-(10) is only concerned with saving the energy loss caused by the fluctuation in the cooling/heating sources rather than the terminal energy loss and thermal comfort of the room. The thermal comfort of the room is considered to be stable after and before the prediction. 需要注意的是,根据公式(6)-(10)计算出的节能效果只涉及冷/热源波动造成的能源损耗,而不是房间的终端能源损耗和热舒适度。房间的热舒适度被认为在预测后和预测前是稳定的。
(3) The second stage of the user preferences (3) 用户偏好的第二阶段
For different application scenarios, not all situations require highprecision prediction. The accuracy of the prediction can have a large impact on the initial investment and measurement. By investigating some classical literature , it was found that most studies obtained good prediction accuracy but showed little consideration for whether the real accuracy matches their own needs, especially for the cost to obtain the accuracy. 对于不同的应用场景,并非所有情况都需要高精度预测。预测精度会对初始投资和测量产生很大影响。通过调查一些经典文献 ,我们发现大多数研究都获得了较高的预测精度,但很少考虑实际精度是否符合自身需求,尤其是获得精度所需的成本。
Thus, the second stage of the user preferences should be considered. When the minimum accuracy requirements were set, if the accuracy of the "best performance model" (i.e., the best NRMSE score) recommended by the meta-learning system could not meet the minimum requirement, the signal regression model and recommendation system discussed above were considered to have failed. For example, if the minimum requirement was 0.8 and the accuracy of the "recommended best model" was 0.7 , the recommendation system failed to meet the user's requirements. Therefore, a special regression model, namely, the "walking slide method", was proposed to address these special cases. A detailed description of this combination method will be discussed in Section 2.2. 因此,应考虑第二阶段的用户偏好。在设定最低精度要求时,如果元学习系统推荐的 "最佳性能模型"(即最佳 NRMSE 分数)的精度不能满足最低要求,则认为上述信号回归模型和推荐系统失败。例如,如果最低要求为 0.8,而 "推荐的最佳模型 "的准确度为 0.7,则推荐系统不能满足用户的要求。因此,针对这些特殊情况,我们提出了一种特殊的回归模型,即 "行走滑动法"。关于这种组合方法的详细介绍将在第 2.2 节中讨论。
This section proposes the new meta-learning strategy system, and two-stage user preferences are incorporated into the recommendation system by the MODMA method and the new "walking slide method". 本节提出了新的元学习策略系统,并通过 MODMA 方法和新的 "行走滑动法 "将两阶段用户偏好纳入推荐系统。
2.2. Walking slide method for the special case 2.2.特殊情况下的行走滑动法
Based on the situation described in Section 2.1.2, the "walking slide method" is proposed in this section. This idea arose from the data splitting process [57] of the testing and training parts. As expressed in Table 3, a data slider is created first, and the length of the slider defaults to the period . Then, the slider is incorporated into the welltrained recommendation system. While increasing the length of the slider (from to ), the recommended results (the best prediction performance model on the slider) before and after the increasing are observed. Until , the endpoint of the slider can be confirmed. Next, the original time series (e.g. in Fig. 2) will be separated into several fragments. Every fragment of the corresponding data can perform best on a certain model, and the model obtained the best score was determined by the well-trained recommendation system mentioned in Section 2.1. For example, as shown in Fig. 2, if the time series was separated into four fragments (this step might be more complicated in reality), , , the best scores could be achieved with the SVR, LSTM-RNN, Elman RNN and RF models, respectively. Finally, the four fragments are combined to calculate the accuracy of the entire time series . The detailed algorithm is presented in Table 3. 根据第 2.1.2 节所述情况,本节提出了 "行走滑动法"。这一想法源于测试和训练部分的数据分割过程[57]。如表 3 所示,首先创建一个数据滑块 ,滑块的长度默认为周期 。然后,将滑块纳入训练有素的推荐系统。在增加滑块长度(从 到 )的同时,观察 之前和 之后的推荐结果(滑块上的最佳预测性能模型)。直到 ,可以确认滑块的终点。接下来,原始时间序列(如图 2 中的 )将被分成几个片段。相应数据的每个片段都可以在某个模型上表现最佳,而获得最佳分数的模型则由第 2.1 节中提到的训练有素的推荐系统决定。例如,如图 2 所示,如果将时间序列分成四个片段(这个步骤在现实中可能更复杂), , ,SVR、LSTM-RNN、Elman RNN 和 RF 模型分别可以获得最佳分数。最后,将四个片段合并计算整个时间序列的准确度 。详细算法见表 3。
This section proposed the "walking slide method" for cases in which the recommendation system fails. In this regard, the step size, walking rate, limit value of the start of step three in Table 3, and other 本节针对推荐系统失效的情况提出了 "行走滑动法"。在这方面,表 3 中的步长、行走速度、第三步开始的极限值以及其他
Table 2 表 2
The attribute evaluation type for the five properties. 五个属性的属性评估类型。
Attribute evaluation type
Evaluation properties
Reasons
cost type
NRMSE
smaller NRMSE could provide a larger benefit
smaller e values could provide a larger benefit
a larger N results in a larger acquisition cost for users
benefit type
initial investment
larger initial investment results in a larger cost for users
energy saving
larger energy savings result in larger benefits for users
Table 3 表 3
Algorithm of the walking slide method. 行走滑动法的算法。
Algorithm: Walking slide method 算法:行走滑动法
Requirements: Series , recommendation system, data slider , period , step size , walking rate 要求:系列 ,推荐系统,数据滑块 ,周期 ,步幅 ,步行速率