Research on Influencing Factors of Carbon Market Price and Machine Learning Pricing Prediction: Empirical Analysis Based on Seven Carbon Markets in China
Abstract:carbon trading market is an important tool to cope with climate change and realize low-carbon economy transition. The price formation mechanism and influencing factors of carbon trading market have attracted increasing attention. However, most of the existing studies focus on theoretical discussions, lacking empirical analysis of the seven carbon trading markets in China. This paper uses differential econometric models and machine learning algorithms (including decision trees, random forests and Xgboost) to analyze carbon trading market data from 2014 to 2023 to explore the impact of macroeconomic factors on carbon trading prices. The results show that there is a significant negative correlation between macroeconomic factors and carbon trading price, and machine learning model is better than traditional linear regression model in predicting carbon trading price. The research of this paper provides an important reference for improving the efficiency of carbon trading market and formulating relevant policies.
Keywords:carbon trading; econometric differential model; machine learning
I. Introduction
The theoretical focus of carbon trading market is "Pigou tax" and "Coase law", the logic of internalizing carbon emission externality cost in carbon trading market mainly lies in total amount control and trading system design, and the hidden economic principle behind carbon trading is "green premium"(Sun Xia et al., 2024)[3]. Behind this system design reflects the simple concept of justice of "who pollutes, who governs; who develops, who protects". As an important tool to cope with climate change and realize low-carbon economy transition, carbon trading market has attracted more and more attention in recent years. With the global emphasis on carbon emission control, countries have established and improved carbon trading markets to promote efficient allocation of resources and technological innovation. China, as the world's largest carbon dioxide emitter, is of great significance to promote the transformation of low-carbon economy for achieving sustainable development. The primary goal of low-carbon economy transition is to reduce carbon emissions, and the impact of traditional energy consumption structure makes carbon emissions from energy consumption the most important source of emissions at present.
Existing research shows that efficient carbon markets can effectively reflect the true cost of carbon emissions and thus guide companies to adopt low-carbon technologies. Although carbon trading market has many advantages, there are still many limitations in practice, such as limited market development, low trading frequency and low offset ratio of certified emission reductions. These problems may be related to insufficient incentives and limited coverage of carbon trading markets. Therefore, it is of great theoretical and practical significance to deeply analyze the factors affecting carbon trading price, especially the role of macroeconomic factors.
The innovation of this paper lies in using machine learning method to deeply analyze the pricing problem of carbon trading market, overcoming the limitation of traditional forecasting model affected by data frequency and other factors, so as to improve the accuracy of forecasting results. Although the existing literature affirms the positive role of carbon trading in energy conservation and emission reduction, the imperfections of the market lead to its inefficiency, unfair competition and manipulation. Therefore, this paper will select the data of China's seven carbon trading markets from 2014 to 2023 to analyze the impact of macroeconomic factors on carbon trading prices, aiming to improve the efficiency of carbon trading markets and provide reference for the formulation of relevant policies. The structure of this paper is as follows: the second part is literature review, the third part introduces research design and methods, the fourth part presents empirical results, and the fifth part is conclusions and recommendations.
II. Literature review
Carbon trading market is an important tool to cope with climate change and realize low-carbon economic transformation. With the emphasis on carbon emission control, how to improve the efficiency of carbon trading market, optimize resource allocation and promote technological innovation has become the focus of academia and policy makers. This paper reviews the related literatures and discusses the importance of improving the efficiency of carbon trading market, the factors affecting carbon trading price, the limitations of the market and the corresponding solutions.
An efficient carbon market can effectively reflect the true cost of carbon emissions, thus guiding enterprises to adopt low-carbon technologies and achieve efficient allocation of resources. Xu Jianwei and Liu Zhihua (2024) pointed out that China, as a major CO2 emitter, promoting the transformation of low-carbon economy is of great significance for implementing the concept of green and low-carbon development. The primary goal of low-carbon economic transformation is to reduce carbon emissions, and due to the influence of traditional energy consumption structure, carbon emissions generated by energy consumption are the most important source of emissions at present and also the focus of carbon emission reduction in the future[5]. At the same time, Wu Zhenni et al.(2024) emphasized that, from the perspective of implied carbon emissions, the implementation difference of carbon trading policy will increase the uneven distribution of carbon emission reduction responsibilities between pilot areas and non-pilot areas, enriching the relevant conclusions of the current research on the impact of carbon trading pilot policy on implied carbon emissions between regions[4]. Although carbon trading markets have many advantages, there are still many limitations in practice. Luo Liangwen et al.(2024) pointed out that the domestic carbon emission trading market still faces some challenges, such as limited market development, inability to be applied on a large scale, low trading frequency, low offset ratio of certified emission reductions, complex offset procedures, and difficulty in mobilizing market enthusiasm. This may be related to insufficient incentives in the carbon trading market, or it may be due to the limited coverage of carbon trading, resulting in poor implementation of policies[2]. Sun Xia and Liang Hongzhi (2024) emphasize that the phenomenon of hidden carbon emission transfer not only affects the emission reduction effect, but also may lead to unfair competition among regions. In addition, the market mechanism is not perfect, and the lack of regulatory measures makes the market vulnerable to manipulation, similar to Zhao et al. (2023)[7].
Carbon trading price is the result of many factors. Accordingto Gao Kaiet al.(2024), policy regulation, market supply-demand relationship and various external economic environments will affect the fluctuation of carbon trading price[1]. Specifically, policy uncertainty tends to lead to market participants 'expected volatility of future prices, exacerbating market instability. (2019) points outthat transparency and information symmetry of markets also play an important role in price formation, and that information asymmetry may lead to manipulation by some participants[12]. In addition, Fang et al. (2023) Further research shows that there is a significant interaction between technological innovation breakthroughs and carbon trading policies, and a good policy environment can promote technological progress and thus affect carbon trading prices[89]. A deep understanding of the dynamics and complexity of carbon markets is therefore fundamental to effective policy formulation.
To sum up, the existing literature affirms the positive role of carbon trading in energy conservation and emission reduction, but at the same time, due to the imperfections of the market, there are problems such as inefficiency, unfair competition and even manipulation. Therefore, this paper selects the data of seven carbon trading markets in China from 2014 to 2023 to analyze the impact of macro indicators on carbon trading, which can not only improve the efficiency of carbon trading market, but also provide direction for formulating relevant policies.
III. Study design
(i) Research methods
1. differential econometric model
In this chapter, to simplify the research process and establish effective econometric models, we will construct linear models for the relationship between financial asset returns and macroeconomic factors in each pilot market. Specifically, we use the Differenced Econometric Model, which improves the robustness and predictive power of the model by differentiating the time series data to eliminate non-stationarity and seasonal fluctuations.
represents the difference in logarithm of valence,Lrepresents the lag factor, andXrepresents the vector of explanatory variables. When constructing the linear model, the variables selected mainly include the trading prices of the seven major trading markets, the indexes of the domestic stock market, the major energy sources and the US stock index.
The basic idea of differential econometric models is to eliminate trend components and periodic fluctuations in the data by calculating the differential value of the variable (i.e., the difference between the current value and the previous value), so that the data are more consistent with the stationarity assumption. This process not only helps to improve the explanatory power of the model, but also effectively reduces the influence of autocorrelation on the regression results. After differential processing, the model can more accurately capture the dynamic relationship between carbon trading prices and macroeconomic factors.
2. decision-making tree
Zhou Liang (2022) defines three machine learning models in turn as basic modeling[6]. Threemachine learning algorithms based on decision treeare CART decision tree algorithm, Bagging ensemble algorithm based random forest algorithm and Boosting ensemble algorithm based Xgboost algorithm. CART decision trees select attributes by comparing the Gini index of each attribute.
where is the sum of t decision trees for the sample as the predicted value, f is the tth regression tree, and F is the set space of all regression trees.
3. random forest
Random Forest (RF) is an ensemble learning method that uses voting to combine prediction results, assigning the category with the most votes as the final output. Random forest can handle very large amounts of data, and its error rate for most learning tasks is almost at the same level as any other method, with a smaller tendency to overfit.
Building a loss function based on the decision tree:
where lrepresentsthe deviation between thepredicted value and the true value , Ω represents the complexity of each tree, and the calculation formula is:
where T is the number of leaf nodes, w is the weight of leaf nodes, γ and λ are regular coefficients.
4. gradient hoist
Xgboost is proposed on the basis of the traditional GBDT model and is a new gradient-lifting ensemble learning method. Compared with GBDT, Xgboost uses second-order Taylor expansion to increase regularization term to seek optimal solution of loss function, avoiding overfitting to some extent.
Based on the first two formulas,Xgboost Taylor expands the loss functionto quadratic terms at , and transforms the optimization objective into a problem of solving the minimum of a quadratic function in one variable under the given decision tree structure. Then we try to segment the leaf nodes by greedy algorithm and compare the gain of objective function before and after segmentation until we get the optimal model.
5. performance evaluation
In order to evaluate the predictive performance of the above four models,mean absolute error (MAE) and root mean square error (RMSE) areusedas evaluation indicators of model goodness of fit[6], which are calculated as follows:
N represents the sample size, and represent the predicted and observed values of the sample, respectively. The smaller the MAE and RMSE, the better the model fit.
(ii) Variable settings
The data in this article comes from Wind database, using monthly data from 2014 to 2023 to comprehensively show the changing trend of carbon trading market. Referring to Zeng et al.(2023), this study sets the average transaction price of carbon trading (yuan/ton) as the explained variable[137], and divides the explanatory variables into the following categories:
1. Stock index category: including Shanghai index, Shenzhen index, energy index, raw material index, industrial index, bond yield;
2. Energy price category: covering raw coal price (yuan/ton), steel price (yuan/ton), ore price (yuan/ton), liquefied natural gas price (yuan/ton), gasoline price (yuan/ton) and crude oil price (yuan/ton), among which raw coal price selects Dongsheng price in Inner Mongolia, gasoline price selects No.95 gasoline price[11];
3. U.S. stock index category: including Nasdaq index and Dow Jones index.
(iii) Descriptive statistics
Table1Sample observation time and data
|
|
|
|
|
|
| 2014/1/1 | 2023/12/31 | 3652 | 2027 | 55.50% |
| 2014/1/1 | 2023/12/31 | 3652 | 1498 | 41.02% |
| 2014/1/1 | 2023/12/31 | 3652 | 1468 | 40.20% |
| 2014/1/1 | 2023/12/31 | 3652 | 2142 | 58.65% |
| 2014/1/1 | 2023/12/31 | 3652 | 897 | 24.56% |
| 2014/1/1 | 2023/12/31 | 3652 | 2238 | 61.28% |
| 2014/1/1 | 2023/12/31 | 3652 | 843 | 23.08% |
This paper collects trading activity in China's seven major carbon trading markets from January 1, 2014 to December 31, 2023, reflecting significant differences in non-zero trading days and their proportions. The non-zero trading days in Shenzhen market are 2027 days, accounting for 55.50% of the total observation days, showing high trading activity; while the non-zero trading days in Tianjin and Chongqing markets are only 897 days and 843 days respectively, with the proportion as low as 24.56% and 23.08%, indicating that their trading activity is relatively low. These differences may be closely related to factors such as market mechanism, policy environment and participant base, especially the proportion of non-zero trading days in Guangdong and Hubei markets is 58.65% and 61.28% respectively, further emphasizing the diversity of market activity.
Table2Descriptive statistical results of data indicators
Column | Obs | Mean | Std |
| 840 | 32.80 | 23.46 |
| 840 | 3102.76 | 433.87 |
| 840 | 10946.09 | 2084.65 |
| 840 | 994.99 | 263.93 |
| 840 | 3245.03 | 678.46 |
| 840 | 3977.54 | 856.90 |
| 840 | 127.44 | 14.48 |
| 840 | 438.05 | 266.85 |
| 840 | 3600.50 | 886.67 |
| 840 | 477.64 | 386.77 |
| 840 | 4266.87 | 1479.23 |
| 840 | 3502.51 | 4052.17 |
| 840 | 63.95 | 19.79 |
| 840 | 8604.20 | 3568.57 |
| 840 | 25370.13 | 6578.51 |
The table above shows statistics for 840 observations, covering average carbon trading prices and various economic indicators. The average transaction price of carbon trading is 32.80 yuan/ton, and the standard deviation is 23.46, indicating that the market price fluctuates significantly and may be affected by changes in policy and supply and demand. In terms of stock index, the average value of Shanghai Stock Index is 3102.76 and the standard deviation is 433.87, showing relatively stable market performance; while the average value of Shenzhen Index is 10946.09 and the standard deviation is as high as 2084.65, reflecting greater volatility. The energy index has a mean of 994.99 and a standard deviation of 263.93, indicating price volatility in the energy market.
In terms of raw material prices, the average price of raw coal is 438.05 yuan/ton, and the standard deviation is 266.85, indicating that the price fluctuates greatly. The average price of steel and iron ore is 3600.50 yuan and 477.64 yuan respectively, and the standard deviation is 886.67 yuan and 386.77 yuan respectively, reflecting the volatility of raw material market. The average price of LNG and gasoline is 4266.87 yuan and 3502.51 yuan respectively, and the standard deviation is 1479.23 yuan and 4052.17 yuan respectively, showing high fluctuation of energy prices. Nasdaq and Dow averages were 8604.20 and 25370.13, respectively, with standard deviations of 3568.57 and 6578.51, indicating significant volatility in the U.S. stock market.
V. Empirical Results
(a) Results of differential regression
The following tableshows the regression analysis results between various financial assets and macroeconomic factors in the seven major carbon trading markets in China. By comparing the regression coefficient, t value, standard error (Std) and coefficient of determination (R²) of different markets, we can observe the failure phenomenon of carbon trading market in pricing mechanism.