生态环境学报 ›› 2024, Vol. 33 ›› Issue (12): 1891-1901.DOI: 10.16258/j.cnki.1674-5906.2024.12.007

• 研究论文【环境科学】 • 上一篇    下一篇

基于CNN-LSTM的鄱阳湖生态经济区大气污染物时空预测

黄怡容1(), 熊秋林1,2,3,*(), 熊正坤1, 陈文波1,2,3, 李长鸿1, 沙鸿钰1   

  1. 1.东华理工大学测绘与空间信息工程学院,江西 南昌 330013
    2.东华理工大学江西省流域生态过程与信息重点实验室,江西 南昌 330013
    3.东华理工大学南昌市景观过程与国土空间生态修复重点实验室,江西 南昌 330013
  • 收稿日期:2024-05-29 出版日期:2024-12-18 发布日期:2024-12-31
  • 通讯作者: *熊秋林。E-mail: xiong_ql@163.com
  • 作者简介:黄怡容(2001年生),女,硕士研究生,研究方向为生态环境监测与评价。E-mail: hyrr_123@163.com
  • 基金资助:
    国家自然科学基金项目(42107274);江西省自然科学基金项目(20202BABL213030)

The Spatiotemporal Prediction of Air Pollutants in the Poyang Lake Ecological Economic Zone Based on CNN-LSTM

HUANG Yirong1(), XIONG Qiulin1,2,3,*(), XIONG Zhengkun1, CHEN Wenbo1,2,3, LI Changhong1, SHA Hongyu1   

  1. 1. School of Surveying and Geoinformation Engineering, East China University of Technology, Nanchang 330013, P. R. China
    2. Jiangxi Key Laboratory of Watershed Ecological Process and Information, East China University of Technology, Nanchang 330013, P. R. China
    3. Nanchang Key Laboratory of Landscape Process and Territorial Spatial Ecological Restoration, East China University of Technology, Nanchang 330013, P. R. China
  • Received:2024-05-29 Online:2024-12-18 Published:2024-12-31

摘要:

为准确预测大气污染物浓度的变化趋势,基于2020年1月1日-2024年4月6日鄱阳湖生态经济区77个环境空气质量自动监测站点(36个国控点和41个省控点)的主要大气污染物逐日监测数据,通过空间可视化技术,探讨了该区域2020-2023年主要大气污染物的时空演化特征;在已有先验知识的基础上,基于卷积神经网络(CNN)和长短期记忆神经网络(LSTM)模型,构建了鄱阳湖生态经济区的主要大气污染物浓度时间序列预测模型(CNN-LSTM);基于LSTM模型和CNN-LSTM模型预测了研究区2024年主要大气污染物浓度的时空分布。结果表明:1)2020-2023年鄱阳湖生态经济区夏季空气质量较好,主要大气污染物NO2、PM2.5、PM10和CO浓度的月变化均呈“U”形趋势,夏秋季O3浓度显著高于冬春季,SO2浓度逐年降低,但O3、PM2.5和PM10浓度均存在上升的趋势;2)O3、PM2.5和PM10在鄱阳湖生态经济区分布呈“西高东低”的特征,污染严重区主要分布在南昌市及九江市;3)构建的CNN-LSTM模型在主要大气污染物的预测精度上均优于LSTM模型,MAE和RMSE指标均显著下降,R2指标显著提高,绝大部分监测站点预测值与实测值Pearson相关系数(r)大于0.8;4)基于CNN-LSTM模型的时空预测结果,2024年6项主要大气污染物浓度较2023年均呈现一定的上升趋势,其中PM10和O3上升尤为明显。建议考虑PM10和O3的协同治理,以有效防控鄱阳湖生态经济区的空气污染。

关键词: 空气质量, 卷积神经网络, 长短期记忆神经网络, 深度学习, 时空预测

Abstract:

Air pollution has become a pressing global environmental issue with severe implications for human health, economic growth, and societal wellbeing. Long-term exposure to key pollutants, such as particulate matter (PM), ozone (O3), sulfur dioxide (SO2), and carbon monoxide (CO), is strongly linked to respiratory and cardiovascular diseases. According to the World Health Organization (WHO), 90% of the world’s population resides in areas where air quality fails to meet health standards, contributing to approximately seven million premature deaths annually. Urban areas, characterized by dense populations and concentrated industrial activities, are particularly vulnerable to high levels of pollution. Accurately predicting air quality trends is essential for mitigating health risks, informing policy decisions, and effectively managing pollution sources. Traditional forecasting methods, such as numerical weather prediction models (WRF-Chem) and statistical approaches (ARIMA), have limitations owing to the complexity of atmospheric processes and inherent model uncertainties. These methods often struggle to capture the nonlinear dynamics of pollution dispersion and interactions among various environmental factors. In contrast, recent advances in machine and deep learning have offered promising solutions by leveraging large, diverse datasets. These approaches integrate information from multiple sources, uncover hidden patterns, and generate reliable predictions. The aim of this study was to predict trends in air pollutant concentrations within the Poyang Lake Ecological and Economic Zone (POLEZ) using deep learning models, specifically Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM).This study is based on daily monitoring data of major air pollutants collected from 77 ambient air quality automatic monitoring stations (36 national control points and 41 provincial control points) within the POLEZ, covering the period from January 1, 2020, to April 6, 2024. Data from January 1, 2020, to December 31, 2023, were used as the training set, while the data from January 1, 2024, to April 6, 2024, served as the testing set. Initially, the spatial and temporal evolution of six major air pollutants (CO, NO2, SO2, ozone, PM2.5, and PM10) from 2020 to 2023 were analyzed using spatial visualization techniques. This analysis provides a comprehensive overview of pollutant trends and regional variations, helping identify key patterns and anomalies across the POLEZ. Building on this spatiotemporal analysis, a CNN-LSTM model was developed to predict pollutant concentrations in 2024. The CNN component of the model extracts time-series features from historical air quality data, whereas the LSTM component captures long-term dependencies, thereby enabling more accurate predictions of future trends. The performance of the CNN-LSTM model was compared with that of a traditional LSTM model to evaluate the added benefit of combining CNN and LSTM for air quality prediction. The key findings from this study are as follows: 1) Seasonal Patterns of Pollutants: Analysis of data from 2020 to 2023 reveals that concentrations of nitrogen dioxide (NO2), respirable particulate matter (PM2.5), inhalable particulate matter (PM10), and carbon monoxide (CO) follow a “U”-shaped seasonal pattern, with lower levels in the summer and higher levels in the winter. This pattern reflects the influence of seasonal weather conditions on air quality, particularly in winter when temperature inversions and weak vertical mixing hinder the dispersion of pollutants. In contrast, ozone (O3) concentrations peaked in summer and fall, driven by enhanced photochemical reactions due to increased sunlight and warmer temperatures. Monthly trends in sulfur dioxide (SO2) concentrations show less variability, with a gradual decline over time, indicating the effectiveness of recent pollution control efforts in the region. 2) Spatial and Temporal Distribution of Pollutants: The spatial distribution of pollutants across the POLEZ revealed that Nanchang, as a regional economic hub, experienced the highest pollutant concentrations, primarily due to intensive industrial and transportation activities. Pollutants such as carbon monoxide (CO), nitrogen dioxide (NO2), and PM2.5, exhibit a west-to-east gradient, with higher pollution levels in western cities, including Nanchang and Jiujiang. Sulfur dioxide (SO2) concentrations remained relatively low across the region, particularly around Poyang Lake. However, PM2.5 and PM10 concentrations increased in 2023, likely driven by economic recovery following the pandemic. Jiujiang, in particular, is the most polluted city for ozone (O3), with concentrations showing a northwest-southeast gradient. 3) Model Performance: The CNN-LSTM model demonstrated superior prediction accuracy compared to the LSTM model. Specifically, the Mean Absolute Error (MAE) for CO decreased from 0.22 to 0.17, representing a 22.73% improvement; for SO2, the MAE decreased from 1.67 to 1.28, a 23.35% improvement; for NO2, the MAE decreased from 10.96 to 7.15, showing a 34.76% improvement; for O3, the MAE decreased from 17.21 to 12.39, an improvement of 28.01%; for PM2.5, the MAE decreased from 21.09 to 13.35, reflecting a 36.70% accuracy improvement; and for PM10, the MAE decreased from 28.42 to 19.51, representing a 31.35% improvement. Additionally, the CNN-LSTM model showed significant improvements in both the Root Mean Squared Error (RMSE) and values, indicating that the model provided more reliable and accurate predictions of air pollution trends. The Pearson correlation coefficients between the predicted and observed values for most monitoring stations exceeded 0.8, further validating the high accuracy and reliability of the CNN-LSTM model for air quality forecasting. 4) Projections for 2024: The 2024 forecasts indicate an upward trend in the concentrations of all six major pollutants. Carbon monoxide levels are expected to rise slightly from 2023 but will remain below the levels observed in 2020. Nitrogen dioxide concentrations are projected to remain stable, underscoring the need for continued control measures to maintain air quality. Sulfur dioxide concentrations are anticipated to increase significantly, warranting increased attention and stricter control measures. Similarly, the concentrations of PM2.5, PM10, and O3 are projected to rise, with notable increases in both the PM10 and ozone levels. These findings emphasize the need for more effective and coordinated control measures, particularly for PM10 and ozone, to prevent further deterioration of air quality. The results of this study underscore the effectiveness of the CNN-LSTM model in predicting the spatial and temporal trends of air pollutants and provide valuable insights for future air quality management in the Poyang Lake Ecological and Economic Zone. These findings emphasize the necessity for sustained and coordinated air pollution control efforts, particularly in high-risk areas, such as Nanchang and Jiujiang. By incorporating deep learning models into air quality prediction systems, policymakers can make informed decisions that will ultimately contribute to improving public health and environmental quality.

Key words: air quality, convolutional neural network (CNN), long short-term memory (LSTM) network, deep learning, spatiotemporal prediction

中图分类号: