0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Knowledge-guided machine learning reveals pivotal drivers for gas-to-particle conversion of atmospheric nitrate

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate (ε(NO 3 )). The mechanism between ε(NO 3 ) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical/chemical mechanisms due to the influence of ambient factors. It urgently needs an alternative approach that possesses transparent physical interpretations and provides deeper insights into the impact of ε(NO 3 ). Here we introduce a supervised machine learning approach—the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH 4 +, SO 4 2−, and temperature as pivotal drivers for ε(NO 3 ). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH 4 + during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers in comparison to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.

          Graphical abstract

          Highlights

          • Multilevel nested random forest guided by theory approaches yields reliable results.

          • Pivotal variables are screened automatically via recursive feature elimination.

          • The approach rectifies the outcomes of RF and thoroughly discourses G/P conversion.

          • The significance of NH 4 + is underscored during both daytime (30%) and nighttime (40%).

          • Integrating domain knowledge into machine learning is a research hotspot in the future.

          Related collections

          Most cited references50

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Bagging predictors

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Deep learning and process understanding for data-driven Earth system science

              Machine learning approaches are increasingly used to extract patterns and insights from the ever-increasing stream of geospatial data, but current approaches may not be optimal when system behaviour is dominated by spatial or temporal context. Here, rather than amending classical machine learning, we argue that these contextual cues should be used as part of deep learning (an approach that is able to extract spatio-temporal features automatically) to gain further process understanding of Earth system science problems, improving the predictive ability of seasonal forecasting and modelling of long-range spatial connections across multiple timescales, for example. The next step will be a hybrid modelling approach, coupling physical process models with the versatility of data-driven machine learning.
                Bookmark

                Author and article information

                Contributors
                Journal
                Environ Sci Ecotechnol
                Environ Sci Ecotechnol
                Environmental Science and Ecotechnology
                Elsevier
                2096-9643
                2666-4984
                19 October 2023
                May 2024
                19 October 2023
                : 19
                : 100333
                Affiliations
                [a ]State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin, 300350, China
                [b ]CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin, 300350, China
                [c ]Department of Civil, Environmental, and Construction Engineering, University of Central Florida, Orlando, FL, USA
                [d ]School of Geography Earth and Environment Sciences, University of Birmingham, Birmingham, B15 2TT, UK
                [e ]State Key Laboratory of Precision Measuring Technology and Instruments, Tianjin Key Laboratory of air Pollutants Monitoring Technology, School of Precision Instrument and Opto-electronics Engineering, Tianjin University, Tianjin, 300072, China
                [f ]Gigantic Technology (Tianjin) Co., Ltd, Tianjin, 300072, China
                [g ]College of Computer Science, Nankai University, Tianjin, 300350, China
                [h ]China National Environmental Monitoring Centre, Beijing, 100012, China
                Author notes
                []Corresponding author. State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin, 300350, China. nksgl@ 123456nankai.edu.cn
                [∗∗ ]Corresponding author. zhangll@ 123456cnemc.cn
                Article
                S2666-4984(23)00098-4 100333
                10.1016/j.ese.2023.100333
                10661687
                53a28f1c-8611-4850-8aeb-fa6862bde7a1
                © 2023 The Authors

                This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

                History
                : 7 January 2023
                : 9 October 2023
                : 12 October 2023
                Categories
                Original Research

                machine learning,data driven,theoretical approach,domain knowledge,guide

                Comments

                Comment on this article