      Transformer-based land use and land cover classification with explainability using satellite imagery

Research article


          Abstract

Transformer-based models have greatly improved Land Use and Land Cover (LULC) applications: their ability to analyze imagery and extract key information has substantially advanced the field. However, the high computational cost of these models presents a considerable obstacle to their practical implementation. This study therefore aims to strike a balance between computational cost and accuracy when employing transformer-based models for LULC analysis. We exploit transfer learning and fine-tuning strategies to optimize the resource utilization of transformer-based models. Furthermore, transparency is a core principle of our methodology, promoting fairness and trust when applying LULC models across domains such as forestry, environmental studies, and urban and rural planning. To ensure transparency, we employ Captum, which enables us to uncover and mitigate potential biases and to interpret AI-driven decisions. Our results indicate that transfer learning can improve transformer-based models in satellite image classification, and that strategic fine-tuning can maintain efficiency with minimal accuracy trade-offs. This research highlights the potential of Explainable AI (XAI) in transformer-based models for achieving more efficient and transparent LULC analysis, thereby encouraging continued innovation in the field.
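The explainability tool named in the abstract, Captum, is best known for Integrated Gradients, which attributes a model's output score to its input features by averaging gradients along a path from a baseline to the input. As a dependency-light illustration of that idea (not the authors' actual code — the toy linear model, weights, and inputs below are assumptions for illustration), here is a minimal NumPy sketch; Captum applies the same computation to a transformer's class logits over satellite image pixels:

```python
import numpy as np

def integrated_gradients(grad_f, x, baseline, steps=50):
    """Approximate Integrated Gradients attributions:
    IG_i = (x_i - baseline_i) * average of dF/dx_i along the
    straight-line path from baseline to x (midpoint rule)."""
    alphas = (np.arange(steps) + 0.5) / steps
    total = np.zeros_like(x, dtype=float)
    for a in alphas:
        total += grad_f(baseline + a * (x - baseline))
    return (x - baseline) * total / steps

# Toy "model": a linear scorer standing in for one class logit.
w = np.array([0.5, -1.0, 2.0])
f = lambda x: float(w @ x)
grad_f = lambda x: w  # gradient of a linear model is constant

x = np.array([1.0, 2.0, 3.0])
baseline = np.zeros(3)
attr = integrated_gradients(grad_f, x, baseline)

# Completeness axiom: attributions sum to f(x) - f(baseline).
print(attr)           # per-feature attributions
print(attr.sum())     # equals f(x) - f(baseline)
```

For a linear model the midpoint-rule average is exact, so each attribution reduces to `(x_i - baseline_i) * w_i` and the completeness check holds exactly; for a transformer the path integral is what distinguishes the method from a single raw gradient.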


Author and article information

Contributors
mehakkhan3@hotmail.com
reza.arghandeh@hvl.no

Journal
Scientific Reports (Sci Rep), Nature Publishing Group UK, London
ISSN: 2045-2322
Published: 20 July 2024
Volume: 14
Article number: 16744

Affiliations
Department of Computer Science, Electrical Engineering and Mathematical Sciences, Western Norway University of Applied Sciences (https://ror.org/05phns765), Bergen, Norway

Article identifiers
DOI: 10.1038/s41598-024-67186-4
PMCID: 11271450
PMID: 39033183
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

History
Received: 26 January 2024
Accepted: 9 July 2024
Funding
Funded by: European Space Agency
Funded by: Western Norway University of Applied Sciences
Categories
Article

Custom metadata
© Springer Nature Limited 2024

Subjects
computer science, ecology
