0
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Image‐based taxonomic classification of bulk insect biodiversity samples using deep learning and domain adaptation

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Complex bulk samples of insects from biodiversity surveys present a challenge for taxonomic identification, which could be overcome by high‐throughput imaging combined with machine learning for rapid classification of specimens. These procedures require that taxonomic labels from an existing source data set are used for model training and prediction of an unknown target sample. However, such transfer learning may be problematic for the study of new samples not previously encountered in an image set, for example, from unexplored ecosystems, and require methods of domain adaptation that reduce the differences in the feature distribution of the source and target domains (training and test sets). We assessed the efficiency of domain adaptation for family‐level classification of bulk samples of Coleoptera, as a critical first step in the characterization of biodiversity samples. Neural network models trained with images from a global database of Coleoptera were applied to a biodiversity sample from understudied forests in Cyprus as the target. Within‐dataset classification accuracy reached 98% and depended on the number and quality of training images, and on dataset complexity. The accuracy of between‐datasets predictions (across disparate source–target pairs that do not share any species or genera) was at most 82% and depended greatly on the standardization of the imaging procedure. An algorithm for domain adaptation, domain adversarial training of neural networks (DANN), significantly improved the prediction performance of models trained by non‐standardized, low‐quality images. Our findings demonstrate that existing databases can be used to train models and successfully classify images from unexplored biota, but the imaging conditions and classification algorithms need careful consideration.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: found
          • Article: not found

          Deep learning.

          Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            A Survey on Transfer Learning

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Domain‐adversarial training of neural networks

                Bookmark

                Author and article information

                Contributors
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                Journal
                Systematic Entomology
                Systematic Entomology
                Wiley
                0307-6970
                1365-3113
                July 2023
                January 04 2023
                July 2023
                : 48
                : 3
                : 387-401
                Affiliations
                [1 ] The Center for Data Science Education and Research Shiga University Hikone Japan
                [2 ] Department of Biological Sciences University of Cyprus Nicosia Cyprus
                [3 ] Instituto de Productos Naturales y Agrobiología (IPNA‐CSIC) Tenerife Spain
                [4 ] Department of Life Sciences Natural History Museum London UK
                [5 ] Department of Life Sciences Silwood Park Campus, Imperial College London Ascot UK
                Article
                10.1111/syen.12583
                2fcab493-c029-412f-933e-5378e812e5a1
                © 2023

                http://creativecommons.org/licenses/by/4.0/

                History

                Comments

                Comment on this article