6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Using Deep Learning to Identify Costa Rican Native Tree Species From Wood Cut Images

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Tree species identification is critical to support their conservation, sustainable management and, particularly, the fight against illegal logging. Therefore, it is very important to develop fast and accurate identification systems even for non-experts. In this research we have achieved three main results. First, we developed—from scratch and using new sample collecting and processing protocols—an dataset called CRTreeCuts that comprises macroscopic cross-section images of 147 native tree species from Costa Rica. Secondly, we implemented a CNN for automated tree species identification based on macroscopic images of cross-sections of wood. For this CNN we apply the fine-tuning technique with VGG16 as a base model, pre-trained with the ImageNet data set. This model is trained and tested with a subset of 75 species from CRTreeCuts. The top-1 and top-3 accuracies achieved in the testing phase are 70.5% and 80.3%, respectively. The Same-Specimen-Picture Bias (SSPB), which is known to erroneously increase accuracy, is absent in all experiments. Finally, the third result is Cocobolo, an Android mobile application that uses the developed CNN as back-end to identify Costa Rican tree species from images of cross-sections of wood.

          Related collections

          Most cited references37

          • Record: found
          • Abstract: found
          • Article: not found

          One-shot learning of object categories.

          Learning visual models of object categories notoriously requires hundreds or thousands of training examples. We show that it is possible to learn much information about a category from just one, or a handful, of images. The key insight is that, rather than learning from scratch, one can take advantage of knowledge coming from previously learned categories, no matter how different these categories might be. We explore a Bayesian implementation of this idea. Object categories are represented by probabilistic models. Prior knowledge is represented as a probability density function on the parameters of these models. The posterior model for an object category is obtained by updating the prior in the light of one or more observations. We test a simple implementation of our algorithm on a database of 101 diverse object categories. We compare category models learned by an implementation of our Bayesian approach to models learned from by Maximum Likelihood (ML) and Maximum A Posteriori (MAP) methods. We find that on a database of more than 100 categories, the Bayesian approach produces informative models when the number of training examples is too small for other methods to operate successfully.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Generalizing from a Few Examples: A Survey on Few-shot Learning

            Machine learning has been highly successful in data-intensive applications but is often hampered when the data set is small. Recently, Few-shot Learning (FSL) is proposed to tackle this problem. Using prior knowledge, FSL can rapidly generalize to new tasks containing only a few samples with supervised information. In this article, we conduct a thorough survey to fully understand FSL. Starting from a formal definition of FSL, we distinguish FSL from several relevant machine learning problems. We then point out that the core issue in FSL is that the empirical risk minimizer is unreliable. Based on how prior knowledge can be used to handle this core issue, we categorize FSL methods from three perspectives: (i) data, which uses prior knowledge to augment the supervised experience; (ii) model, which uses prior knowledge to reduce the size of the hypothesis space; and (iii) algorithm, which uses prior knowledge to alter the search for the best hypothesis in the given hypothesis space. With this taxonomy, we review and discuss the pros and cons of each category. Promising directions, in the aspects of the FSL problem setups, techniques, applications, and theories, are also proposed to provide insights for future research. 1
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Going deeper in the automated identification of Herbarium specimens

              Background Hundreds of herbarium collections have accumulated a valuable heritage and knowledge of plants over several centuries. Recent initiatives started ambitious preservation plans to digitize this information and make it available to botanists and the general public through web portals. However, thousands of sheets are still unidentified at the species level while numerous sheets should be reviewed and updated following more recent taxonomic knowledge. These annotations and revisions require an unrealistic amount of work for botanists to carry out in a reasonable time. Computer vision and machine learning approaches applied to herbarium sheets are promising but are still not well studied compared to automated species identification from leaf scans or pictures of plants in the field. Results In this work, we propose to study and evaluate the accuracy with which herbarium images can be potentially exploited for species identification with deep learning technology. In addition, we propose to study if the combination of herbarium sheets with photos of plants in the field is relevant in terms of accuracy, and finally, we explore if herbarium images from one region that has one specific flora can be used to do transfer learning to another region with other species; for example, on a region under-represented in terms of collected data. Conclusions This is, to our knowledge, the first study that uses deep learning to analyze a big dataset with thousands of species from herbaria. Results show the potential of Deep Learning on herbarium species identification, particularly by training and testing across different datasets from different herbaria. This could potentially lead to the creation of a semi, or even fully automated system to help taxonomists and experts with their annotation, classification, and revision works.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Plant Sci
                Front Plant Sci
                Front. Plant Sci.
                Frontiers in Plant Science
                Frontiers Media S.A.
                1664-462X
                01 April 2022
                2022
                : 13
                : 789227
                Affiliations
                [1] 1School of Mathematics, Costa Rica Institute of Technology , Cartago, Costa Rica
                [2] 2School of Computing, Costa Rica Institute of Technology , Cartago, Costa Rica
                [3] 3School of Forestry Engineering, Costa Rica Institute of Technology , Cartago, Costa Rica
                [4] 4Cooperativa de Productividad Forestal, Facultad de Ciencias Forestales, Universidad de Concepción , Concepción, Chile
                Author notes

                Edited by: Alexis Joly, Research Centre Inria Sophia Antipolis Méditerrane, France

                Reviewed by: Sitanshu Sekhar Sahu, Birla Institute of Technology, India; Esa Prakasa, National Research and Innovation Agency (BRIN) of Indonesia, Indonesia

                *Correspondence: Geovanni Figueroa-Mata gfigueroa@ 123456itcr.ac.cr

                This article was submitted to Technical Advances in Plant Science, a section of the journal Frontiers in Plant Science

                Article
                10.3389/fpls.2022.789227
                9011719
                35432415
                768792c8-bca4-4c67-97bb-1110e1d1213b
                Copyright © 2022 Figueroa-Mata, Mata-Montero, Valverde-Otárola, Arias-Aguilar and Zamora-Villalobos.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 04 October 2021
                : 18 January 2022
                Page count
                Figures: 11, Tables: 2, Equations: 3, References: 39, Pages: 12, Words: 7736
                Funding
                Funded by: Instituto Tecnológico de Costa Rica, doi 10.13039/501100005388;
                Categories
                Plant Science
                Original Research

                Plant science & Botany
                deep learning,convolutional neural network,plant classification,automated image-based tree species identification,costa rican tree species,xylotheques

                Comments

                Comment on this article