
      Deep Learning Based Automatic Grape Downy Mildew Detection

      research-article


          Abstract

          Grape downy mildew (GDM) is a common plant leaf disease that causes serious damage to grape production, reducing both yield and fruit quality. Traditional manual disease detection relies on farm experts and is often time-consuming. Computer vision and artificial intelligence technologies could provide automatic disease detection for real-time control of disease spread on the grapevine in precision viticulture. To achieve the best trade-off between GDM detection accuracy and speed under natural environments, a deep learning based approach named YOLOv5-CA is proposed in this study. Here, a coordinate attention (CA) mechanism is integrated into YOLOv5 to highlight downy mildew related visual features and enhance detection performance. A challenging GDM dataset was acquired in a vineyard under natural scenes (covering different illuminations, shadows, and backgrounds) to test the proposed approach. Experimental results show that YOLOv5-CA achieved a detection precision of 85.59%, a recall of 83.70%, and a mAP@0.5 of 89.55%, outperforming popular methods including Faster R-CNN, YOLOv3, and YOLOv5. Furthermore, the proposed approach runs inference at 58.82 frames per second and could therefore be deployed for real-time disease control. In addition, the proposed YOLOv5-CA based approach could effectively capture leaf disease related visual features, resulting in higher GDM detection accuracy. Overall, this study provides a favorable deep learning based approach for the rapid and accurate diagnosis of grape leaf diseases in the field of automatic disease detection.
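The coordinate attention mechanism the abstract refers to pools a feature map along each spatial axis separately, so the resulting attention weights keep positional information along the other axis. A minimal numpy sketch of that idea follows; it is not the authors' implementation (which uses learned 1x1 convolutions inside YOLOv5), and the optional per-channel weight vectors here stand in for those learned layers purely for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def coordinate_attention(x, w_h=None, w_w=None):
    """Simplified coordinate attention for one feature map x of shape (C, H, W).

    Pools along width and height separately, then re-weights the input with
    direction-aware sigmoid gates, so the attention is position-sensitive
    along each axis (unlike plain channel attention, which pools globally).
    """
    C, H, W = x.shape
    z_h = x.mean(axis=2)            # (C, H): pooled along the width axis
    z_w = x.mean(axis=1)            # (C, W): pooled along the height axis
    if w_h is not None:             # hypothetical stand-ins for learned layers
        z_h = z_h * w_h[:, None]
    if w_w is not None:
        z_w = z_w * w_w[:, None]
    g_h = sigmoid(z_h)              # gate per (channel, row)
    g_w = sigmoid(z_w)              # gate per (channel, column)
    return x * g_h[:, :, None] * g_w[:, None, :]

x = np.random.rand(8, 16, 16)
y = coordinate_attention(x)
print(y.shape)  # (8, 16, 16): same shape, re-weighted features
```

Because both gates lie in (0, 1), the module can only attenuate features, emphasising disease-related rows and columns relative to the rest of the map.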

          Related collections

          Most cited references: 58


          Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.

          Existing deep convolutional neural networks (CNNs) require a fixed-size (e.g., 224 × 224) input image. This requirement is "artificial" and may reduce the recognition accuracy for the images or sub-images of an arbitrary size/scale. In this work, we equip the networks with another pooling strategy, "spatial pyramid pooling", to eliminate the above requirement. The new network structure, called SPP-net, can generate a fixed-length representation regardless of image size/scale. Pyramid pooling is also robust to object deformations. With these advantages, SPP-net should in general improve all CNN-based image classification methods. On the ImageNet 2012 dataset, we demonstrate that SPP-net boosts the accuracy of a variety of CNN architectures despite their different designs. On the Pascal VOC 2007 and Caltech101 datasets, SPP-net achieves state-of-the-art classification results using a single full-image representation and no fine-tuning. The power of SPP-net is also significant in object detection. Using SPP-net, we compute the feature maps from the entire image only once, and then pool features in arbitrary regions (sub-images) to generate fixed-length representations for training the detectors. This method avoids repeatedly computing the convolutional features. In processing test images, our method is 24-102 × faster than the R-CNN method, while achieving better or comparable accuracy on Pascal VOC 2007. In ImageNet Large Scale Visual Recognition Challenge (ILSVRC) 2014, our methods rank #2 in object detection and #3 in image classification among all 38 teams. This manuscript also introduces the improvement made for this competition.
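The fixed-length property described above is easy to see in a small sketch: pool the feature map into fixed grids of bins and concatenate the per-bin maxima, so the output length depends only on the channel count and pyramid levels, never on the input size. This is a simplified numpy illustration of the pooling step, not the SPP-net training pipeline.

```python
import numpy as np

def spatial_pyramid_pool(fmap, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map into 1x1, 2x2, and 4x4 grids of bins
    and concatenate, yielding a fixed-length vector regardless of H and W."""
    C, H, W = fmap.shape
    feats = []
    for n in levels:
        # bin edges cover the whole map even when H or W is not divisible by n
        h_edges = np.linspace(0, H, n + 1).astype(int)
        w_edges = np.linspace(0, W, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                cell = fmap[:, h_edges[i]:h_edges[i + 1],
                               w_edges[j]:w_edges[j + 1]]
                feats.append(cell.max(axis=(1, 2)))   # (C,) per bin
    return np.concatenate(feats)                      # (C * sum(n*n),)

# Two inputs of different spatial sizes map to vectors of identical length:
a = spatial_pyramid_pool(np.random.rand(256, 13, 13))
b = spatial_pyramid_pool(np.random.rand(256, 21, 30))
print(a.shape, b.shape)  # (5376,) (5376,) -> 256 channels x (1 + 4 + 16) bins
```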

            Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

            State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet and Fast R-CNN have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features: using the recently popular terminology of neural networks with 'attention' mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model, our detection system has a frame rate of 5fps (including all steps) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.
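The "predictions at each position" the RPN makes are offsets relative to a fixed set of reference boxes, or anchors. A small numpy sketch of the anchor enumeration step can make the geometry concrete; the scales and ratios below match the defaults reported in the paper, but the function itself is only an illustration, not the released implementation.

```python
import numpy as np

def generate_anchors(feat_h, feat_w, stride=16,
                     scales=(128, 256, 512), ratios=(0.5, 1.0, 2.0)):
    """Enumerate RPN-style anchors: at every cell of a feat_h x feat_w
    feature map, place k = len(scales) * len(ratios) boxes centred on the
    corresponding input-image location. Returns (feat_h*feat_w*k, 4) boxes
    as (x1, y1, x2, y2) in input-image coordinates."""
    anchors = []
    for y in range(feat_h):
        for x in range(feat_w):
            cx, cy = (x + 0.5) * stride, (y + 0.5) * stride
            for s in scales:
                for r in ratios:
                    # keep area ~ s*s while varying aspect ratio r = h/w
                    w = s / np.sqrt(r)
                    h = s * np.sqrt(r)
                    anchors.append([cx - w / 2, cy - h / 2,
                                    cx + w / 2, cy + h / 2])
    return np.array(anchors)

boxes = generate_anchors(2, 3)
print(boxes.shape)  # (54, 4): 2x3 positions x 9 anchors each
```

The RPN's two heads then score each of these boxes for objectness and regress a refinement, which is why proposals are nearly free once the shared convolutional features are computed.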

              YOLOv4: Optimal Speed and Accuracy of Object Detection

              There are a huge number of features which are said to improve Convolutional Neural Network (CNN) accuracy. Practical testing of combinations of such features on large datasets, and theoretical justification of the result, is required. Some features operate on certain models exclusively and for certain problems exclusively, or only for small-scale datasets; while some features, such as batch-normalization and residual-connections, are applicable to the majority of models, tasks, and datasets. We assume that such universal features include Weighted-Residual-Connections (WRC), Cross-Stage-Partial-connections (CSP), Cross mini-Batch Normalization (CmBN), Self-adversarial-training (SAT) and Mish-activation. We use new features: WRC, CSP, CmBN, SAT, Mish activation, Mosaic data augmentation, DropBlock regularization, and CIoU loss, and combine some of them to achieve state-of-the-art results: 43.5% AP (65.7% AP50) for the MS COCO dataset at a real-time speed of ~65 FPS on Tesla V100. Source code is at https://github.com/AlexeyAB/darknet
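Of the features listed above, the Mish activation is the most self-contained and is simple enough to sketch directly: it is x · tanh(softplus(x)), a smooth, non-monotonic alternative to ReLU. A minimal numpy version, using a numerically stable softplus:

```python
import numpy as np

def mish(x):
    """Mish activation: x * tanh(softplus(x))."""
    softplus = np.logaddexp(0.0, x)      # log(1 + e^x) without overflow
    return x * np.tanh(softplus)

xs = np.array([-2.0, 0.0, 2.0])
print(np.round(mish(xs), 4))  # small negative values pass through slightly
                              # attenuated instead of being clipped to zero
```

Unlike ReLU, mish is differentiable everywhere and lets small negative activations through, which is part of the accuracy gain the abstract attributes to it.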

                Author and article information

                Journal
                Frontiers in Plant Science (Front. Plant Sci.)
                Frontiers Media S.A.
                ISSN: 1664-462X
                Published: 09 June 2022
                Volume: 13
                Article: 872107
                Affiliations
                1. College of Mechanical and Electronic Engineering, Northwest A&F University, Xianyang, China
                2. College of Electronic and Electrical Engineering, Baoji University of Arts and Sciences, Baoji, China
                3. Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Northwest A&F University, Xianyang, China
                4. Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Northwest A&F University, Xianyang, China
                5. Faculty of Engineering, Australian Centre for Field Robotics (ACFR), The University of Sydney, Sydney, NSW, Australia
                Author notes

                Edited by: Yiannis Ampatzidis, University of Florida, United States

                Reviewed by: Brun Francois, Association de Coordination Technique Agricole, France; Harald Scherm, University of Georgia, United States

                *Correspondence: Yongliang Qiao y.qiao@acfr.usyd.edu.au

                This article was submitted to Sustainable and Intelligent Phytoprotection, a section of the journal Frontiers in Plant Science

                Article
                DOI: 10.3389/fpls.2022.872107
                PMCID: 9227981
                PMID: 35755646
                Copyright © 2022 Zhang, Qiao, Guo and He.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                Received: 09 February 2022
                Accepted: 27 April 2022
                Page count
                Figures: 6, Tables: 3, Equations: 13, References: 59, Pages: 12, Words: 7828
                Funding
                Funded by: National Key Research and Development Program of China (DOI: 10.13039/501100012166)
                Funded by: Key Research and Development Program of Ningxia (DOI: 10.13039/100016692)
                Categories
                Plant Science
                Original Research

                Plant science & Botany
                grape downy mildew, disease detection, deep learning, attention mechanism, data augmentation, digital agriculture
