
      DeepFruits: A Fruit Detection System Using Deep Neural Networks


          Abstract

This paper presents a novel approach to fruit detection using deep convolutional neural networks. The aim is to build an accurate, fast and reliable fruit detection system, a vital element of an autonomous agricultural robotic platform and a key component of fruit yield estimation and automated harvesting. Recent work in deep neural networks has led to the development of a state-of-the-art object detector termed Faster Region-based CNN (Faster R-CNN). We adapt this model, through transfer learning, for the task of fruit detection using imagery obtained from two modalities: colour (RGB) and Near-Infrared (NIR). Early and late fusion methods are explored for combining the multi-modal (RGB and NIR) information. This leads to a novel multi-modal Faster R-CNN model, which achieves state-of-the-art results compared to prior work: the F1 score, which takes into account both precision and recall, improves from 0.807 to 0.838 for the detection of sweet pepper. In addition to improved accuracy, this approach is also much quicker to deploy for new fruits, as it requires bounding-box annotation rather than pixel-level annotation (annotating bounding boxes is approximately an order of magnitude quicker to perform). The model is retrained to perform the detection of seven fruits, with the entire process of annotating and training the new model taking four hours per fruit.


                Author and article information

                Contributors
Role: Academic Editor
Journal
Sensors (Basel, Switzerland)
Publisher: MDPI
ISSN: 1424-8220
Published: 03 August 2016 (August 2016 issue)
Volume 16, Issue 8, Article 1222
                Affiliations
Science and Engineering Faculty, Queensland University of Technology, Brisbane 4000, Australia; gzy555555@gmail.com (Z.G.); feras.dayoub@qut.edu.au (F.D.); ben.upcroft@qut.edu.au (B.U.); tristan.perez@qut.edu.au (T.P.); c.mccool@qut.edu.au (C.M.)
                Author notes
[*] Correspondence: enddl22@gmail.com; Tel.: +61-449-722-415
Article
Article ID: sensors-16-01222
DOI: 10.3390/s16081222
PMCID: PMC5017387
PMID: 27527168
                © 2016 by the authors; licensee MDPI, Basel, Switzerland.

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

History
Received: 19 May 2016
Accepted: 26 July 2016
                Categories
                Article

Subject: Biomedical engineering
Keywords: visual fruit detection, deep convolutional neural network, multi-modal, rapid training, real-time performance, harvesting robots, horticulture, agricultural robotics
