19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Compact CNN for Indexing Egocentric Videos

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          While egocentric video is becoming increasingly popular, browsing it is very difficult. In this paper we present a compact 3D Convolutional Neural Network (CNN) architecture for long-term activity recognition in egocentric videos. Recognizing long-term activities enables us to temporally segment (index) long and unstructured egocentric videos. Existing methods for this task are based on hand tuned features derived from visible objects, location of hands, as well as optical flow. Given a sparse optical flow volume as input, our CNN classifies the camera wearer's activity. We obtain classification accuracy of 89%, which outperforms the current state-of-the-art by 19%. Additional evaluation is performed on an extended egocentric video dataset, classifying twice the amount of categories than current state-of-the-art. Furthermore, our CNN is able to recognize whether a video is egocentric or not with 99.2% accuracy, up by 24% from current state-of-the-art. To better understand what the network actually learns, we propose a novel visualization of CNN kernels as flow fields.

          Related collections

          Most cited references2

          • Record: found
          • Abstract: not found
          • Book Chapter: not found

          Learning to Recognize Daily Actions Using Gaze

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Novelty detection from an ego-centric perspective

              Bookmark

              Author and article information

              Journal
              2015-04-28
              2015-11-24
              Article
              10.1109/WACV.2016.7477708
              1504.07469
              e7b72242-e2ef-4df0-955c-aa8b37bbd8fd

              http://arxiv.org/licenses/nonexclusive-distrib/1.0/

              History
              Custom metadata
              cs.CV

              Computer vision & Pattern recognition
              Computer vision & Pattern recognition

              Comments

              Comment on this article