19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Deep Learning-Based Approach to Video-Based Eye Tracking for Human Psychophysics

      methods-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Real-time gaze tracking provides crucial input to psychophysics studies and neuromarketing applications. Many of the modern eye-tracking solutions are expensive mainly due to the high-end processing hardware specialized for processing infrared-camera pictures. Here, we introduce a deep learning-based approach which uses the video frames of low-cost web cameras. Using DeepLabCut (DLC), an open-source toolbox for extracting points of interest from videos, we obtained facial landmarks critical to gaze location and estimated the point of gaze on a computer screen via a shallow neural network. Tested for three extreme poses, this architecture reached a median error of about one degree of visual angle. Our results contribute to the growing field of deep-learning approaches to eye-tracking, laying the foundation for further investigation by researchers in psychophysics or neuromarketing.

          Related collections

          Most cited references17

          • Record: found
          • Abstract: found
          • Article: not found

          DeepLabCut: markerless pose estimation of user-defined body parts with deep learning

          Quantifying behavior is crucial for many applications in neuroscience. Videography provides easy methods for the observation and recording of animal behavior in diverse settings, yet extracting particular aspects of a behavior for further analysis can be highly time consuming. In motor control studies, humans or other animals are often marked with reflective markers to assist with computer-based tracking, but markers are intrusive, and the number and location of the markers must be determined a priori. Here we present an efficient method for markerless pose estimation based on transfer learning with deep neural networks that achieves excellent results with minimal training data. We demonstrate the versatility of this framework by tracking various body parts in multiple species across a broad collection of behaviors. Remarkably, even when only a small number of frames are labeled (~200), the algorithm achieves excellent tracking performance on test frames that is comparable to human accuracy.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Using DeepLabCut for 3D markerless pose estimation across species and behaviors

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              General theory of remote gaze estimation using the pupil center and corneal reflections.

              This paper presents a general theory for the remote estimation of the point-of-gaze (POG) from the coordinates of the centers of the pupil and corneal reflections. Corneal reflections are produced by light sources that illuminate the eye and the centers of the pupil and corneal reflections are estimated in video images from one or more cameras. The general theory covers the full range of possible system configurations. Using one camera and one light source, the POG can be estimated only if the head is completely stationary. Using one camera and multiple light sources, the POG can be estimated with free head movements, following the completion of a multiple-point calibration procedure. When multiple cameras and multiple light sources are used, the POG can be estimated following a simple one-point calibration procedure. Experimental and simulation results suggest that the main sources of gaze estimation errors are the discrepancy between the shape of real corneas and the spherical corneal shape assumed in the general theory, and the noise in the estimation of the centers of the pupil and corneal reflections. A detailed example of a system that uses the general theory to estimate the POG on a computer screen is presented.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Hum Neurosci
                Front Hum Neurosci
                Front. Hum. Neurosci.
                Frontiers in Human Neuroscience
                Frontiers Media S.A.
                1662-5161
                21 July 2021
                2021
                : 15
                : 685830
                Affiliations
                [1] 1Cognitive Neuroscience Lab, German Primate Center – Leibniz Institute for Primate Research , Göttingen, Germany
                [2] 2Faculty of Biology and Psychology, University of Göttingen , Göttingen, Germany
                [3] 3Bernstein Center for Computational Neuroscience , Göttingen, Germany
                [4] 4Leibniz-ScienceCampus Primate Cognition , Göttingen, Germany
                Author notes

                Edited by: Adonis Moschovakis, University of Crete, Greece

                Reviewed by: Andreas Antonios Kardamakis, Karolinska Institutet (KI), Sweden; Eduard Ort, Heinrich Heine University of Düsseldorf, Germany

                *Correspondence: Niklas Zdarsky, niklas.zd@ 123456posteo.de

                This article was submitted to Cognitive Neuroscience, a section of the journal Frontiers in Human Neuroscience

                Article
                10.3389/fnhum.2021.685830
                8333872
                34366813
                b57b51a3-d4f1-4e63-bee6-eeaaae706e69
                Copyright © 2021 Zdarsky, Treue and Esghaei.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 25 March 2021
                : 29 June 2021
                Page count
                Figures: 5, Tables: 1, Equations: 4, References: 18, Pages: 8, Words: 0
                Funding
                Funded by: Deutsche Forschungsgemeinschaft 10.13039/501100001659
                Categories
                Neuroscience
                Methods

                Neurosciences
                deep learning,eye tracking,gaze tracking,artificial intelligence,deeplabcut,human psychophysics

                Comments

                Comment on this article