13
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Phishing as one of the most well-known cybercrime activities is a deception of online users to steal their personal or confidential information by impersonating a legitimate website. Several machine learning-based strategies have been proposed to detect phishing websites. These techniques are dependent on the features extracted from the website samples. However, few studies have actually considered efficient feature selection for detecting phishing attacks. In this work, we investigate an agreement on the definitive features which should be used in phishing detection. We apply Fuzzy Rough Set (FRS) theory as a tool to select most effective features from three benchmarked data sets. The selected features are fed into three often used classifiers for phishing detection. To evaluate the FRS feature selection in developing a generalizable phishing detection, the classifiers are trained by a separate out-of-sample data set of 14,000 website samples. The maximum F-measure gained by FRS feature selection is 95% using Random Forest classification. Also, there are 9 universal features selected by FRS over all the three data sets. The F-measure value using this universal feature set is approximately 93% which is a comparable result in contrast to the FRS performance. Since the universal feature set contains no features from third-part services, this finding implies that with no inquiry from external sources, we can gain a faster phishing detection which is also robust toward zero-day attacks.

          Related collections

          Most cited references12

          • Record: found
          • Abstract: not found
          • Article: not found

          ROUGH FUZZY SETS AND FUZZY ROUGH SETS*

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            A wrapper approach for feature selection based on Bat Algorithm and Optimum-Path Forest

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Utilisation of website logo for phishing detection

                Bookmark

                Author and article information

                Journal
                13 March 2019
                Article
                1903.05675
                960d1046-34f8-4210-b2ec-c4789734d8c7

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Preprint of accepted paper in IEEE International Conference on Fuzzy Systems 2019
                cs.LG cs.CR stat.ML

                Security & Cryptology,Machine learning,Artificial intelligence
                Security & Cryptology, Machine learning, Artificial intelligence

                Comments

                Comment on this article