339
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Decision tree methods: applications for classification and prediction Translated title: 用于分类与预测的决策树分析

      methods-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Summary

          Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure.

          Translated abstract

          概述

          决策树是一种常用的数据挖掘方法,用于多变量分析时建立分类系统或制定预测结果变量的算法。此方法将一个数据群分割成分枝状节段,构造出包括根节点、内部节点和叶节点的倒置形树状模型。该算法运用非参数方法,不需要套用任何复杂的参数模型就能有效地处理大型复杂的数据库。当样本足够大时,可将研究数据分为训练数据集和验证数据集。使用训练数据集构建决策树模型,使用验证数据集来决定树的适合大小,以获得最优模型。本文介绍了构建决策树的常用算法(包括CART,C4.5,CHAID和QUEST),并描述了SPSS和SAS软件中将树结构可视化的程序。

          中文全文

          本文全文中文版从2015年X月X日起在 http://dx.doi.org/10.11919/j.issn.1002-0829.XXXXXX可供免费阅览下载

          Related collections

          Most cited references30

          • Record: found
          • Abstract: not found
          • Article: not found

          Regression Trees for Censored Data

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Relative risk trees for censored survival data.

            A method is developed for obtaining tree-structured relative risk estimates for censored survival data. The first step of a full likelihood estimation procedure is used in a recursive partitioning algorithm that adopts most aspects of the widely used Classification and Regression Tree (CART) algorithm of Breiman et al. (1984, Classification and Regression Trees, Belmont, California: Wadsworth). The performance of the technique is investigated through stimulation and compared to the tree-structured survival methods proposed by Davis and Anderson (1989, Statistics in Medicine 8, 947-961) and Therneau, Grambsch, and Fleming (1990, Biometrika 77, 147-160).
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              C4. 5: programs for machine learning

                Bookmark

                Author and article information

                Journal
                Shanghai Arch Psychiatry
                Shanghai Arch Psychiatry
                SAP
                Shanghai Archives of Psychiatry
                Shanghai Municipal Bureau of Publishing (Shanghai, China )
                1002-0829
                25 April 2015
                : 27
                : 2
                : 130-135
                Affiliations
                [1] 1Department of Pharmacology and Biostatistics, Institute of Medical Sciences, Shanghai Jiao Tong University School of Medicine, Shanghai, China
                [2] 2Division of Biostatistics, Department of Health Research and Policy, Stanford University, Stanford, CA, USA
                [3] 3Veterans Affairs Cooperative Studies Program Palo Alto Coordinating Center, the VA Palo Alto Health Care System, Palo Alto, CA, USA
                Author notes
                [* ]correspondence: yanyansong@ 123456sjtu.edu.cn (Yan-yan SONG);
                Article
                sap-27-02-130
                10.11919/j.issn.1002-0829.215044
                4466856
                26120265
                3e88d0f0-2dc2-4efa-9327-23c55a5af863
                Copyright © 2015 by Shanghai Municipal Bureau of Publishing

                This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/

                History
                : 01 April 2015
                : 09 April 2015
                Funding
                Funded by: The author received no funding for the preparation of this report.
                Categories
                Biostatistics in Psychiatry (26)

                decision tree,data mining,classification,prediction
                decision tree, data mining, classification, prediction

                Comments

                Comment on this article