58
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The importance of rating scales in measuring patient-reported outcomes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          A critical component that influences the measurement properties of a patient-reported outcome (PRO) instrument is the rating scale. Yet, there is a lack of general consensus regarding optimal rating scale format, including aspects of question structure, the number and the labels of response categories. This study aims to explore the characteristics of rating scales that function well and those that do not, and thereby develop guidelines for formulating rating scales.

          Methods

          Seventeen existing PROs designed to measure vision-related quality of life dimensions were mailed for self-administration, in sets of 10, to patients who were on a waiting list for cataract extraction. These PROs included questions with ratings of difficulty, frequency, severity, and global ratings. Using Rasch analysis, performance of rating scales were assessed by examining hierarchical ordering (indicating categories are distinct from each other and follow a logical transition from lower to higher value), evenness (indicating relative utilization of categories), and range (indicating coverage of the attribute by the rating scale).

          Results

          The rating scales with complicated question format, a large number of response categories, or unlabelled categories, tended to be dysfunctional. Rating scales with five or fewer response categories tended to be functional. Most of the rating scales measuring difficulty performed well. The rating scales measuring frequency and severity demonstrated hierarchical ordering but the categories lacked even utilization.

          Conclusion

          Developers of PRO instruments should use a simple question format, fewer (four to five) and labelled response categories.

          Related collections

          Most cited references35

          • Record: found
          • Abstract: found
          • Article: not found

          Optimizing rating scale category effectiveness.

          Rating scales are employed as a means of extracting more information out of an item than would be obtained from a mere "yes/no", "right/wrong" or other dichotomy. But does this additional information increase measurement accuracy and precision? Eight guidelines are suggested to aid the analyst in optimizing the manner in which rating scales categories cooperate in order to improve the utility of the resultant measures. Though these guidelines are presented within the context of Rasch analysis, they reflect aspects of rating scale functioning which impact all methods of analysis. The guidelines feature rating-scale-based data such as category frequency, ordering, rating-to-measure inferential coherence, and the quality of the scale from measurement and statistical perspectives. The manner in which the guidelines prompt recategorization or reconceptualization of the rating scale is indicated. Utilization of the guidelines is illustrated through their application to two published data sets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Optimal number of response categories in rating scales: reliability, validity, discriminating power, and respondent preferences.

            Using a self-administered questionnaire, 149 respondents rated service elements associated with a recently visited store or restaurant on scales that differed only in the number of response categories (ranging from 2 to 11) and on a 101-point scale presented in a different format. On several indices of reliability, validity, and discriminating power, the two-point, three-point, and four-point scales performed relatively poorly, and indices were significantly higher for scales with more response categories, up to about 7. Internal consistency did not differ significantly between scales, but test-retest reliability tended to decrease for scales with more than 10 response categories. Respondent preferences were highest for the 10-point scale, closely followed by the seven-point and nine-point scales. Implications for research and practice are discussed.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Remediating serious flaws in the National Eye Institute Visual Function Questionnaire.

              To test the assumption that the National Eye Institute Visual Function Questionnaire (NEI VFQ) measures visual functioning, assess the validity of its subscales, and, if flawed, revise the questionnaire and derive a shortened version with sound psychometric properties. Flinders Medical Centre, Adelaide, Australia. Patients from the cataract surgery waiting list self-administered and completed the 39-item NEI VFQ (NEI VFQ-39). Rasch analysis was applied, and the psychometric performance of the entire questionnaire and each subscale was tested. Instrument revision was performed in the context of Rasch analysis statistics. Five hundred thirty-six patients (mean age 73.8 years) completed the questionnaire. Response categories for 2 question types were not used as intended so dysfunctional categories were combined. The NEI VFQ-39 and the 25-item version (NEI VFQ-25) had good precision but evidence of multidimensionality (more than 1 construct in 1 score), questions that did not fit the construct, suboptimum targeting of item difficulty to person ability, and dysfunctional subscales (8 NEI VFQ-39; 12 NEI VFQ-25). Questions could be reorganized into 2 constructs (a visual functioning scale and a socioemotional scale) that, after misfitting questions were removed, gave valid measurement of each construct and preserved 3 subscales. Removing redundancy from these long-form subscales yielded valid short-form scales. Several NEI VFQ subscales were not psychometrically sound; as an overall measure, it is flawed by multidimensionality. This was repaired by segregation into visual functioning and socioemotional scales. Valid long and short forms of the scales could enhance application of the questionnaire. (c) 2010 ASCRS and ESCRS. All rights reserved.
                Bookmark

                Author and article information

                Contributors
                Journal
                Health Qual Life Outcomes
                Health Qual Life Outcomes
                Health and Quality of Life Outcomes
                BioMed Central
                1477-7525
                2012
                13 July 2012
                : 10
                : 80
                Affiliations
                [1 ]NH & MRC Centre for Clinical Eye Research, Discipline of Optometry and Vision Science, Flinders Medical Centre and Flinders University of South Australia, Adelaide, South Australia, 5042, Australia
                [2 ]Meera and L B Deshpande Centre for Sight Enhancement, Vision Rehabilitation Centres, L V Prasad Eye Institute, Hyderabad, India
                [3 ]Centre for Eye Research Australia, Department of Ophthalmology, University of Melbourne, Victoria, 8002, Australia
                [4 ]Singapore National Eye Centre, Singapore Eye Research Institute, Singapore, Singapore
                Article
                1477-7525-10-80
                10.1186/1477-7525-10-80
                3503574
                22794788
                952075db-a2d2-44db-94a9-b941376b960a
                Copyright ©2012 Khadka et al.; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 11 February 2012
                : 27 June 2012
                Categories
                Research

                Health & Social care
                rasch analysis,response categories,rating scales,patient reported outcomes,quality of life

                Comments

                Comment on this article