The importance of rating scales in measuring patient-reported outcomes

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Background

A critical component that influences the measurement properties of a patient-reported outcome (PRO) instrument is the rating scale. Yet, there is a lack of general consensus regarding optimal rating scale format, including aspects of question structure, the number and the labels of response categories. This study aims to explore the characteristics of rating scales that function well and those that do not, and thereby develop guidelines for formulating rating scales.

Methods

Seventeen existing PROs designed to measure vision-related quality of life dimensions were mailed for self-administration, in sets of 10, to patients who were on a waiting list for cataract extraction. These PROs included questions with ratings of difficulty, frequency, severity, and global ratings. Using Rasch analysis, performance of rating scales were assessed by examining hierarchical ordering (indicating categories are distinct from each other and follow a logical transition from lower to higher value), evenness (indicating relative utilization of categories), and range (indicating coverage of the attribute by the rating scale).

Results

The rating scales with complicated question format, a large number of response categories, or unlabelled categories, tended to be dysfunctional. Rating scales with five or fewer response categories tended to be functional. Most of the rating scales measuring difficulty performed well. The rating scales measuring frequency and severity demonstrated hierarchical ordering but the categories lacked even utilization.

Conclusion

Developers of PRO instruments should use a simple question format, fewer (four to five) and labelled response categories.

Related collections

Most cited references 35

Record: found
Abstract: found
Article: not found

Optimizing rating scale category effectiveness.

John M Linacre (2002)

Rating scales are employed as a means of extracting more information out of an item than would be obtained from a mere "yes/no", "right/wrong" or other dichotomy. But does this additional information increase measurement accuracy and precision? Eight guidelines are suggested to aid the analyst in optimizing the manner in which rating scales categories cooperate in order to improve the utility of the resultant measures. Though these guidelines are presented within the context of Rasch analysis, they reflect aspects of rating scale functioning which impact all methods of analysis. The guidelines feature rating-scale-based data such as category frequency, ordering, rating-to-measure inferential coherence, and the quality of the scale from measurement and statistical perspectives. The manner in which the guidelines prompt recategorization or reconceptualization of the rating scale is indicated. Utilization of the guidelines is illustrated through their application to two published data sets.

0 comments Cited 186 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Optimal number of response categories in rating scales: reliability, validity, discriminating power, and respondent preferences.

Jill Preston, Michael A. Colman (2000)

Using a self-administered questionnaire, 149 respondents rated service elements associated with a recently visited store or restaurant on scales that differed only in the number of response categories (ranging from 2 to 11) and on a 101-point scale presented in a different format. On several indices of reliability, validity, and discriminating power, the two-point, three-point, and four-point scales performed relatively poorly, and indices were significantly higher for scales with more response categories, up to about 7. Internal consistency did not differ significantly between scales, but test-retest reliability tended to decrease for scales with more than 10 response categories. Respondent preferences were highest for the 10-point scale, closely followed by the seven-point and nine-point scales. Implications for research and practice are discussed.

0 comments Cited 118 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Remediating serious flaws in the National Eye Institute Visual Function Questionnaire.

Konrad Pesudovs, Vijaya K Gothwal, Thomas Wright … (2010)

To test the assumption that the National Eye Institute Visual Function Questionnaire (NEI VFQ) measures visual functioning, assess the validity of its subscales, and, if flawed, revise the questionnaire and derive a shortened version with sound psychometric properties. Flinders Medical Centre, Adelaide, Australia. Patients from the cataract surgery waiting list self-administered and completed the 39-item NEI VFQ (NEI VFQ-39). Rasch analysis was applied, and the psychometric performance of the entire questionnaire and each subscale was tested. Instrument revision was performed in the context of Rasch analysis statistics. Five hundred thirty-six patients (mean age 73.8 years) completed the questionnaire. Response categories for 2 question types were not used as intended so dysfunctional categories were combined. The NEI VFQ-39 and the 25-item version (NEI VFQ-25) had good precision but evidence of multidimensionality (more than 1 construct in 1 score), questions that did not fit the construct, suboptimum targeting of item difficulty to person ability, and dysfunctional subscales (8 NEI VFQ-39; 12 NEI VFQ-25). Questions could be reorganized into 2 constructs (a visual functioning scale and a socioemotional scale) that, after misfitting questions were removed, gave valid measurement of each construct and preserved 3 subscales. Removing redundancy from these long-form subscales yielded valid short-form scales. Several NEI VFQ subscales were not psychometrically sound; as an overall measure, it is flawed by multidimensionality. This was repaired by segregation into visual functioning and socioemotional scales. Valid long and short forms of the scales could enhance application of the questionnaire. (c) 2010 ASCRS and ESCRS. All rights reserved.

0 comments Cited 71 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Jyoti Khadka

Vijaya K Gothwal

Colm McAlinden

Ecosse L Lamoureux

Konrad Pesudovs

Journal

Journal ID (nlm-ta): Health Qual Life Outcomes

Journal ID (iso-abbrev): Health Qual Life Outcomes

Title: Health and Quality of Life Outcomes

Publisher: BioMed Central

ISSN (Electronic): 1477-7525

Publication date Collection: 2012

Publication date (Electronic): 13 July 2012

Volume: 10

Page: 80

Affiliations

[1 ]NH & MRC Centre for Clinical Eye Research, Discipline of Optometry and Vision Science, Flinders Medical Centre and Flinders University of South Australia, Adelaide, South Australia, 5042, Australia

[2 ]Meera and L B Deshpande Centre for Sight Enhancement, Vision Rehabilitation Centres, L V Prasad Eye Institute, Hyderabad, India

[3 ]Centre for Eye Research Australia, Department of Ophthalmology, University of Melbourne, Victoria, 8002, Australia

[4 ]Singapore National Eye Centre, Singapore Eye Research Institute, Singapore, Singapore

Article

Publisher ID: 1477-7525-10-80

DOI: 10.1186/1477-7525-10-80

PMC ID: 3503574

PubMed ID: 22794788

SO-VID: 952075db-a2d2-44db-94a9-b941376b960a

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

The importance of rating scales in measuring patient-reported outcomes

Read this article at

Abstract

Background

Methods

Results

Conclusion

Related collections

Tobacco Prevention & Cessation

Most cited references 35

Optimizing rating scale category effectiveness.

Optimal number of response categories in rating scales: reliability, validity, discriminating power, and respondent preferences.

Remediating serious flaws in the National Eye Institute Visual Function Questionnaire.

Author and article information

Contributors

Journal

Affiliations

Article

History

Categories

Comments

Comment on this article

Similar content 97

Cited by 48

Most referenced authors 156