Polycystic ovary syndrome is one of the most common types of endocrine and metabolic diseases in women of reproductive age that needs to be screened early and assessed non-invasively. The objective of the current study was to develop prediction models for polycystic ovary syndrome based on data of tongue and pulse using machine learning techniques.
A dataset of 285 polycystic ovary syndrome patients and 201 healthy women were investigated to identify the significant tongue and pulse parameters for predicting polycystic ovary syndrome. In this study, feature selection was performed using least absolute shrinkage and selection operator regression. Several machine learning algorithms (multilayer perceptron classifier, eXtreme gradient boosting classifier, and support vector machine) were used to construct the classification models to predict the presence of polycystic ovary syndrome.
TB-L, TB-a, TB-b, TC-L, TC-a, h 3, and h 4/h 1 in tongue and pulse parameters were statistically associated with polycystic ovary syndrome presence. Among the several machine learning techniques, the support vector machine model was optimal for the comprehensive evaluation of this dataset and deduced the area under the receiver operating characteristic curve, DeLong test, calibration curve, and decision curve analysis.