Cannabis sativa L., known for its medicinal and psychoactive properties, has recently experienced rapid market expansion but remains understudied in terms of its fundamental biology due to historical prohibitions. This pioneering study implements GS and ML to optimize cannabinoid profiles in cannabis breeding. We analyzed a representative population of drug‐type cannabis accessions, quantifying major cannabinoids and utilizing high‐density genotyping with 250K SNPs for GS. Our evaluations of various models—including ML algorithms, statistical methods, and Bayesian approaches—highlighted Random Forest's superior predictive accuracy for single and multi‐trait genomic predictions, particularly for THC, CBD, and their precursors. Multi‐trait analyses elucidated complex genetic interdependencies and identified key loci crucial to cannabinoid biosynthesis. These results demonstrate the efficacy of integrating GS and ML in developing cannabis varieties with tailored cannabinoid profiles.
This study pioneers the integration of genomic selection and machine learning to optimize cannabinoid profiles in Cannabis sativa L., demonstrating that Random Forest models effectively predict genetic traits critical for cannabinoid biosynthesis. These findings provide a foundation for breeding cannabis varieties with customized medicinal and psychoactive properties, facilitating advancements in cannabis research and cultivation.
See how this article has been cited at scite.ai
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.