
      Re: ChatGPT encounters multiple opportunities and challenges in neurosurgery

      research-article
[…], MD a; […], MD b; […], PhD b; […], PhD b
      International Journal of Surgery (London, England)
      Lippincott Williams & Wilkins



Most cited references (7)

          Is ChatGPT an Evidence-based Doctor?

            Evaluating large language models on a highly-specialized topic, radiation oncology physics

Purpose: We present the first study to investigate Large Language Models (LLMs) in answering radiation oncology physics questions. Because popular exams like AP Physics, the LSAT, and the GRE have large test-taker populations and ample test-preparation resources in circulation, they may not allow for accurately assessing the true potential of LLMs. This paper proposes evaluating LLMs on a highly specialized topic, radiation oncology physics, which may be more pertinent to scientific and medical communities in addition to being a valuable benchmark of LLMs.

Methods: We developed an exam consisting of 100 radiation oncology physics questions based on our expertise. Four LLMs, ChatGPT (GPT-3.5), ChatGPT (GPT-4), Bard (LaMDA), and BLOOMZ, were evaluated against medical physicists and non-experts. The performance of ChatGPT (GPT-4) was further explored by asking it to explain first, then answer. The deductive reasoning capability of ChatGPT (GPT-4) was evaluated using a novel approach (substituting the correct answer with "None of the above choices is the correct answer."). A majority-vote analysis was used to approximate how well each group could score when working together.

Results: ChatGPT (GPT-4) outperformed all other LLMs and medical physicists, on average, with improved accuracy when prompted to explain before answering. ChatGPT (GPT-3.5 and GPT-4) showed a high level of consistency in its answer choices across a number of trials, whether correct or incorrect, a characteristic that was not observed in the human test groups or Bard (LaMDA). In evaluating deductive reasoning ability, ChatGPT (GPT-4) demonstrated surprising accuracy, suggesting the potential presence of an emergent ability. Finally, although ChatGPT (GPT-4) performed well overall, its intrinsic properties did not allow for further improvement when scoring was based on a majority vote across trials. In contrast, a team of medical physicists was able to greatly outperform ChatGPT (GPT-4) using a majority vote.

Conclusion: This study suggests a great potential for LLMs to work alongside radiation oncology experts as highly knowledgeable assistants.

              Emergency surgery in the era of artificial intelligence: ChatGPT could be the doctor’s right-hand man


                Author and article information

                Contributors
Journal
Int J Surg (JS9)
International Journal of Surgery (London, England)
Lippincott Williams & Wilkins (Hagerstown, MD)
ISSN: 1743-9191; 1743-9159
December 2023 (published online 13 September 2023)
Volume 109, Issue 12, Pages 4393-4394
                Affiliations
[a ]Department of Pathology, Ningbo Clinical Pathology Diagnosis Center, Ningbo City, Zhejiang Province, People’s Republic of China
                [b ]Department of Urology, Institute of Urology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, People’s Republic of China
                Author notes
[* ]Corresponding authors. Address: Ningbo Clinical Pathology Diagnosis Center, Ningbo City, Zhejiang Province, People’s Republic of China. Tel.: +86 183 13736646. E-mail: qingxinyu0220@163.com (Q.-x. Yu); Department of Urology, Institute of Urology, West China Hospital, Sichuan University, Chengdu City 610041, Sichuan Province, China. Tel.: +86 288 5422444; fax: +86 288 5422451. E-mail: dengxiongliwch@163.com (D.-x. Li).
Article
IJS-D-23-01825 00083
DOI: 10.1097/JS9.0000000000000749
PMCID: PMC10720816
PMID: 37720947
                Copyright © 2023 The Author(s). Published by Wolters Kluwer Health, Inc.

                This is an open access article distributed under the Creative Commons Attribution-ShareAlike License 4.0, which allows others to remix, tweak, and build upon the work, even for commercial purposes, as long as the author is credited and the new creations are licensed under the identical terms. http://creativecommons.org/licenses/by-sa/4.0/

History
Received: 24 August 2023
Accepted: 25 August 2023
                Categories
                Correspondence

                Surgery
