Open this publication in new window or tab >>Show others...
2026 (English)In: JPRAS Open, E-ISSN 2352-5878, Vol. 48, p. 669-678Article in journal (Refereed) Published
Abstract [en]
INTRODUCTION: Large language models (LLMs) are increasingly used in clinical communication, but their accuracy and readability in patient education remain unclear. This study compared three LLMs for preoperative counseling before a DIEP breast reconstruction.
METHODS: A total of 40 frequently asked preoperative questions regarding DIEP breast reconstruction were collected and categorized using the BREAST-Q framework. These were submitted in English to three LLMs: ChatGPT, Gemini and Copilot (anonymized as Model A-C). Each question was submitted to all three models and the responses were anonymized. An expert panel of eight board-certified plastic surgeons from both Europe and USA. Ratings were made of a 5-point Likert scale for accuracy, informativeness and readability. Together with a general evaluation (easiness, problematic content, incorrectness) and information-material specific evaluation (relevance and lowest reading level).
RESULTS: Significant differences were found between models across all domains. ChatGPT achieved the highest accuracy (p = 0.019), Copilot was the most informative (p = 0.041), and both ChatGPT and Copilot produced more readable responses than Gemini (p < 0.001). Copilot had fewer problematic statements, while Gemini generated text at the simplest reading level but with lower accuracy. Agreement among raters was strong for accuracy (κ = 0.96) but weak for qualitative domains.
CONCLUSION: Each LLM showed distinct strength ChatGPT produced the most accurate answers, Copilot the most informative, and Gemini the simplest language. No model was uniformly superior. These findings support supervised, task-specific use of LLMs in patient education for breast reconstruction.
Place, publisher, year, edition, pages
Elsevier, 2026
Keywords
AI, Artificial intelligence, Breast reconstruction, DIEP, Large language models, Microsurgery
National Category
Surgery
Identifiers
urn:nbn:se:oru:diva-127043 (URN)10.1016/j.jpra.2025.12.029 (DOI)001678606800001 ()41631018 (PubMedID)
2026-02-032026-02-032026-02-11Bibliographically approved