SinghalK., TuT., GottweisJ., et al.Toward expert-level medical question answering with large language models. Nature Medicine, 31:943–950. (2025).
2.
GaberF., ShaikM., AllegaF., et al.Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis. Njp Digital Medicine, 8, 263. (2025).
3.
GohE., GalloR.J., StrongE., et al.GPT-4 assistance for improvement of physician performance on patient care tasks: a randomized controlled trial. Nature Medicine, 31:1233–1238. (2025).
4.
HuoB., BoyleA., MarfoN., et al.Large language models for chatbot health advice studies. JAMA Network Open, 8(2) :e2457879. (2025).
5.
The CHART Collaborative. Reporting guideline for chatbot health advice studies: The CHART statement. JAMA Network Open, 8(8) :e2530220 (2025).