Evaluating the Accuracy and Limitations of Large Language Models in Medical Evidence Summarization
A recent study systematically examines the capabilities and limitations of large language models, specifically GPT-3.5 and ChatGPT, in performing zero-shot medical evidence summarization across six clinical domains, revealing potential risks…