
medRxiv (Cold Spring Harbor Laboratory), Год журнала: 2024, Номер unknown
Опубликована: Сен. 14, 2024
Abstract Symptom-Assessment Application (SAAs, e.g., NHS 111 online) that assist medical laypeople in deciding if and where to seek care ( self-triage ) are gaining popularity their accuracy has been examined numerous studies. With the public release of Large Language Models (LLMs, ChatGPT), use such decision-making processes is growing as well. However, there currently no comprehensive evidence synthesis for LLMs, review contextualized SAAs LLMs relative users. Thus, this systematic evaluates both compares them laypeople. A total 1549 studies were screened, with 19 included final analysis. The was found be moderate but highly variable (11.5 – 90.0%), while (57.8 76.0%) (47.3 62.4%) low variability. Despite some published recommendations standardize evaluation methodologies, remains considerable heterogeneity among should not universally recommended or discouraged; rather, utility assessed based on specific case tool under consideration.
Язык: Английский