
Diagnostics, Journal Year: 2024, Volume and Issue: 14(24), P. 2835 - 2835
Published: Dec. 17, 2024
Background: Orofacial pain (OFP) encompasses a complex array of conditions affecting the face, mouth, and jaws, often leading to significant diagnostic challenges high rates misdiagnosis. Artificial intelligence, particularly large language models like GPT4 (OpenAI, San Francisco, CA, USA), offers potential as aid in healthcare settings. Objective: To evaluate accuracy OFP cases clinical decision support system (CDSS) compare its performance against treating clinicians, expert evaluators, medical students, general practitioners. Methods: A total 100 anonymized patient case descriptions involving diverse were collected. was prompted generate primary differential diagnoses for each using International Classification Pain (ICOP) criteria. Diagnoses compared gold-standard established by scoring used assess at three hierarchical ICOP levels. subset 24 also evaluated two experts, final-year practitioners comparative analysis. Diagnostic interrater reliability calculated. Results: achieved highest level (ICOP 3) 38% cases, with an overall score 157 out 300 points (52%). The model provided accurate 80% (400 500 points). In model’s comparable non-expert human evaluators but surpassed who correctly diagnosed 54% 3. demonstrated specific categories, diagnosing 81% trigeminal neuralgia Interrater between low (κ = 0.219, p < 0.001), indicating variability agreement. Conclusions: shows promise CDSS improving offering structured diagnoses. While not yet outperforming can augment workflows, care or educational Effective integration into practice requires adherence rigorous guidelines, thorough validation, ongoing professional oversight ensure safety reliability.
Language: Английский