
BMC Oral Health, Год журнала: 2025, Номер 25(1)
Опубликована: Май 21, 2025
With the growing capabilities of language models like ChatGPT to process text and images, this study evaluated their accuracy in detecting supernumerary teeth on periapical radiographs. A customized GPT-4V model (CGPT-4V) was also developed assess whether domain-specific training could improve diagnostic performance compared standard GPT-4o models. One hundred eighty radiographs (90 with 90 without teeth) were using GPT-4 V, GPT-4o, a fine-tuned CGPT-4V model. Each image assessed separately standardized prompt "Are there any radiograph above?" avoid contextual bias. Three dental experts scored responses three-point Likert scale for positive cases binary negatives. Chi-square tests ROC analysis used compare performances (p < 0.05). Among three models, CGPT-4 V exhibited highest accuracy, correctly 91% cases, 77% 63% GPT-4V. The demonstrated significantly lower false rate (16%) than (42%). statistically significant difference found between 0.001), while no observed or GPT-4o. Additionally, successfully identified multiple where present. These findings highlight potential GPT radiology. Future research should focus multicenter validation, seamless clinical integration, cost-effectiveness support real-world implementation.
Язык: Английский