Evaluation of the Readability, Understandability, and Accuracy of Artificial Intelligence Chatbots in Terms of Biostatistics Literacy DOI Open Access
İlkay Doğan, Pınar Günel Karadeniz, İhsan BERK

и другие.

European Journal of Therapeutics, Год журнала: 2024, Номер 30(6), С. 900 - 909

Опубликована: Дек. 31, 2024

Objective: Chatbots have been frequently used in many different areas recent years, such as diagnosis and imaging, treatment, patient follow-up support, health promotion, customer service, sales, marketing, information technical support. The aim of this study is to evaluate the readability, comprehensibility, accuracy queries made by researchers field through artificial intelligence chatbots biostatistics. Methods: A total 10 questions from topics asked basic biostatistics were determined 4 experts. addressed one experts answers recorded. In study, free versions most widely preferred ChatGPT4, Gemini Copilot used. recorded independently evaluated “Correct”, “Partially correct” “Wrong” three who blinded which chatbot belonged to. Then, these came together examined final evaluation reaching a consensus on levels accuracy. readability understandability with Ateşman formula, Sönmez Çetinkaya-Uzun formula Bezirci-Yılmaz formulas. Results: According given chatbots, it was that at “difficult” level according “insufficient reading level” “academic formula. On other hand, gave result “the text understandable” for all chatbots. It there no statistically significant difference (p=0.819) terms rates questions. Conclusion: although tended provide accurate information, not readable, understandable their high.

Язык: Английский

Emerging Insights in Dental Trauma: Exploring Potential Risk Factors, Innovations, and Preventive Strategies DOI Open Access

Ana Beatriz Carvalho de Souza Cantão,

Liran Levin

Dental Traumatology, Год журнала: 2025, Номер 41(2), С. 129 - 132

Опубликована: Март 14, 2025

Several factors are associated with dental trauma (DT) occurrence, and these well-established in the literature [1, 2]. However, new studies hypotheses suggest that a lack of balance is positively falls children older individuals [3, 4]. In this issue Lunardelli et al. investigated relationship between orthostatic occurrence DT 6-year-old school Brazil [5]. The study emphasizes connection reduced an increased risk falls, identifying it as factor etiology children. This highlights need for preventive measures focused on creating safer environments, particularly schools, involving multidisciplinary team to support school-aged minimizing trauma. Traumatic injuries (TDI) highly prevalent during childhood, primary dentition [6, 7]. Although TDI recognized common issue, influencing its prevalence preschool-aged have been poorly understood [8, 9]. Rivera López through cohort 4-year-old from South [10]. Using directed acyclic graphs (DAGs), researchers developed theoretical model explore complex relationships among potential factors, including demographic, behavioral, environmental variables. provides insights into multifactorial nature young children, offering foundation targeted prevention early intervention strategies. By applying advanced analytical techniques, contribute deeper understanding causes broader implications pediatric oral health. Autotransplanted teeth demonstrated remarkable survival rate over 95% [11-13]. standardization precision quantifying root development results inconsistent findings impedes comparisons studies. Traditional analysis methods based two-dimensional radiographs shows significant limitations, such image overlap, patient positioning challenges, low measurement accuracy. Cone-beam computed tomography (CBCT) might provide more accurate detailed analysis, regions like periapical areas [14]. Beltrame proposed methodology assessing measuring length using CBCT 12 patients [15]. highlighted CBCT's superiority conventional radiography evaluating after autogenous tooth transplants. method, any radiographic tool, should be assessed terms radiation risks vs. suggested benefits. Advances 3D-printed replicas improvements efficiency procedures, autotransplantation [16-18]. most evidence supporting outcomes comes observational studies, case reports, limited number case–control Consequently, there insufficient data controlled clinical trials biological efficacy (i.e., long-term radiological outcomes), limiting impact technologies. Lejnieks combined 3D replica protocol trial. goal was investigate 1 year follow-up, providing stronger benefits technologies molar [19]. investigation underscores protocols enhance surgical serve valuable training tools, calling further address current limitations their applications practice. Artificial Intelligence (AI) has emerged resource, information clinicians seeking online healthcare knowledge medical decision-making [20, 21]. rigorous evaluations accuracy consistency responses provided by AI Google Gemini, context managing traumatized permanent teeth, still lacking. hold transform access information, concerns remain regarding reliability biases. Previous tools Bard, experimental version less consistent compared other models, ChatGPT, topics related endodontics [22]. Portilla evaluated about management Gemini those experienced academic endodontists. set predefined questions, they conducted comprehensive comparison [23]. suggests become accessible tool professionals. improving database algorithms crucial enhancing topics. Advancing refinement essential ensure reliable robust future. Furthermore Johnson aimed assess validity chatbots, Bing, ChatGPT 3.5, Claude AI, addressing frequently asked questions [24]. Ensuring secure distribution field trauma, requires authorities establish clear guidelines regulations governing chatbot use. Collaborative efforts ethical platforms. Public remains [25, 26]. advancements now able deliver public health helps professionals deal issues avulsions. Large language models (LLMs) offer education, however, since fine-tuned human feedback, may biased or incorrect, answers avulsion [20]. Tokgöz Kaplan Cankar verified comprehensiveness Gemini. Four dentist reviewers scored according IADT [27]. evaluation revealed although both potential, quality necessary answers. Avulsion one severe types [28]. Treatment prompt correct emergency management, depends viability periodontal ligament (PDL) cells [29]. can cause PDL, dehydration structural damage fibers [30]. For reason, storing avulsed appropriate medium help preserve prevent breakdown [31]. According International Association Dental Traumatology (IADT), saline, saliva, milk considered natural solutions maintain PDL before replantation [32]. Lee effects six media fibroblasts (PDLF). included Hank's Balanced Salt Solution (HBSS), HBSS supplemented ascorbic acid (Vitamin C), platelet-derived growth (PDGF), combination PDGF acid, platelet lysate, Dulbecco's Modified Eagle Medium [33]. modified mixtures were ability PDLF success procedures. could our overall future considerations proper storage teeth. Central incisors affected [34, 35], especially cases proclined maxillary anterior [36]. Various available different activities age groups [37, 38]. use mouthguards method sports. Different manufacturing exist [39]; custom-made professionally crafted, better fit and, some improved cardiopulmonary capacity athletes. Bhadule created finite element (FEA) scan 12-year-old male [40]. They simulated actual without mouthguard. emphasized importance properly fitted protecting against maxillofacial reducing stress magnitude, dentitions Mouthguards also play role soft tissues sports high-risk [41, 42]. commonly used include stock mouthguards, mouth-formed options, each varying levels comfort protection 43]. despite widespread use, made materials not sufficient shock absorption, durability, user [44, 45]. Therefore, growing alternative develop performance, improve experience, promote adoption. Nassani absorption capacities thermoformed ethylene vinyl acetate (EVA) polyolefin dentistry [46]. toughness traditional EVA sports, highlighting material. Furthermore, fabricated adequate uniform thickness reduce strain produced [47, 48]. Some production processes mouthguards. mouthguard significantly impacts protect effectiveness. single-layered often longitudinal dimensional stability, leading athletes increasingly double-layered [49], which best option [50]. Uma final methods, puncturing technique and/or cooldown period pressing second sheet [51]. fabrication techniques Fiber splint stabilization vital secondary promoting favorable healing due flexibility distribute effectively [52, 53]. Their mastication, advantage [54], making them key component effective [55]. While many stress-distributing properties fiber splints research focusing effectiveness mastication limited. Specifically, whether incisal cervical region unclear. gap restricts optimization treatment approaches biomechanical behavior under dynamic occlusal forces. Ding how splints, either region, affects [56]. Utilizing FEA, placement, contributing strategies traumatic injuries.

Язык: Английский

Процитировано

0

Evaluation of Artificial Intelligence Chatbots in the Management of Primary Tooth Traumas: A Comparative Analysis DOI Open Access
Mihriban Gökcek Taraç

Journal of International Dental Sciences, Год журнала: 2025, Номер 11(1), С. 22 - 31

Опубликована: Март 27, 2025

Aim: This study aimed to evaluate the reliability and consistency of four artificial intelligence (AI) chatbots—ChatGPT 3.5, Google Gemini, Bing, Claude AI—as public sources information on management primary tooth trauma. Materials Methods: A total 31 dichotomous questions were developed based common issues concerns related dental trauma, particularly those frequently raised by parents. Each question, sequentially presented AI chatbots, was repeated three times daily, with a one-hour interval between repetitions, over five-day period, assess reproducibility responses. Accuracy determined calculating proportion correct responses, 95% confidence intervals estimated using Wald binomial method. Reliability assessed Fleiss’ kappa coefficient. Results: All chatbots demonstrated high accuracy. Bing emerged as most accurate model, achieving an accuracy rate 96.34%, while had lowest at 88.17%. Consistency classified “almost perfect” for ChatGPT, whereas exhibited “substantial” level agreement. These findings underscore relative performance models in tasks requiring reliability. Conclusion: results emphasize importance critically evaluating AI-based systems their potential use clinical applications. Continuous improvements updates are essential enhance ensure effectiveness tools.

Язык: Английский

Процитировано

0

Can Artificial Intelligence Language Models Effectively Address Dental Trauma Questions? DOI Creative Commons
Elif Kuru, Aslı Aşık,

Doğukan Mert Demir

и другие.

Dental Traumatology, Год журнала: 2025, Номер unknown

Опубликована: Апрель 1, 2025

Artificial intelligence (AI) chatbots, also known as large language models (LLMs), have become increasingly common educational tools in healthcare. Although the use of LLMs for emergency dental trauma is gaining popularity, it crucial to assess their reliability. This study aimed compare reliabilities different response multiple questions related trauma. In a cross-sectional observational conducted October 2024, 30 (10 multiple-choice, 10 fill-in-the-blank, and dichotomous) based on International Association Dental Traumatology guidelines were posed five LLMs: ChatGPT 4, 3.5, Copilot Free version (Copilot F), Pro P), Google Gemini over nine consecutive days. Responses each model (1350 total) recorded binary format analyzed using Pearson's chi-square Fisher's exact tests correctness consistency (p < 0.05). The answers provided by repeated days showed high degree repeatability. there was no statistically significant difference success rate providing correct among > 0.05), rankings successful follows: 3.5 (76.7%) P (73.3%) F (70%) 4 (63.3%) (46.7%). significantly higher choice fill blank compared performance dichotomous (true/false) Conversely, did not exhibit differences across question types. Notably, explanations often inaccurate, Copilot's cited references had low evidential value. While show potential adjunct traumatology, variable accuracy inclusion unreliable call careful integration strategies.

Язык: Английский

Процитировано

0

Enhancing Patient Comprehension of Glomerular Disease Treatments Using ChatGPT DOI Open Access

Yasir Abdelgadir,

Charat Thongprayoon,

Iasmina Craici

и другие.

Healthcare, Год журнала: 2024, Номер 13(1), С. 57 - 57

Опубликована: Дек. 31, 2024

Background/Objectives: It is often challenging for patients to understand treatment options, their mechanisms of action, and the potential side effects each option glomerular disorders. This study explored ability ChatGPT simplify these options enhance patient understanding. Methods: GPT-4 was queried on sixty-seven disorders using two distinct queries a general explanation an adjusted 8th grade level or lower. Accuracy rated scale 1 (incorrect) 5 (correct comprehensive). Readability measured average Flesch–Kincaid Grade (FKG) SMOG indices, along with Flesch Reading Ease (FRE) score. The understandability score (%) determined Patient Education Materials Assessment Tool Printable (PEMAT-P). Results: GPT-4’s explanations had readability 12.85 ± 0.93, corresponding upper end high school. When tailored at below 8th-grade level, improved middle school 8.44 0.72. FRE PEMAT-P scores also reflected understandability, increasing from 25.73 6.98 60.75 4.56 60.7% 76.8% (p < 0.0001 both), respectively. accuracy significantly lower compared (3.99 0.39 versus 0.66, p 0.0001). Conclusions: shows significant enhancing disorder therapies patients, but cost reduced comprehensiveness. Further research needed refine performance, evaluate real-world impact, ensure ethical use in healthcare settings.

Язык: Английский

Процитировано

1

Evaluation of the Readability, Understandability, and Accuracy of Artificial Intelligence Chatbots in Terms of Biostatistics Literacy DOI Open Access
İlkay Doğan, Pınar Günel Karadeniz, İhsan BERK

и другие.

European Journal of Therapeutics, Год журнала: 2024, Номер 30(6), С. 900 - 909

Опубликована: Дек. 31, 2024

Objective: Chatbots have been frequently used in many different areas recent years, such as diagnosis and imaging, treatment, patient follow-up support, health promotion, customer service, sales, marketing, information technical support. The aim of this study is to evaluate the readability, comprehensibility, accuracy queries made by researchers field through artificial intelligence chatbots biostatistics. Methods: A total 10 questions from topics asked basic biostatistics were determined 4 experts. addressed one experts answers recorded. In study, free versions most widely preferred ChatGPT4, Gemini Copilot used. recorded independently evaluated “Correct”, “Partially correct” “Wrong” three who blinded which chatbot belonged to. Then, these came together examined final evaluation reaching a consensus on levels accuracy. readability understandability with Ateşman formula, Sönmez Çetinkaya-Uzun formula Bezirci-Yılmaz formulas. Results: According given chatbots, it was that at “difficult” level according “insufficient reading level” “academic formula. On other hand, gave result “the text understandable” for all chatbots. It there no statistically significant difference (p=0.819) terms rates questions. Conclusion: although tended provide accurate information, not readable, understandable their high.

Язык: Английский

Процитировано

0