Performance of large language artificial intelligence models on solving restorative dentistry and endodontics student assessments DOI Creative Commons

Paul Künzle,

Sebastian Paris

Clinical Oral Investigations, Год журнала: 2024, Номер 28(11)

Опубликована: Окт. 7, 2024

Abstract Objectives The advent of artificial intelligence (AI) and large language model (LLM)-based AI applications (LLMAs) has tremendous implications for our society. This study analyzed the performance LLMAs on solving restorative dentistry endodontics (RDE) student assessment questions. Materials methods 151 questions from a RDE question pool were prepared prompting using OpenAI (ChatGPT-3.5,-4.0 -4.0o) Google (Gemini 1.0). Multiple-choice sorted into four subcategories, entered answers recorded analysis. P-value chi-square statistical analyses performed Python 3.9.16. Results total answer accuracy ChatGPT-4.0o was highest, followed by ChatGPT-4.0, Gemini 1.0 ChatGPT-3.5 (72%, 62%, 44% 25%, respectively) with significant differences between all except GPT-4.0 models. subcategories direct restorations caries indirect endodontics. Conclusions Overall, there are among LLMAs. Only ChatGPT-4 models achieved success ratio that could be used caution to support dental academic curriculum. Clinical relevance While clinicians field-related questions, this capacity depends strongly employed model. most performant acceptable rates in some subject sub-categories analyzed.

Язык: Английский

ChatGPT Hallucinates when Attributing Answers DOI
Guido Zuccon, Bevan Koopman, Razia S. Shaik

и другие.

Опубликована: Ноя. 23, 2023

Can ChatGPT provide evidence to support its answers? Does the it suggests actually exist and does really answer? We investigate these questions using a collection of domain-specific knowledge-based questions, specifically prompting both an answer supporting in form references external sources. also how different prompts impact answers evidence.

Язык: Английский

Процитировано

22

Detection of fake papers in the era of artificial intelligence DOI
Mehdi Dadkhah, Marilyn H. Oermann, Mihály Hegedüs

и другие.

Diagnosis, Год журнала: 2023, Номер 10(4), С. 390 - 397

Опубликована: Авг. 17, 2023

Abstract Objectives Paper mills, companies that write scientific papers and gain acceptance for them, then sell authorships of these papers, present a key challenge in medicine other healthcare fields. This is becoming more acute with artificial intelligence (AI), where AI writes the manuscripts paper mills papers. The aim current research to provide method detecting fake Methods reported this article uses machine learning approach create decision trees identify data were collected from Web Science multiple journals various Results presents based on results trees. Use case study indicated its effectiveness identifying paper. Conclusions applicable authors, editors, publishers across fields investigate single or conduct an analysis group manuscripts. Clinicians others can use evaluate articles they find search ensure are not instead report actual was peer reviewed prior publication journal.

Язык: Английский

Процитировано

21

ChatGPT in academia: exploring university students’ risks, misuses, and challenges in Jordan DOI
Leen Adel Gammoh

Journal of Further and Higher Education, Год журнала: 2024, Номер 48(6), С. 608 - 624

Опубликована: Июль 2, 2024

ChatGPT, a user-friendly and accessible AI tool, offers revolutionary approach to academic learning. In spite of its benefits, the implementation ChatGPT into university assignments presents possible risks for students. While extensive global research has studied these from students' perspectives, notable gap exists in comprehending academics' standpoints, specifically, Jordan. This study addresses this by conducting semi-structured interviews with 25 academics various professional backgrounds both public private Jordanian universities. Thematic analysis revealed four key associated integration: plagiarism compromised originality; overdependency on technology; diminished critical thinking skills; reduced overall assignment quality. The suggests risk mitigation strategies, including using detection software, implementing disciplinary measures upon discovering students resorting assignments, raising awareness about ChatGPT's advantages risks, establishing clear guidelines usage within institutions. Theoretical contributions encompass filling literature recognising perspectives Jordan providing deeper insights their impact student Practically, findings emphasise need applying prevent misuse, thus, enhancing learning teaching environment. Recognising limitations, instance, context specificity methodology, underlines necessity future explore diverse educational contexts employ mixed methodologies more comprehensive understanding impacts education

Язык: Английский

Процитировано

9

Between tech and text: the use of generative AI in Palestinian universities - a ChatGPT case study DOI Creative Commons
Bilal Hamamra, Asala Mayaleh, Zuheir N. Khlaif

и другие.

Cogent Education, Год журнала: 2024, Номер 11(1)

Опубликована: Июль 22, 2024

This article, drawing on essays written by students with the assistance of ChatGPT and interviews some who used this learning machine, highlights a shift in educational landscape brought about technology. In broader terms, Palestinian universities follow traditional methods teaching based memorization rote learning. These conventional strategies are stark contrast to manner which engage topics when utilizing ChatGPT. However, use has led noticeable declining participation classes, increased absences, diminished enthusiasm for examinations—a departure from value they previously placed them. patterns suggest that introduction is shaping paradigm education, leading us question reevaluate efficacy relevance today's digitized world importance different means evaluation rather than essay writing.

Язык: Английский

Процитировано

9

Performance of large language artificial intelligence models on solving restorative dentistry and endodontics student assessments DOI Creative Commons

Paul Künzle,

Sebastian Paris

Clinical Oral Investigations, Год журнала: 2024, Номер 28(11)

Опубликована: Окт. 7, 2024

Abstract Objectives The advent of artificial intelligence (AI) and large language model (LLM)-based AI applications (LLMAs) has tremendous implications for our society. This study analyzed the performance LLMAs on solving restorative dentistry endodontics (RDE) student assessment questions. Materials methods 151 questions from a RDE question pool were prepared prompting using OpenAI (ChatGPT-3.5,-4.0 -4.0o) Google (Gemini 1.0). Multiple-choice sorted into four subcategories, entered answers recorded analysis. P-value chi-square statistical analyses performed Python 3.9.16. Results total answer accuracy ChatGPT-4.0o was highest, followed by ChatGPT-4.0, Gemini 1.0 ChatGPT-3.5 (72%, 62%, 44% 25%, respectively) with significant differences between all except GPT-4.0 models. subcategories direct restorations caries indirect endodontics. Conclusions Overall, there are among LLMAs. Only ChatGPT-4 models achieved success ratio that could be used caution to support dental academic curriculum. Clinical relevance While clinicians field-related questions, this capacity depends strongly employed model. most performant acceptable rates in some subject sub-categories analyzed.

Язык: Английский

Процитировано

7