2021 IEEE International Conference on Big Data (Big Data), Journal Year: 2024, Volume and Issue: unknown, P. 2419 - 2430
Published: Dec. 15, 2024
Language: Английский
2021 IEEE International Conference on Big Data (Big Data), Journal Year: 2024, Volume and Issue: unknown, P. 2419 - 2430
Published: Dec. 15, 2024
Language: Английский
Research Square (Research Square), Journal Year: 2024, Volume and Issue: unknown
Published: Oct. 8, 2024
Language: Английский
Citations
2Authorea (Authorea), Journal Year: 2024, Volume and Issue: unknown
Published: Aug. 6, 2024
Artificial intelligence has revolutionized various fields through its ability to process and generate human-like text, leading significant advancements in tasks requiring language comprehension generation. However, the evaluation of fundamental reasoning abilities within commercial LLMs, specifically inductive deductive reasoning, remains crucial for understanding their cognitive capabilities limitations. This research provides a comprehensive assessment ChatGPT, Gemini, Claude, using meticulously designed set evaluate performance. The methodology involved selection diverse datasets, design complex tasks, implementation robust automated testing framework. Statistical analyses, including ANOVA regression techniques, were employed rigorously compare models’ performance across different tasks. Results indicated that ChatGPT consistently outperformed other models, particularly excelling high precision recall, while Gemini Claude exhibited variability capabilities. study highlights strengths weaknesses each model, offering insights into relative potential areas improvement. Implications AI development are significant, emphasizing need tailored model designs continued innovation training techniques enhance abilities. contributes broader providing foundation future developing more capable reliable intelligent systems.
Language: Английский
Citations
0Authorea (Authorea), Journal Year: 2024, Volume and Issue: unknown
Published: Oct. 14, 2024
The growing complexity and scale of modern deep learning models have improved the ability to generate understand human language, yet challenges persist in achieving robust generalization syntactic flexibility.Dynamic Syntactic Insertion (DSI) addresses these limitations through novel introduction random variations during finetuning phase, enhancing model's capacity process diverse linguistic structures.Through empirical experiments on GPT-NeoX architecture, significant performance improvements were observed across multiple metrics, including robustness, fluency, accuracy.The DSI-enhanced model consistently outperformed baseline, particularly handling syntactically complex perturbed datasets, demonstrating its adaptability a broader range inputs.Furthermore, incorporation variability led reductions perplexity increased tasks GLUE benchmark, highlighting method's effectiveness.The findings from this study suggest that augmentation techniques, such as DSI, provide promising pathway for improving resilience language environments.
Language: Английский
Citations
02021 IEEE International Conference on Big Data (Big Data), Journal Year: 2024, Volume and Issue: unknown, P. 2419 - 2430
Published: Dec. 15, 2024
Language: Английский
Citations
0