ChatGPT Achieves 85% in Professional-Level Neurology Exam
In a recent cross-sectional study, researchers explored the performance of large language models (LLMs) on neurology board-style examinations.
The study used a question bank approved by the American Board of Psychiatry and Neurology and compared the models' scores with the mean score of human test takers.
ChatGPT Dominates Neurology Exam
The study involved two versions of ChatGPT: version 3.5 (LLM 1) and version 4 (LLM 2). The findings revealed that LLM 2 significantly outperformed its predecessor and even surpassed the mean human score on the neurology board examination.
According to the findings, LLM 2 correctly answered 85.0% of questions, compared with a mean human score of 73.8%.
This data suggests that, with further refinements, large language models could find significant applications in clinical neurology and healthcare.
ChatGPT Performs Better On Lower-Order Exam Questions
Even the older model, LLM 1, performed adequately, scoring 66.8%, which is 7.0 percentage points below the human average.
Both models consistently used confident language regardless of whether their answers were correct, indicating a potential area for improvement in future iterations.
The study categorized questions as lower-order or higher-order based on Bloom's taxonomy.
Both models performed better on lower-order questions. However, LLM 2 excelled on both lower-order and higher-order questions, showcasing its versatility.
Disclaimer
In adherence to the Trust Project guidelines, BeInCrypto is committed to unbiased, transparent reporting. This news article aims to provide accurate, timely information. However, readers are advised to verify facts independently and consult with a professional before making any decisions based on this content.