Xorte logo

News Markets Groups

USA | Europe | Asia | World| Stocks | Commodities



Add a new RSS channel

 
 


Keywords

2024-04-18 15:15:17| Engadget

As learning language models (LLMs) continue to advance, so do questions about how they can benefit society in areas such as the medical field. A recent study from the University of Cambridge's School of Clinical Medicine found that OpenAI's GPT-4 performed nearly as well in an ophthalmology assessment as experts in the field, the Financial Times first reported. In the study, published in PLOS Digital Health, researchers tested the LLM, its predecessor GPT-3.5, Google's PaLM 2 and Meta's LLaMA with 87 multiple choice questions. Five expert ophthalmologists, three trainee ophthalmologists and two unspecialized junior doctors received the same mock exam. The questions came from a textbook for trialing trainees on everything from light sensitivity to lesions. The contents aren't publicly available, so the researchers believe LLMs couldn't have been trained on them previously. ChatGPT, equipped with GPT-4 or GPT-3.5, was given three chances to answer definitively or its response was marked as null.  GPT-4 scored higher than the trainees and junior doctors, getting 60 of the 87 questions right. While this was significantly higher than the junior doctors' average of 37 correct answers, it just beat out the three trainees' average of 59.7. While one expert ophthalmologist only answered 56 questions accurately, the five had an average score of 66.4 right answers, beating the machine. PaLM 2 scored a 49, and GPT-3.5 scored a 42. LLaMa scored the lowest at 28, falling below the junior doctors. Notably, these trials occurred in mid-2023.  While these results have potential benefits, there are also quite a few risks and concerns. Researchers noted that the study offered a limited number of questions, especially in certain categories, meaning the actual results might be varied. LLMs also have a tendency to "hallucinate" or make things up. That's one thing if its an irrelevant fact but claiming there's a cataract or cancer is another story. As is the case in many instances of LLM use, the systems also lack nuance, creating further opportunities for inaccuracy.This article originally appeared on Engadget at https://www.engadget.com/gpt-4-performed-close-to-the-level-of-expert-doctors-in-eye-assessments-131517436.html?src=rss


Category: Marketing and Advertising

 

Latest from this category

28.02This retro-inspired handheld comes with Banjo-Kazooie and Battletoads built in
28.02Alaska could be the next state to crack down on AI-generated CSAM and restrict kids' social media use
28.02Shuttered studio Bluepoint reportedly pitched a Bloodborne remake, but it got shot down by FromSoftware
28.02Everything announced at MWC 2026: The new Leica Leitzphone by Xiaomi, Honor's ultra-thin MagicPad 4 and more
28.02Xiaomi 17 Ultra hands-on: Incredible cameras, but maybe hard to get
28.02Leicas Leitzphone by Xiaomi has a huge 1-inch camera sensor and a stylish new design
28.02Steam Next Fest, a different flavor of The Witcher and other new indie games worth checking out
28.02OpenAI strikes a deal with the Defense Department to deploy its AI models
Marketing and Advertising »

All news

28.02Living Fresh Market holds 60-second shopping spree to celebrate Black History Month
28.02Hundreds of thousands of travelers stranded by flight disruptions after attack on Iran
28.02What to know about the clash between the Pentagon and Anthropic over militarys AI use
28.023 conversation-killers to avoid at work
28.02This retro-inspired handheld comes with Banjo-Kazooie and Battletoads built in
28.02Alaska could be the next state to crack down on AI-generated CSAM and restrict kids' social media use
28.02Shuttered studio Bluepoint reportedly pitched a Bloodborne remake, but it got shot down by FromSoftware
28.02Everything announced at MWC 2026: The new Leica Leitzphone by Xiaomi, Honor's ultra-thin MagicPad 4 and more
More »
Privacy policy . Copyright . Contact form .