PediatricDigest

Wednesday, 1 May 2024

ChatGPT’s Inconsistent Heart Risk Assessments Raise Concerns for Medical Use

ScienceBlog.com

PLOS ONE

May 1

A new study published in PLOS ONE has found that ChatGPT, despite its reported ability to pass medical exams, may not be reliable for certain health assessments, such as determining the urgency of a patient's chest pain. The research, conducted by Dr. Thomas Heston from Washington State University's Elson S. Floyd College of Medicine and Dr. Lawrence Lewis from Washington University in St. Louis, highlights the potential dangers of relying on the AI system for high-stakes clinical situations.

Inconsistent Results and Varying Risk Assessments

The researchers generated three datasets of 10,000 randomized, simulated cases each, using variables from the TIMI and HEART scales, which are traditionally used by physicians to assess heart risk. When presented with the same patient data, ChatGPT provided inconsistent conclusions, returning different risk assessment levels for identical cases.
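
As a rough illustration of what a randomized, simulated case and a deterministic reference score look like, here is a minimal Python sketch for the HEART scale only. It assumes each of the five HEART components is drawn uniformly from 0 to 2, which is a simplification chosen for this example and not the study's stated procedure. The function names are hypothetical, and the middle band is labeled "moderate" here where the article says "intermediate".

```python
import random

# HEART score components, each scored 0, 1, or 2 (total 0-10).
HEART_COMPONENTS = ("history", "ecg", "age", "risk_factors", "troponin")


def simulate_case(rng):
    """Draw one simulated case (assumption: uniform 0-2 per component)."""
    return {name: rng.randint(0, 2) for name in HEART_COMPONENTS}


def heart_score(case):
    """Sum the five component scores."""
    return sum(case[name] for name in HEART_COMPONENTS)


def heart_category(score):
    """Map a total to the usual bands: 0-3 low, 4-6 moderate, 7-10 high."""
    if score <= 3:
        return "low"
    if score <= 6:
        return "moderate"
    return "high"


rng = random.Random(0)  # fixed seed so the simulated cases are reproducible
cases = [simulate_case(rng) for _ in range(10_000)]
categories = [heart_category(heart_score(case)) for case in cases]
```

Because the scoring rule is a fixed function, the same case always lands in the same band, which is the kind of consistency the researchers did not see from ChatGPT.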

"ChatGPT was not acting in a consistent manner," said lead author Dr. Thomas Heston. "Given the exact same data, ChatGPT would give a score of low risk, then next time an intermediate risk, and occasionally, it would go as far as giving a high risk."

The authors believe this inconsistency is likely due to the level of randomness built into the current version of the software, ChatGPT4, which helps it vary its responses to simulate natural language. However, this randomness does not work well for healthcare uses that require a single, consistent answer.
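
The effect of built-in randomness can be sketched with a toy example. The Python below is not ChatGPT's actual mechanism. It assumes the randomness resembles temperature sampling over a set of candidate answers, and the preference scores are invented for illustration. With a temperature above zero, the same input can yield different risk labels; with greedy selection (temperature zero), it always yields the same one.

```python
import math
import random

LABELS = ("low", "intermediate", "high")


def softmax(scores, temperature):
    """Turn raw scores into probabilities; higher temperature flattens them."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


def pick_label(scores, temperature, rng):
    """Pick a label; temperature 0 means always take the top-scoring label."""
    if temperature == 0:
        return LABELS[scores.index(max(scores))]
    probs = softmax(scores, temperature)
    return rng.choices(LABELS, weights=probs, k=1)[0]


# Hypothetical preference scores for one identical case (made up).
scores = [2.0, 1.5, 0.5]
rng = random.Random(42)

sampled = [pick_label(scores, 1.0, rng) for _ in range(1000)]
greedy = [pick_label(scores, 0, rng) for _ in range(1000)]

print("sampled labels seen:", sorted(set(sampled)))
print("greedy labels seen:", sorted(set(greedy)))
```

In this toy setup the sampled runs spread across more than one label while the greedy runs never change, mirroring the low, intermediate, and occasionally high answers described above.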

The Potential and Pitfalls of AI in Healthcare

Despite the negative findings of this study, Dr. Heston sees great potential for generative AI in healthcare, with further development. For instance, assuming privacy standards could be met, entire medical records could be loaded into the program, allowing doctors to quickly access the most pertinent facts about a patient in an emergency setting. Additionally, for difficult, complex cases, doctors could ask the program to generate several possible diagnoses.

"ChatGPT could be excellent at creating a differential diagnosis and that's probably one of its greatest strengths," said Heston. "If you don't quite know what's going on with a patient, you could ask it to give the top five diagnoses and the reasoning behind each one. So it could be good at helping you think through a problem, but it's not good at giving the answer."

The researchers emphasize the importance of conducting further research on AI systems like ChatGPT, especially in high-stakes clinical situations, to ensure their safe and effective use in healthcare.

"We found there was a lot of variation, and that variation in approach can be dangerous," Heston said. "It can be a useful tool, but I think the technology is going a lot faster than our understanding of it, so it's critically important that we do a lot of research."


