Monday, April 13, 2026
Science
No Result
View All Result
  • Login
  • HOME
  • SCIENCE NEWS
  • CONTACT US
  • HOME
  • SCIENCE NEWS
  • CONTACT US
No Result
View All Result
Scienmag
No Result
View All Result
Home Science News Technology and Engineering

Research spotlight: Generative AI “drift” and “nondeterminism” inconsistences are important considerations in healthcare applications

August 12, 2024
in Technology and Engineering
Reading Time: 4 mins read
0
Samuel (Sandy) Aronson, ALM, MA
69
SHARES
624
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT

Samuel (Sandy) Aronson, ALM, MA, executive director of IT and AI Solutions for Mass General Brigham Personalized Medicine and senior director of IT and AI Solutions for the Accelerator for Clinical Transformation, is the corresponding author of a paper published in NEJM AI that looked at whether generative AI could hold promise for improving scientific literature review of variants in clinical genetic testing. Their findings could have a wide impact beyond this use case.   

Samuel (Sandy) Aronson, ALM, MA

Credit: Mass General Brigham

Samuel (Sandy) Aronson, ALM, MA, executive director of IT and AI Solutions for Mass General Brigham Personalized Medicine and senior director of IT and AI Solutions for the Accelerator for Clinical Transformation, is the corresponding author of a paper published in NEJM AI that looked at whether generative AI could hold promise for improving scientific literature review of variants in clinical genetic testing. Their findings could have a wide impact beyond this use case.   

How would you summarize your study for a lay audience?

We tested whether generative AI can be used to identify whether scientific articles contain information that can help geneticists determine whether genetic variants are harmful to patients. While testing this work, we identified inconsistencies in generative AI that could present a risk for patients if not adequately addressed. We suggest forms of testing and monitoring that could improve safety.

What question were you investigating?

We investigated whether generative AI can be used to determine: 1) whether a scientific article contains evidence about a variant that could help a geneticist’s assessment of a genetic variant and 2) whether any evidence found about the variant supports a benign, pathogenic, intermediate or inconclusive conclusion.

What methods or approach did you use?

We tested a generative AI strategy based on GPT-4 using a labeled dataset of 72 articles and compared generative AI to assessments from expert geneticists.

What did you find?

Generative AI performed relatively well, but more improvement is needed for most use cases. However, as we ran our tests repeatedly, we observed a phenomenon we deemed important: running the same test dataset repeatedly produced different results. Through repeated running of the test set over time, we characterized the variability. We found that both drift (changes in model performance over time) and nondeterminism (inconsistency between consecutive runs) were present. We developed visualizations that demonstrate the nature of these problems.

What are the implications?

If a clinical tool developer is not aware that large language models can exhibit significant drift and nondeterminism, they may run their test set once and use the results to determine whether their tool can be introduced into practice. This could be unsafe.

What are the next steps?

Our results show that it could be important to run a test set multiple times to demonstrate the degree of variability (nondeterminism) present. Our results also show that it is important to monitor for changes in performance (drift) over time.          

Authorship: In addition to Aronson, Mass General Brigham authors include Kalotina Machini, Jiyeon Shin, Pranav Sriraman, Emma R. Henricks, Charlotte J. Mailly, Angie J. Nottage, Sami S. Amr, Michael Oates, and Matthew S. Lebo. Additional authors include Sean Hamill.

Paper cited: Aronson SJ et al. “Integrating GPT-4 Models into a Genetic Variant Assessment Clinical Workflow: Assessing Performance, Nondeterminism, and Drift in Classifying Functional Evidence from Literature” NEJM AI DOI: 10.1056/AIcs2400245

Disclosures: Aronson, Shin, Mailly, and Oates report research grants and similar funding via Brigham and Women’s Hospital from Better Therapeutics, Boehringer Ingelheim, Eli Lilly, Milestone Pharmaceuticals, NovoNordisk, and PICORI.  Aronson, Oates, Machini, Henricks, and Lebo report NIH funding through Mass General Brigham. Aronson reports serving as a paid consultant for Nest Genomics.



Journal

NEJM AI

DOI

10.1056/AIcs2400245

Method of Research

Computational simulation/modeling

Subject of Research

Not applicable

Article Title

GPT-4 Performance, Nondeterminism, and Drift in Genetic Literature Review

Article Publication Date

8-Aug-2024

COI Statement

The authors declare no relevant financial disclosures.

Share28Tweet17
Previous Post

Joslin Diabetes Center investigator Rohit N. Kulkarni, MD, PhD, awarded $10 million NIH/NIDDK grant for pioneering diabetes and obesity research

Next Post

Babbling babies need timely responses to learn language, social norms

Related Posts

blank
Technology and Engineering

Bio-Based Sulfonated Cyclodextrin Catalyzes 5-HMF Synthesis

April 12, 2026
blank
Technology and Engineering

Eco-Friendly Geopolymer Bricks Boost Thermal Comfort

April 12, 2026
blank
Technology and Engineering

Prebiotic Xylooligosaccharides Improve Liver Disease via Gut

April 12, 2026
blank
Technology and Engineering

Bayesian Study Links Aging to Visual Hand Bias

April 12, 2026
blank
Technology and Engineering

Hybrid Framework Optimizes Sustainable Heating in Cold Climates

April 12, 2026
blank
Technology and Engineering

E-selectin Microbubbles Boost Kidney Protection in Rats

April 11, 2026
Next Post
Babbling babies need timely responses to learn language, social norms

Babbling babies need timely responses to learn language, social norms

  • Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    27634 shares
    Share 11050 Tweet 6906
  • University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

    1037 shares
    Share 415 Tweet 259
  • Bee body mass, pathogens and local climate influence heat tolerance

    675 shares
    Share 270 Tweet 169
  • Researchers record first-ever images and data of a shark experiencing a boat strike

    538 shares
    Share 215 Tweet 135
  • Groundbreaking Clinical Trial Reveals Lubiprostone Enhances Kidney Function

    523 shares
    Share 209 Tweet 131
Science

Embark on a thrilling journey of discovery with Scienmag.com—your ultimate source for cutting-edge breakthroughs. Immerse yourself in a world where curiosity knows no limits and tomorrow’s possibilities become today’s reality!

RECENT NEWS

  • Distinct Heart, Metabolic, and Inflammatory Risk Profiles Found in Men and Women with Obesity
  • Bio-Based Sulfonated Cyclodextrin Catalyzes 5-HMF Synthesis
  • Eco-Friendly Geopolymer Bricks Boost Thermal Comfort
  • Prebiotic Xylooligosaccharides Improve Liver Disease via Gut

Categories

  • Agriculture
  • Anthropology
  • Archaeology
  • Athmospheric
  • Biology
  • Biotechnology
  • Blog
  • Bussines
  • Cancer
  • Chemistry
  • Climate
  • Earth Science
  • Editorial Policy
  • Marine
  • Mathematics
  • Medicine
  • Pediatry
  • Policy
  • Psychology & Psychiatry
  • Science Education
  • Social Science
  • Space
  • Technology and Engineering

Subscribe to Blog via Email

Success! An email was just sent to confirm your subscription. Please find the email now and click 'Confirm Follow' to start subscribing.

Join 5,145 other subscribers

© 2025 Scienmag - Science Magazine

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • SCIENCE NEWS
  • CONTACT US

© 2025 Scienmag - Science Magazine