Friday, August 15, 2025
Science
No Result
View All Result
  • Login
  • HOME
  • SCIENCE NEWS
  • CONTACT US
  • HOME
  • SCIENCE NEWS
  • CONTACT US
No Result
View All Result
Scienmag
No Result
View All Result
Home Science News Mathematics

New FSU research shows statistical analysis can detect when ChatGPT is used to cheat on multiple-choice chemistry exams

August 14, 2024
in Mathematics
Reading Time: 4 mins read
0
Hanson
66
SHARES
599
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT
ADVERTISEMENT

As use of generative artificial intelligence continues to extend into all reaches of education, much of the concern related to its impact on cheating has focused on essays, essay exam questions and other narrative assignments. Use of AI tools such as ChatGPT to cheat on multiple-choice exams has largely gone ignored.

Hanson

Credit: Photo by Colin Hackley

As use of generative artificial intelligence continues to extend into all reaches of education, much of the concern related to its impact on cheating has focused on essays, essay exam questions and other narrative assignments. Use of AI tools such as ChatGPT to cheat on multiple-choice exams has largely gone ignored.

A Florida State University chemist is half of a research partnership whose latest work is changing what we know about this type of cheating, and their findings have revealed how the use of ChatGPT to cheat on general chemistry multiple-choice exams can be detected through specific statistical methods. The work was published in Journal of Chemical Education.

“While many educators and researchers try to detect AI assisted cheating in essays and open-ended responses, such as Turnitin AI detection, as far as we know, this is the first time anyone has proposed detecting its use on multiple-choice exams,” said Ken Hanson, an associate professor in the FSU Department of Chemistry and Biochemistry.  “By evaluating differences in performances between student- and ChatGPT-based multiple-choice chemistry exams, we were able to identify ChatGPT instances across all exams with a false positive rate of almost zero.”

The research is the latest publication in a seven-year collaboration between Hanson and machine learning engineer Ben Sorenson.

Hanson and Sorenson, who first met in third grade, both attended St. Cloud State University in Minnesota for their undergraduate degrees and stayed in touch after moving into their careers. As a faculty member at FSU, Hanson became curious about measuring how much knowledge his students retained from lectures, courses and lab work.

“This was a conversation that I brought to Ben, who’s great with statistics, computer science and data processing,” said Hanson, who is part of a group of FSU faculty working to improve student success in gateway STEM courses such as general chemistry and college algebra. “He said we could use statistical tools to understand if my exams are good, and in 2017, we started analyzing exams.”

The core of this Rasch model is that a student’s probability of getting any test question correct is a function of two things: how difficult the question is and the student’s ability to answer the question. In this case, a student’s ability refers to how much knowledge they have and how many of the necessary components are needed to answer the question they have. Viewing the outcomes of an exam in this way provides powerful insights, researchers said.

“The collaboration between Ken and I, though remote, has been a really seamless, smooth process,” Sorenson said. “Our work is a great way to provide supporting evidence when educators might already suspect that cheating may be happening. What we didn’t expect was that the patterns of artificial intelligence would be so easy to identify.”

Hanson earned his doctorate in chemistry from the University of Southern California in 2010 and completed a postdoctoral position at the University of North Carolina at Chapel Hill before joining FSU’s chemistry faculty in 2013. His lab, the Hanson Research Group, focuses on molecular photochemistry and photophysics, or the study of light — photons — and light’s interaction with molecules. Hanson, a member of the American Chemical Society, has published more than 100 papers and holds over a dozen patents.

To learn more about Hanson’s research and the FSU Department of Chemistry and Biochemistry, visit chem.fsu.edu.

Researchers collected previous FSU student responses from five semesters worth of exams, input nearly 1,000 questions into ChatGPT and compared the outcomes. Average score and raw statistics were not enough to identify ChatGPT-like behavior because there are certain questions that ChatGPT always answered correctly or always answered incorrectly resulting in an overall score that was indistinguishable from students.

“That’s the thing about ChatGPT – it can generate content, but it doesn’t necessarily generate correct content,” Hanson said. “It’s simply an answer generator. It’s trying to look like it knows the answer, and to someone who doesn’t understand the material, it probably does look like a correct answer.”

By using fit statistics, researchers fixed the ability parameters and refit the outcomes, finding ChatGPT’s response pattern was clearly different from that of the students.

On exams, high-performing students frequently answer difficult and easy questions correctly, while average students tend to answer some difficult questions and most easy questions correctly. Low-performing students typically only answer easy questions correctly. But on repeated attempts by ChatGPT to complete an exam, the AI tool sometimes answered every easier question incorrectly and every hard question correctly. Hanson and Sorenson used these behavior differences to detect the use of ChatGPT with almost 100-percent accuracy.

The duo’s strategy of employing a technique known as Rasch modeling and fit statistics can be readily applied to any and all generative AI chat bots, which will exhibit their own unique patterns to help educators identify the use of these chat bots in completing multiple-choice exams.

 



Journal

Journal of Chemical Education

DOI

10.1021/acs.jchemed.4c00165

Article Title

Identifying Generative Artificial Intelligence Chatbot Use on Multiple-Choice, General Chemistry Exams Using Rasch Analysis

Article Publication Date

9-Jul-2024

COI Statement

The authors declare no competing financial interest.

Share26Tweet17
Previous Post

A belly full of jelly

Next Post

Larger teams in academic research worsen career prospects, study finds

Related Posts

blank
Mathematics

Meta-Analysis Suggests Helicobacter pylori Eradication Could Increase Risk of Reflux Esophagitis

August 14, 2025
blank
Mathematics

Innovative Few-Shot Learning Model Boosts Accuracy in Crop Disease Detection

August 13, 2025
blank
Mathematics

Scientists Unveil Mathematical Model Explaining ‘Matrix Tides’ and Complex Wave Patterns in Qiantang River

August 12, 2025
blank
Mathematics

Enhancing Medical Imaging with Advanced Pixel-Particle Analogies

August 12, 2025
blank
Mathematics

Brain-Inspired Devices Become Reality Through Neuromorphic Technology and Machine Learning

August 12, 2025
blank
Mathematics

AI Revolutionizes Gene Editing Precision with CRISPR Technology

August 12, 2025
Next Post

Larger teams in academic research worsen career prospects, study finds

  • Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    27533 shares
    Share 11010 Tweet 6881
  • University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

    947 shares
    Share 379 Tweet 237
  • Bee body mass, pathogens and local climate influence heat tolerance

    641 shares
    Share 256 Tweet 160
  • Researchers record first-ever images and data of a shark experiencing a boat strike

    507 shares
    Share 203 Tweet 127
  • Warm seawater speeding up melting of ‘Doomsday Glacier,’ scientists warn

    310 shares
    Share 124 Tweet 78
Science

Embark on a thrilling journey of discovery with Scienmag.com—your ultimate source for cutting-edge breakthroughs. Immerse yourself in a world where curiosity knows no limits and tomorrow’s possibilities become today’s reality!

RECENT NEWS

  • Trapped in a Social Media Echo Chamber? A New Study Reveals How AI Can Offer an Escape
  • Rewrite FDA-approved MI cancer seek test enhances tumor profiling for precision oncology this news headline for the science magazine post
  • Rewrite Solved: 90-year-old mystery in quantum physics this news headline for the science magazine post
  • Rewrite Rethinking how medicine can approach aging this news headline for the science magazine post

Categories

  • Agriculture
  • Anthropology
  • Archaeology
  • Athmospheric
  • Biology
  • Bussines
  • Cancer
  • Chemistry
  • Climate
  • Earth Science
  • Marine
  • Mathematics
  • Medicine
  • Pediatry
  • Policy
  • Psychology & Psychiatry
  • Science Education
  • Social Science
  • Space
  • Technology and Engineering

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 4,859 other subscribers

© 2025 Scienmag - Science Magazine

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • SCIENCE NEWS
  • CONTACT US

© 2025 Scienmag - Science Magazine

Discover more from Science

Subscribe now to keep reading and get access to the full archive.

Continue reading