ChatGPT matches radiologists in pancreatic cyst analysis

Artificial intelligence (AI) models such as ChatGPT are designed to rapidly process data. Using the AI ChatGPT-4 platform to extract and analyze specific data points from the Magnetic Resonance Imaging (MRI) and computed tomography (CT) scans of patients with pancreatic cysts, researchers found near-perfect accuracy when compared directly against the manual approach of chart review performed by radiologists, according to a study published in the Journal of the American College of Surgeons (JACS). 

ChatGPT-4 is a much more efficient approach, is cost effective, and allows researchers to focus on data analysis and quality assurance rather than the process of reviewing chart after chart. Our study established that this AI approach was essentially equally as accurate as the manual approach, which is the gold standard." 

Kevin C. Soares, MD, MS, study coauthor, hepatopancreatobiliary cancer surgeon at Memorial Sloan Kettering Cancer Center, New York City

Using an existing database of nearly 1,000 adult patients with pancreatic lesions under surveillance between 2010 and 2024 at Memorial Sloan Kettering Cancer Center in New York City, ChatGPT-4 was deployed to identify nine clinical variables used to monitor cyst progression: cyst size, main pancreatic duct size, number of lesions, main pancreatic duct dilation, branch duct dilation, presence of solid component, calcific lesion, pancreatic atrophy, and pancreatitis. Pancreatic cysts are common and require ongoing surveillance because some develop into cancer and require surgery. 

Researchers evaluated ChatGPT-4's ability to identify and classify these nine factors associated with increased risk for dysplasia and cancer. A manually annotated institutional cyst database was used as the standard for comparison. 

Key findings  

  • The study involved 3,198 unique MRI and CT scans from 991 patients under long-term surveillance for premalignant lesions. 

  • ChatGPT-4 successfully extracted clinical variables with high accuracy. The accuracy rate ranged from 97% for a solid component, a high-risk variable, to 99% for calcific lesions. 

  • Accuracy was 92% for cyst size and 97% for main pancreatic duct size, other high-risk variables that may indicate cancer and require surgical resection, biopsy, or endoscopic ultrasound. 

"AI can help us expand medical research and improve patient outcomes," Dr. Soares said. "The question I get asked most often is, 'What is the chance that this cyst is going to develop into cancer?' We now have an efficient way to look at the MRI and CT scans of thousands of patients and give our patients a better answer. This approach goes a long way to reduce anxiety and help patients feel more confident about their treatment decisions." 

While this was a proof-of-concept study, moving forward the study authors say they would like to use AI to expand the number of research questions they ask to enhance patient care. 

"There is a lot of interest in understanding if AI can predict who is going to develop cancer. It's important to understand who progresses and why, so we have a better chance at tailoring surveillance," Dr. Soares said. "We want to limit the number of patient visits, costs to the health care industry, and ultimately provide a customized, rather than one-size-fits-all approach to surveillance." 

The researchers caution that the study used only one AI source, ChatGPT-4, and results are limited to the data that was used. AI can only work with the information that is handed to it. These limitations may reduce the broader applicability of the findings. 

Coauthors are Ankur P. Choubey, MD, MPH; Emanuel Eguia, MD, MS; Alexander Hollingsworth, MS; Subrata Chatterjee, PhD; Remo Alessandris, MD; Misha T. Armstrong, MD, MPH; Emily Manin, MD; Lily V. Saadat, MD; Jennifer Flood, MSN; Avijit Chatterjee, PhD; Vinod P. Balachandran, MD, FACS; Jeffrey A. Drebin, MD, PhD, FACS; T. Peter Kingham, MD, FACS; Michael I. D'Angelica, MD, FACS; William R. Jarnagin, MD, FACS; Alice C. Wei, MD, MSc, FACS; Vineet S. Rolston, MD; Mark A. Schattner, MD; and Richard K. G. Do, MD, PhD. 

Source:
Journal reference:

Choubey, A. P., et al. (2025). Data Extraction and Curation from Radiology Reports for Pancreatic Cyst Surveillance Using Large Language Models. Journal of the American College of Surgeons. doi.org/10.1097/xcs.0000000000001478.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
AI tool enhances detection of tiny breast cancer signs in mammograms