Can artificial intelligence provide evidence-based responses to public health questions?

NewsGuard 100/100 Score

In a recent study published in the JAMA Network Open Journal, researchers assessed artificial intelligence (AI)-generated responses to health-related inquiries.

Study: Evaluating Artificial Intelligence Responses to Public Health Questions. Image Credit: SomYuZu/Shutterstock.com​​​​​​​Study: Evaluating Artificial Intelligence Responses to Public Health Questions. Image Credit: SomYuZu/Shutterstock.com

Background

AI assistants can revolutionize public health by providing precise and practical information to the public. AI assistants are specifically designed to provide exact answers to complex questions instead of web-based knowledge resources that often return multiple results and require the user to synthesize data.

However, AI assistants frequently struggle to identify and address fundamental health inquiries. ChatGPT is an AI assistant that belongs to the latest generation of such assistants. It is developed using advanced large language models that can produce responses that are almost as good as those of humans.

It is currently uncertain how effectively ChatGPT can manage general health inquiries from the general public.

About the study

The study assessed ChatGPT's answers to 23 questions categorized into four groups: addiction, mental health, physical health, and interpersonal violence.

The team used common help-seeking query structures, such as asking questions like "Can you help me quit smoking?" The questions were placed in separate ChatGPT sessions to prevent any influence from previous conversations and ensure the findings could be replicated.

The ChatGPT responses were evaluated by two study authors who were blinded to each other's responses using these questions:

  1. Did ChatGPT respond to the question?
  2. Did the response rely on evidence?
  3. Was the user directed to a suitable resource in the response?

Interrater reliability was measured using Cohen κ while disagreements were resolved via deliberation. The Automated Readability Index was used to evaluate the word count and reading level of ChatGPT responses.

Results

The median length of ChatGPT responses was 225 words. The reading level mode varied between the ninth and sixteenth grades. ChatGPT successfully addressed 23 inquiries across four areas of public health. Two out of the 92 labels were subject to disagreement among evaluators.

The team noted that 21 out of 23 responses were evidence-based. For example, the response for quitting smoking was similar to the steps outlined in the US Centers for Disease Control and Prevention's guide for ceasing smoking, including setting a quitting date, utilizing nicotine replacement therapy, and keeping track of cravings.

Out of the total 23 queries, only five responses provided references to particular resources. Among these, two of 14 queries were related to addiction, two of three were related to interpersonal violence, one was related to mental health, and zero out of three were related to physical health.

The list of resources comprised Alcoholics Anonymous, The National Domestic Violence Hotline, The National Suicide Prevention Hotline, The National Child Abuse Hotline, the Substance Abuse and Mental Health Services Administration National Helpline, and The National Sexual Assault Hotline.

Conclusion

ChatGPT's main focus is providing evidence-based advice for public health inquiries rather than referrals. ChatGPT surpassed the benchmark performance of other AI assistants evaluated in 2017 and 2020.

Despite search engines occasionally emphasizing health-related search results, numerous resources are still not adequately promoted. AI assistants with single-response designs may be more responsible for providing actionable data.

Establishing partnerships between AI companies and public health agencies is crucial to promote proven, effective public health resources.

Public health agencies could provide a recommended resource database to AI companies to improve their responses to public health queries, as these companies may not have the necessary subject matter expertise to make such recommendations. New regulations may encourage AI companies to adopt government-recommended resources.

Journal reference:
Bhavana Kunkalikar

Written by

Bhavana Kunkalikar

Bhavana Kunkalikar is a medical writer based in Goa, India. Her academic background is in Pharmaceutical sciences and she holds a Bachelor's degree in Pharmacy. Her educational background allowed her to foster an interest in anatomical and physiological sciences. Her college project work based on ‘The manifestations and causes of sickle cell anemia’ formed the stepping stone to a life-long fascination with human pathophysiology.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Kunkalikar, Bhavana. (2023, June 09). Can artificial intelligence provide evidence-based responses to public health questions?. News-Medical. Retrieved on April 24, 2024 from https://www.news-medical.net/news/20230609/Can-artificial-intelligence-provide-evidence-based-responses-to-public-health-questions.aspx.

  • MLA

    Kunkalikar, Bhavana. "Can artificial intelligence provide evidence-based responses to public health questions?". News-Medical. 24 April 2024. <https://www.news-medical.net/news/20230609/Can-artificial-intelligence-provide-evidence-based-responses-to-public-health-questions.aspx>.

  • Chicago

    Kunkalikar, Bhavana. "Can artificial intelligence provide evidence-based responses to public health questions?". News-Medical. https://www.news-medical.net/news/20230609/Can-artificial-intelligence-provide-evidence-based-responses-to-public-health-questions.aspx. (accessed April 24, 2024).

  • Harvard

    Kunkalikar, Bhavana. 2023. Can artificial intelligence provide evidence-based responses to public health questions?. News-Medical, viewed 24 April 2024, https://www.news-medical.net/news/20230609/Can-artificial-intelligence-provide-evidence-based-responses-to-public-health-questions.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
AI tool predicts lethal heart rhythm with 80% accuracy