May 27, 2024

Improving AI large language models helps them better align with human brain activity

NSP as a computational account of discourse comprehension. (A) Humans integrate words and sentences to achieve a full understanding of discourse. In LLMs, the NSP task proposed by BERT can serve as a computational account of human discourse comprehension. (B) Illustration of the MLM task. (C) Illustration of the NSP task and its relevance to the Mason and Just model. (D) Illustration of Mason and Just’s neurocognitive model of discourse processing. Credit: *Science Advances* (2024). DOI: 10.1126/sciadv.adn7744

With generative artificial intelligence (GenAI) transforming the social interaction landscape in recent years, large language models (LLMs), which use deep-learning algorithms to train GenAI platforms to process language, have been put in the spotlight.

A recent study by The Hong Kong Polytechnic University (PolyU) found that LLMs behave more like the human brain when they are trained in ways that more closely resemble how humans process language, a finding that offers important insights for brain studies and for the development of AI models.

Current LLMs mostly rely on a single type of pretraining—contextual word prediction. This simple learning strategy has achieved surprising success when combined with massive training data and model parameters, as shown by popular LLMs such as ChatGPT.

Recent studies also suggest that word prediction in LLMs can serve as a plausible model for how humans process language. However, humans do not simply predict the next word but also integrate high-level information in natural language comprehension.

A research team led by Prof. Li Ping, Dean of the Faculty of Humanities and Sin Wai Kin Foundation Professor in Humanities and Technology at PolyU, incorporated the next sentence prediction (NSP) task into model pretraining and examined the correlation between the model's data and brain activation. NSP simulates one central process of discourse-level comprehension in the human brain: evaluating whether a pair of sentences is coherent.
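For readers unfamiliar with NSP, the sketch below shows one common way such training examples are built: each sentence is paired either with the sentence that actually follows it (a coherent pair) or with a randomly chosen other sentence (an incoherent pair), roughly half of the time each, as in BERT's original formulation. The function name `make_nsp_pairs`, the `seed` parameter, and the toy sentences are illustrative assumptions, not details taken from the study, whose data preparation may differ.

```python
import random


def make_nsp_pairs(sentences, seed=0):
    """Build (first, second, is_next) examples from an ordered list of sentences.

    Illustrative sketch only: assumes the sentences are unique and in reading order.
    """
    if len(sentences) < 3:
        raise ValueError("need at least 3 sentences to sample an incoherent pair")
    rng = random.Random(seed)
    pairs = []
    for i in range(len(sentences) - 1):
        first = sentences[i]
        if rng.random() < 0.5:
            # Coherent pair: the sentence that actually follows.
            pairs.append((first, sentences[i + 1], True))
        else:
            # Incoherent pair: any sentence except the current one and its true successor.
            candidates = [s for j, s in enumerate(sentences) if j not in (i, i + 1)]
            pairs.append((first, rng.choice(candidates), False))
    return pairs


if __name__ == "__main__":
    # Made-up toy text, used only to show the shape of the examples.
    story = [
        "The storm knocked out the power.",
        "We lit candles in the kitchen.",
        "The lights came back at midnight.",
        "Everyone finally went to bed.",
    ]
    for first, second, is_next in make_nsp_pairs(story):
        print(is_next, "|", first, "->", second)
```

A model pretrained with NSP receives each pair and learns to predict the `is_next` label, in addition to its word prediction objective.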

The study was recently published in the journal Science Advances.

The research team trained two models, one with NSP enhancement and the other without; both also learned word prediction. Functional magnetic resonance imaging (fMRI) data were collected from people reading connected sentences or disconnected sentences. The research team examined how closely the patterns from each model matched up with the brain patterns from the fMRI brain data.
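The article does not specify how the match between model patterns and brain patterns was scored. As a rough illustration of the general idea only, the sketch below computes a Pearson correlation between two equal-length vectors, one standing in for a model-derived pattern and one for an fMRI-derived pattern. The function name `pearson` and the numbers in the usage example are invented placeholders, not the study's method or data.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences of numbers."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Covariance term and the two standard-deviation terms (unnormalized).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant input")
    return cov / (sd_x * sd_y)


if __name__ == "__main__":
    # Placeholder values: one number per stimulus condition, purely hypothetical.
    brain_pattern = [0.2, 0.5, 0.1, 0.9]
    model_pattern = [0.3, 0.4, 0.2, 0.8]
    print(round(pearson(brain_pattern, model_pattern), 3))
```

Under this kind of scoring, a higher correlation for one model than another would indicate that its patterns track the brain data more closely.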

It was clear that training with NSP provided benefits. The model with NSP matched human brain activity in multiple areas much better than the model trained only on word prediction. Its mechanism also nicely maps onto established neural models of human discourse comprehension.

The results give new insights into how our brains process full discourse such as conversations. For example, parts of the right side of the brain, not just the left, contributed to understanding longer discourse. The model trained with NSP could also better predict how fast someone read, which suggests that simulating discourse comprehension through NSP helped the model better capture human language behavior.

Recent LLMs, including ChatGPT, have relied on vastly increasing the training data and model size to achieve better performance. Prof. Li Ping said, "There are limitations in just relying on such scaling. Advances should also be aimed at making the models more efficient, relying on less rather than more data. Our findings suggest that diverse learning tasks such as NSP can improve LLMs to be more human-like and potentially closer to human intelligence."

"More importantly, the findings show how neurocognitive researchers can leverage LLMs to study higher-level language mechanisms of our brain. They also promote interaction and collaboration between researchers in the fields of AI and neurocognition, which will lead to future studies on AI-informed brain studies as well as brain-inspired AI."

More information: Shaoyun Yu et al, Predicting the next sentence (not word) in large language models: What model-brain alignment tells us about discourse comprehension, Science Advances (2024). DOI: 10.1126/sciadv.adn7744

Journal information: Science Advances

Provided by Hong Kong Polytechnic University

Citation: Improving AI large language models helps them better align with human brain activity (2024, May 27) retrieved 11 July 2024 from https://medicalxpress.com/news/2024-05-ai-large-language-align-human.html

