GPT-D: Inducing Dementia-related Linguistic Anomalies by Deliberate Degradation of Artificial Neural Language Models
Abstract
Deep learning (DL) techniques involving fine-tuning large numbers of model parameters have delivered impressive performance on the task of discriminating between language produced by cognitively healthy individuals and language produced by individuals with Alzheimer’s disease (AD). However, questions remain about their ability to generalize beyond the small reference sets that are publicly available for research. As an alternative to fitting model parameters directly, we propose a novel method by which a Transformer DL model (GPT-2) pre-trained on general English text is paired with an artificially degraded version of itself (GPT-D), to compute the ratio between these two models’ perplexities on language from cognitively healthy and impaired individuals. This technique approaches state-of-the-art performance on text data from a widely used "Cookie Theft" picture description task, and unlike established alternatives also generalizes well to spontaneous conversations. Furthermore, GPT-D generates text with characteristics known to be associated with AD, demonstrating the induction of dementia-related linguistic anomalies. Our study is a step toward better understanding of the relationships between the inner workings of generative neural language models, the language that they produce, and the deleterious effects of dementia on human speech and language characteristics.
1 Introduction
Alzheimer’s disease (AD) dementia affects every aspect of cognition, including language use. Over 50 million people are currently diagnosed with AD dementia, and this number is expected to triple by 2050 (Organization et al., 2017; Patterson, 2018; Prince et al., 2016). Furthermore, over half of the individuals living with dementia are undiagnosed (Lang et al., 2017). While AD has no known cure, timely diagnosis can prevent or alleviate adverse outcomes ranging from anxiety over unexplained symptoms to family discord and catastrophic events (Stokes et al., 2015; Boise et al., 1999; Bond et al., 2005). However, diagnosis of AD dementia is time-consuming and challenging for patients and physicians alike, and currently relies on patient and caregiver reports, extensive neuropsychological examinations, and invasive imaging and diagnostic procedures (Patterson, 2018). Automated analysis of spoken language can potentially provide accurate, easy-to-use, safe and cost-effective tools for monitoring AD-related cognitive markers. In particular, studies have demonstrated that supervised machine learning methods can learn to differentiate accurately between patients with dementia and healthy controls (Fraser et al., 2016; Orimaye et al., 2017), with particularly strong performance from recent deep learning (DL) models (Balagopalan et al., 2020; Roshanzamir et al., 2021). However, the large number of parameters employed in DL presents a danger of overfitting to the small datasets concerned, and hinders interpretability of model predictions - both critical concerns for clinical artificial intelligence applications (Graham et al., 2020).
As an alternative to fitting model parameters directly, we propose a novel method by which a pre-trained Transformer (Vaswani et al., 2017) model, GPT-2 (Radford et al., 2019), is paired with an artificially degraded version of itself (GPT-D), to compute the ratio of model perplexities on language from cognitively healthy and impaired individuals. We anticipate that semantic information lost with dementia progression may be localized to particular layers of a neural language model, and that one can simulate this information loss by systematically modifying parameters in these layers. Specifically, we hypothesize that impairing certain layers of a DL model can result in linguistic deficits that are also observed in dementia. We further hypothesize that unlike prior work fitting model parameters to labeled “Cookie Theft” transcripts, this approach will detect task-agnostic linguistic anomalies, permitting evaluation of language from casual conversations. We evaluate these hypotheses by targeting individual layers for induction of dementia-related linguistic anomalies, resulting in a degraded model – GPT-D. We then assess the ability of a paired perplexity approach combining GPT-2 with GPT-D to identify transcripts from participants with dementia. In addition, we assess generalization performance, and consider the extent to which the best-performing degraded model reflects linguistic anomalies known to occur in AD dementia: usage of higher frequency words, and repetitiveness. The contributions of this work can be summarized as follows: a) we develop a novel method for automated detection of dementia-related linguistic anomalies, involving deliberate degradation of a pre-trained Transformer model; b) this method exhibits state-of-the-art (SOTA) within-set performance for models trained on text alone, and is distinguished by its ability to generalize from cognitive tasks to conversational data; c) the degradation process induces, in language generated by GPT-D, linguistic anomalies observed in dementia (our code is available at https://github.com/LinguisticAnomalies/hammer-nets).
2 Background
Building on a rich body of evidence that machine learning methods can learn to distinguish between language from healthy controls and dementia patients (for a review, see Lyu (2018); Petti et al. (2020)), recent work leveraging pre-trained Transformer models has demonstrated improvements in performance over prior approaches. Balagopalan et al. (2020) fine-tuned the BERT (Devlin et al., 2019) model on the training set of the AD Recognition through Spontaneous Speech (ADReSS) Challenge (Luz et al., 2020), which was developed, in part, to address the lack of standardized train/test splits and subset definitions in prior work using DementiaBank (Becker et al., 1994) (DB). Balagopalan et al. (2020) report an accuracy of 83.3% on the test set, an improvement over machine learning models with expert-defined features. Performance can be further boosted by introducing more data from the same picture description task (Guo et al., 2021). These findings suggest a promising direction, as models can be developed without extensive feature engineering. However, additional task-specific data are not always available. Moreover, DL models with millions of parameters are vulnerable to overfitting on small datasets, and such overfitting may be difficult to detect because these models are hard to interpret.
However, some DL models can be distilled into a single interpretable feature: language model (LM) perplexity (PPL). PPL is a measurement of how well a language sample fits a trained LM. Intuitively, a model trained on language from cognitively healthy participants should be “surprised” by language from participants with dementia, and the opposite should also be true. Accordingly, the difference between the paired perplexities from “cognitively healthy” and “dementia” language models produces SOTA results on the task of identifying transcripts from participants with dementia (Fritsch et al., 2019; Cohen and Pakhomov, 2020), effectively condensing neural network parameters to a single diagnostically useful feature. Contemporary deep LMs such as GPT-2 are already trained on large amounts of text that has presumably been authored predominantly by cognitively healthy individuals. The difficulty with leveraging these models within the paired perplexity paradigm arises from the lack of a correspondingly large set of text from participants with dementia. We negotiate this difficulty by deliberately degrading a Transformer model to limit its semantic processing capabilities, obviating the need for large amounts of dementia-specific training data. We show that the resulting models can effectively identify transcripts from participants with dementia, generalize across language samples and tasks, and generate text with linguistic characteristics of this condition.
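For reference, PPL here is the standard exponentiated average negative log-likelihood of a transcript $w_1, \ldots, w_N$ under a model with parameters $\theta$ (this definition is standard and not restated in the original):

$$\mathrm{PPL}(w_1, \ldots, w_N) = \exp\left(-\frac{1}{N}\sum_{i=1}^{N}\log p_{\theta}(w_i \mid w_{<i})\right)$$

Lower PPL thus means the model finds the sample less surprising, which is what makes the gap between a “cognitively healthy” model and a “dementia” model diagnostically informative.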
3 Methods
Table 1: Characteristics of the datasets used in this study (MMSE and transcript length reported as mean (SD)).

Dataset | Subset | Dementia: N participants | Dementia: MMSE | Dementia: Transcript length | Controls: N participants | Controls: MMSE | Controls: Transcript length
---|---|---|---|---|---|---|---
ADReSS | train | 54 | 17.1 (5.5) | 104 (63) | 54 | 29.1 (1.9) | 114 (49)
ADReSS | test | 24 | 19.5 (5.4) | 95 (47) | 24 | 28.8 (1.5) | 120 (72)
ADReSS | all | 78 | 17.8 (5.5) | 101 (58) | 78 | 29 (1.2) | 116 (56)
DB | | 169 | 20.2 (4.6) | 959 (534) | 99 | 29.1 (1.1) | 1085 (556)
CCC | | 234 | NA | 1213 (943) | 48 | NA | 714 (308)
3.1 Data
We used three publicly available datasets: DB, ADReSS, and the Carolinas Conversation Collection (CCC) (Pope and Davis, 2011). (While these data are publicly available, we are not able to redistribute any of them under our Data Use agreements with DementiaBank and the Carolinas Conversation Collection.) Dataset characteristics are provided in Table 1. DB is a publicly available compendium of manually transcribed audio recordings of neuropsychological tests administered to healthy participants and patients with dementia. A detailed description is available in Becker et al. (1994). In brief, the tests include a picture description task from the Boston Diagnostic Aphasia Examination (Goodglass and Kaplan, 1983), a widely-used diagnostic test for detecting language abnormalities. In this task, participants are presented with a “Cookie Theft” picture stimulus (see Figure 4 in the Appendix) and are asked to describe everything they see occurring in the picture. In other words, the DB data come from tasks that were explicitly designed to detect language abnormalities in dementia patients. We restricted the original set of 194 participants with any AD diagnosis to those assessed as having probable AD, resulting in a set of 169 patients and 99 controls. The ADReSS set is a subset of DB in which the controls and dementia participants were matched for age and gender, resulting in a balanced dataset consisting of a total of 156 samples (78 with dementia and 78 controls) split into training and testing portions. Unlike the two preceding datasets derived from picture description tasks, CCC is a collection of 646 transcribed recordings of interviews of 48 elderly cognitively normal individuals with non-dementia-related chronic conditions and 234 individuals with a diagnosis of dementia. Interview topics vary considerably, and include discussions of the participant’s health.
Additionally, we used a set of six synthetic “Cookie Theft” picture description narratives created by Bird et al. (2000) to study the impact of semantic dementia on verb and noun use in picture description tasks. The transcripts were created to manipulate lexical frequency (which is also relevant in AD dementia, where words with higher lexical frequency tend to feature prominently (Almor et al., 1999)) by first compiling a composite baseline narrative from samples by healthy subjects, and then removing and/or replacing nouns and verbs in that baseline with words of higher lexical frequency (e.g., “mother” vs. “woman” vs. “she”). Lexical frequency was calculated using the Celex Lexical Database (LDC96L14) and words were aggregated into groups based on four log frequency bands (0.5 - 1.0, 1.0 - 1.5, 1.5 - 2.0, 2.5 - 3.0: e.g., words in the 0.5 - 1.0 band occur in Celex more than 10 times per million). We used these synthetic data to help with interpretation of the effects resulting from artificially impairing the GPT-2 model.
We performed basic pre-processing of transcripts in each dataset by which we removed speech artifact descriptions and converted non-ASCII characters to plain text. We also excluded portions of transcripts that represented speech that did not belong to the participant.
3.2 Modeling and Evaluation
We evaluated models for classification performance using the standard ADReSS train/test splits. We then performed cross-validation of GPT-D models to assess the stability of the best-performing configurations across folds. For generalization performance, we evaluated how well models trained on one corpus performed on others. We also assessed differences in text generation between GPT-2 and GPT-D, by estimating repetitiveness and lexical frequency, as well as through salience-based visualization.
3.2.1 Artificial Impairment: Locations
[Figure 1: The two locations of artificial impairment in GPT-2: (1) the input embedding layer, and (2) the Value matrices of the self-attention mechanism in the decoder layers.]
We experimented with impairing the GPT-2 (small) model in two locations, illustrated in Figure 1, at various levels of impairment. Of the levels we tried (25%, 50%, 75% and 100%), masking 50% of the values in the corresponding location generally resulted in the best performance. The embedding layer (see (1) in Figure 1) is a 50,257 × 768 matrix in which each row represents a token in the model’s vocabulary. The embedding layer was impaired by randomly masking 50% of the rows of the embedding matrix. The self-attention mechanism (denoted (2) in Figure 1) was impaired by masking the first 50% of columns in the Value matrix of the concatenated Query-Key-Value matrices. In preliminary experiments, masking random columns resulted in worse performance.
The self-attention mechanism multiplies vectors representing an input sequence by three identically-sized matrices, namely Query (Q), Key (K) and Value (V), each of dimension 768 × 768. Q generates a representation of the current token, which is compared with token representations derived from K to calculate each token’s influence on the contextual representation of the current one. Multiplying by V generates a semantic representation of each token, which is added to the outgoing representation of the current token in accordance with this influence. The attention weights are calculated by Equation 1, and the parameters of these matrices are updated during the training process.
$$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^{T}}{\sqrt{d_k}}\right)V \qquad (1)$$

where $d_k$ is the dimensionality of the key vectors.
The GPT-2 model’s attention mechanism in each of the 12 decoder layers contains 12 attention heads, each represented by a 64-dimensional slice of these matrices. We impaired 50% of each head’s parameters in V, in various combinations of attention heads and decoder layers, by masking them with zeroes. We did this only in the V matrices, as their parameters directly determine the content of the vectors passed on to the subsequent feed-forward layer, while the Q and K matrices determine how this content is weighted when the representations to be propagated are generated as weighted sums of vectors transformed by the Value matrix.
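A minimal sketch of this degradation step, assuming the HuggingFace transformers GPT-2 implementation, in which each decoder layer stores the concatenated Query, Key and Value projections in a single c_attn weight of shape 768 × 2304. The helper name, the decision to leave biases untouched, and the example layer pattern are ours, not the paper’s exact code:

```python
import torch
from transformers import GPT2LMHeadModel

def degrade_gpt2(layers, frac=0.5, heads=range(12)):
    """Return a GPT-2 copy with partially zeroed Value projections.

    For every decoder layer in `layers`, the first `frac` of the columns of each
    selected attention head's slice of the Value matrix is masked with zeroes.
    Biases are left untouched (an assumption; the paper refers to V parameters).
    """
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    n_embd = model.config.n_embd               # 768
    head_dim = n_embd // model.config.n_head   # 64
    v_offset = 2 * n_embd                      # V is the last third of the Q-K-V projection
    with torch.no_grad():
        for layer in layers:
            # c_attn.weight has shape (768, 2304): concatenated Q, K, V projections
            w = model.transformer.h[layer].attn.c_attn.weight
            for h in heads:
                start = v_offset + h * head_dim
                w[:, start:start + int(frac * head_dim)] = 0.0
    return model

# Example: a cumulative pattern covering the first nine decoder layers
gpt_d = degrade_gpt2(layers=range(9))
```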
3.2.2 Artificial Impairment: Patterns
We also experimented with three ways of introducing artificial impairment into the attention mechanism in single and multiple decoder layers: individual, cumulative, and combination. The individual approach was to simply impair each of the 12 layers, one at a time. The cumulative approach consisted of impairing decoder layers sequentially, starting with the bottom decoder layer (layer 0) and adding impairment to the layers above it one at a time up to layer 11, resulting in a total of 12 impairment configurations. The combination approach consisted of impairing all possible combinations of layers, one combination at a time, resulting in 4096 configurations. The degraded models were subsequently used in combination with the original GPT-2 model to calculate the difference and ratio of PPLs between these two models on each input transcript.
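The three pattern families can be enumerated directly; a minimal sketch (variable names ours):

```python
from itertools import combinations

n_layers = 12  # decoder layers in GPT-2 (small)

# individual: impair one layer at a time (12 patterns)
individual = [[i] for i in range(n_layers)]

# cumulative: impair layers 0..i for i = 0..11 (12 patterns)
cumulative = [list(range(i + 1)) for i in range(n_layers)]

# combination: every subset of the 12 layers (2**12 = 4096 patterns,
# counting the empty, i.e. unimpaired, configuration)
combination = [list(c) for r in range(n_layers + 1)
               for c in combinations(range(n_layers), r)]
```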
3.3 Interpretation of Neural Model Behavior
Classification Performance: For the paired perplexity approach, we estimated the ratio of the GPT-2 and GPT-D model PPLs for each transcript. These PPLs were averaged for participants with multiple transcripts. All validation methods commenced with calculating the area under the receiver operating characteristic (ROC) curve (AUC). From this, accuracy (ACC) was determined at the equal error rate (EER), the threshold at which the false acceptance and false rejection rates derived from the ROC curve are equal. We also calculated the Pearson correlation between the ratio of GPT-2 and GPT-D perplexities and the MMSE scores, where available (CORR). To compare our results to those published by others on ADReSS, we used the original fixed single split between training and testing data provided by the creators of that dataset.
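A sketch of how this single feature and the reported metrics might be computed. The ratio direction (PPL(GPT-2) over PPL(GPT-D)), the class coding, and the helper names are assumptions, not the paper’s exact implementation:

```python
import numpy as np
import torch
from scipy.stats import pearsonr
from sklearn.metrics import roc_auc_score, roc_curve

def transcript_ppl(model, tokenizer, text):
    """Perplexity of one transcript under a causal LM: exp of the mean token NLL."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(ids, labels=ids).loss   # mean negative log-likelihood per token
    return torch.exp(loss).item()

def evaluate(scores, labels, mmse=None):
    """AUC, accuracy at the equal-error-rate threshold, and optional Pearson r with MMSE.

    `scores` holds one paired-perplexity value per participant
    (e.g. PPL(GPT-2) / PPL(GPT-D), an assumed direction); `labels` is 1 for
    dementia and 0 for controls (also an assumption about class coding).
    """
    scores, labels = np.asarray(scores, dtype=float), np.asarray(labels)
    auc = roc_auc_score(labels, scores)
    fpr, tpr, thresholds = roc_curve(labels, scores)
    eer_idx = np.nanargmin(np.abs(fpr - (1 - tpr)))           # point where FPR == FNR
    acc = ((scores >= thresholds[eer_idx]).astype(int) == labels).mean()
    corr = pearsonr(scores, mmse)[0] if mmse is not None else None
    return auc, acc, corr
```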
Cross-validation Performance: For all datasets (including ADReSS), we performed standard cross-validation in which we split each dataset into disjoint folds, first determined which combination of impaired GPT-D attention layers yielded the best performance on the training portion of each fold, and then tested that combination on the test portion of the fold, averaging the AUC, ACC and CORR values (where available) across the folds. We selected 5-fold cross-validation due to the relatively small size of the ADReSS, DB, and CCC datasets. To ensure reproducibility across runs, data folds for cross-validation were extracted using the KFold method from the scikit-learn library (Pedregosa et al., 2011) with shuffling and a fixed random seed.
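The fold construction itself reduces to a single scikit-learn call; the seed value and the load_participants loader below are illustrative placeholders rather than the study’s actual code:

```python
from sklearn.model_selection import KFold

# `load_participants` is a hypothetical loader returning one transcript (or
# participant-level group of transcripts) and one label per participant.
transcripts, labels = load_participants("ADReSS")
kfold = KFold(n_splits=5, shuffle=True, random_state=42)   # fixed seed: reproducible folds
for train_idx, test_idx in kfold.split(transcripts):
    # pick the best-performing impairment pattern on train_idx, then score it on test_idx
    ...
```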
Generalization Performance: We tested generalizability of the paired perplexity approach by evaluating its performance across datasets. We first determined the best-performing pattern of impairment based on the highest AUC obtained on each dataset, and then applied the model impaired with that pattern to the remaining datasets.
Baseline Models: We compared our model’s performance on transcript classification with the previous text-only SOTA (Balagopalan et al., 2020), which was obtained with a 12-layer BERT model fine-tuned on the ADReSS training set and evaluated on the test set. To evaluate generalization performance, we followed this work’s hyperparameter choices and fine-tuned BERT and DistilBERT (Sanh et al., 2019), a distilled BERT base model that is compact and more efficient (both available on Hugging Face: https://huggingface.co/transformers/index.html). We fine-tuned these models on the entire ADReSS, DB and CCC datasets separately, then evaluated each of the three resulting models on the other two sets.
Language Generation: To prompt the GPT-2 and GPT-D models to generate text, we utilized Bird et al.’s synthetic “Cookie Theft” picture description narrative that represents a composite of narratives produced by healthy controls. Table 5 (in the Appendix) illustrates the text generated by GPT-2 and GPT-D in response to prompt sentences taken from the synthetic narrative. Both GPT-2 and GPT-D were induced to generate at least 20 additional tokens with a beam search (Wiseman and Rush, 2016) that keeps a fixed number of top hypotheses at each time step and eventually returns the sequence of hypotheses that achieves the highest probability after reaching the end-of-sequence token. Beam search also works well when the length of the output is not predictable, which fits the nature of the language tasks represented by the corpora we tested. However, one of the challenges of using beam search for text generation is that it tends to generate repeated words. We therefore added a penalty for generating repetitive unigrams and implemented the top-p algorithm (Welleck et al., 2019) to keep the set of candidate words as small as possible while the cumulative probability of this set exceeds a specified threshold. The penalty was applied equally to GPT-2 and GPT-D to avoid potentially biasing one of these models toward producing more repetitions. After the models generated the five best predictions for each prompt, we chose the first non-empty pair of outputs from the GPT-2 and GPT-D models as the final result.
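A sketch of generation settings consistent with this description, using the transformers generate API; the specific top-p value, repetition penalty value and maximum length are stand-ins rather than the settings used in the study:

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")   # or a degraded GPT-D copy

prompt = ("The little boy has climbed up, on a three legged stool "
          "to get some cookies from the jar in the cupboard.")
ids = tokenizer(prompt, return_tensors="pt").input_ids

outputs = model.generate(
    ids,
    min_length=ids.shape[1] + 20,        # force at least 20 additional tokens
    max_length=ids.shape[1] + 100,       # stand-in upper bound on generation length
    num_beams=5,                         # beam search keeping 5 hypotheses per step
    num_return_sequences=5,              # five best predictions per prompt
    do_sample=True,
    top_p=0.9,                           # nucleus (top-p) filtering; 0.9 is a stand-in value
    repetition_penalty=1.3,              # stand-in penalty discouraging repeated unigrams
    early_stopping=True,
    pad_token_id=tokenizer.eos_token_id,
)
for seq in outputs:
    print(tokenizer.decode(seq[ids.shape[1]:], skip_special_tokens=True))
```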
Lexical frequency and repetitiveness: Previous work (Cohen and Pakhomov, 2020) suggests that neural language models are sensitive to lexical frequency. We investigated whether GPT-D generates content of higher lexical frequency than the GPT-2 model. To compute lexical frequency, we split each generated output into tokens with NLTK (https://www.nltk.org/). We did not stem the tokens, to avoid inflating lexical frequency by artificially merging different tokens with the same stem. In addition to the stopwords provided by NLTK, we treated tokens with the following part-of-speech tags as stopwords: a) PRP (personal pronoun), b) PRP$ (possessive pronoun), c) WP$ (possessive wh-pronoun), and d) EX (existential there). We also added the n't token and tokens starting with an apostrophe to the list of stopwords. The log lexical frequency of each qualifying generated token was calculated based on its occurrence in the SUBTLEX corpus (Brysbaert and New, 2009). Tokens that do not appear in SUBTLEX were removed as out-of-vocabulary (OOV) items. To assess the degree of repetition present in the generated text, we calculated the type-to-token ratio (TTR) as the number of word types divided by the number of word instances.
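A sketch of the two measures; the SUBTLEX lookup table, the lowercasing, and the exact stopword handling below are simplifications of the procedure described above:

```python
from nltk import pos_tag, word_tokenize
from nltk.corpus import stopwords
# requires: nltk.download("punkt"), nltk.download("averaged_perceptron_tagger"),
#           nltk.download("stopwords")

EXCLUDED_TAGS = {"PRP", "PRP$", "WP$", "EX"}   # pronoun and existential-there tags

def content_tokens(text):
    """Tokens with NLTK stopwords, pronoun tags and contraction fragments removed."""
    stop = set(stopwords.words("english"))
    tagged = pos_tag(word_tokenize(text.lower()))
    return [t for t, tag in tagged
            if t not in stop and tag not in EXCLUDED_TAGS
            and t != "n't" and not t.startswith("'")]

def mean_log_frequency(tokens, subtlex_log_freq):
    """Mean log lexical frequency over in-vocabulary tokens.

    `subtlex_log_freq` is assumed to map a word to its log frequency in SUBTLEX;
    tokens missing from it are dropped as out-of-vocabulary.
    """
    freqs = [subtlex_log_freq[t] for t in tokens if t in subtlex_log_freq]
    return sum(freqs) / len(freqs) if freqs else float("nan")

def type_token_ratio(tokens):
    """Number of distinct word types divided by the number of word instances."""
    return len(set(tokens)) / len(tokens) if tokens else float("nan")
```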
Salience Visualization: We used the gradient × input saliency proposed by Denil et al. (2014), as implemented in the ecco Python package (https://github.com/jalammar/ecco) for visualization. For the token $y_t$ predicted at time step $t$, the saliency of an input token $x_i$ is the L2 norm of the back-propagated gradient of the model output for $y_t$ with respect to the embedding of $x_i$, multiplied element-wise by that embedding: $S(x_i) = \left\lVert \nabla_{x_i} f_{y_t}(x_{<t}) \odot x_i \right\rVert_2$. A previous study (Serrano and Smith, 2019) found that raw attention weights are not readily interpretable for intermediate representations of a language model. Instead, Bastings and Filippova (2020) argued that saliency is the preferred method for interpretability, as it takes the entire input into account and reveals the relevance of each input token to the next predicted token in the sequence.
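For reference, a manual gradient × input sketch of this quantity; the real ecco implementation may differ in details such as normalization and token handling:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def grad_x_input_saliency(prompt):
    """Per-token share of gradient x input saliency for the model's argmax next token."""
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    # token embeddings as a leaf tensor so gradients can flow back to each input token
    embeds = model.transformer.wte(ids).detach().requires_grad_(True)
    logits = model(inputs_embeds=embeds).logits[0, -1]     # next-token logits
    logits[logits.argmax()].backward()                     # back-propagate the predicted token's score
    saliency = (embeds.grad[0] * embeds[0]).norm(dim=-1)   # L2 of gradient * embedding, per token
    saliency = saliency / saliency.sum()                   # express as shares, as in Figure 3
    return tokenizer.convert_ids_to_tokens(ids[0]), saliency.detach()
```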
[Figure 2: Perplexity on the synthetic “Cookie Theft” narratives across levels of simulated impairment, contrasting the GPT-2 model with models whose embeddings or attention were artificially impaired.]
Table 2: Within-set 5-fold cross-validation results for the combination and cumulative impairment patterns (mean (SD) across folds).

Dataset | Combination: AUC (SD) | Combination: ACC (SD) | Combination: r with MMSE (SD) | Cumulative: AUC (SD) | Cumulative: ACC (SD) | Cumulative: r with MMSE (SD)
---|---|---|---|---|---|---
ADReSS | 0.80 (0.06) | 0.71 (0.07) | -0.52 (0.08) | 0.79 (0.02) | 0.68 (0.03) | -0.51 (0.05)
DB | 0.81 (0.07) | 0.76 (0.04) | -0.45 (0.06) | 0.83 (0.02) | 0.73 (0.02) | -0.41 (0.14)
CCC | 0.77 (0.04) | 0.71 (0.04) | – | 0.72 (0.04) | 0.64 (0.09) | –
To make the visualizations comparable for the two models, we repeatedly prompted both models with the same input until both generated the same token as the prediction. It is worth noting that ecco supports a more limited set of text generation arguments than the transformers package, which we used for the language generation task. Consequently, for our visualizations we only used the top-p algorithm currently supported by ecco.
4 Results
Impairment Location: The contrast in the effects of artificial impairment on the embedding and attention layers (locations 1 and 2 in Figure 1, respectively) is illustrated in Figure 2. Impairing embeddings results in a distribution of perplexity values over the range of impairment in the synthetic narratives that is very similar to that of the GPT-2 model. Impairing attention, however, results in a sharp decrease in PPL on the more perturbed narratives (those simulating more impairment), yielding a monotonically increasing, step-like function over the degree of simulated impairment that lends itself well to thresholding for categorization. These results were confirmed by testing the discriminative ability of the paired perplexity approach on the available corpora with only the embedding layer artificially impaired, which resulted in near-random AUCs (close to 0.5; data not shown). Consequently, subsequent results report attention-based models only.
Classification Performance: For comparison with previous work using the ADReSS dataset, the best training set performance was obtained by impairing 50% of each attention head in layers 0-6 and 8-9. This pattern achieved an AUC of 0.88 (ACC = 0.75, CORR = -0.55) on the test split. The cumulative impairment method performed slightly better: impairing 50% of each attention head in the first 9 layers resulted in the best performance on the training set, and an AUC of 0.89 (ACC = 0.85, CORR = -0.64) on the test split. We note that this accuracy exceeds the average result reported by Balagopalan et al. (2020), and approaches the performance of their best run.
Cross Validation: The results of within-set cross-validation are summarized in Table 2. Both the combination and cumulative methods had small standard deviations, with mean AUCs near or above 0.7 on all sets. Estimates from the paired perplexity approach for the two methods were negatively correlated with MMSE on the ADReSS (-0.52, -0.51) and DB (-0.45, -0.41) sets, respectively. The best performance obtained with the individual approach was an AUC of 0.66 (ACC: 0.64) with impairment of layer 8 on the DB dataset; an AUC of 0.70 (ACC: 0.66) with impairment of layer 8 on the ADReSS dataset; and an AUC of 0.71 (ACC: 0.63) with impairment of layer 7 on CCC.
Table 3: Generalization performance (AUC/ACC). Models were configured or fine-tuned on the training dataset (rows) and evaluated on the testing datasets (columns).

Training method (best pattern: AUC) | ADReSS AUC/ACC | DB AUC/ACC | CCC AUC/ACC
---|---|---|---
Cumulative Impairment Pattern | | |
ADReSS (0-8: 0.80) | – | – | 0.77/0.72
DB (0-4: 0.82) | – | – | 0.69/0.68
CCC (0-2: 0.72) | 0.70/0.63 | 0.74/0.63 | –
Combination Impairment Pattern | | |
ADReSS (0-6,8: 0.80) | – | – | 0.76/0.71
DB (0-6,8: 0.80) | – | – | 0.76/0.71
CCC (1-3,5,7,9-11: 0.79) | 0.69/0.61 | 0.72/0.67 | –
Fine-tuned BERT | | |
ADReSS | – | – | 0.64/0.63
DB | – | – | 0.67/0.6
CCC | 0.71/0.66 | 0.7/0.65 | –
Fine-tuned DistilBERT | | |
ADReSS | – | – | 0.67/0.57
DB | – | – | 0.67/0.6
CCC | 0.65/0.62 | 0.47/0.45 | –
[Figure 3: Gradient × input saliency of prompt tokens for GPT-2 and GPT-D when predicting the next token ’the’.]
Generalization: The results of the generalization evaluation are shown in Table 3. Both the cumulative and combination methods yielded similar performance on CCC, where both AUC and ACC were close to or exceeded 0.7. In contrast, fine-tuning BERT and DistilBERT resulted in near-random classification performance on the corresponding validation dataset. While fine-tuning BERT on the conversational discourse samples in CCC and applying it to the picture descriptions in ADReSS and DB generalized comparably well to the paired perplexity approach, it did not generalize in the opposite direction, when BERT was fine-tuned on ADReSS and DB picture descriptions and applied to conversations in CCC.
Language Generation: Table 4 reports mean lexical frequency estimates for words contained in the text generated by the GPT-2 and GPT-D models. The GPT-D models were obtained using the best-performing patterns of impaired layers determined with the cumulative and combination methods for pattern selection on the available datasets. On average, both GPT-2 and GPT-D generated OOV tokens for each prompt. In general, the GPT-D models generated text consisting of words with higher lexical frequency than the text generated by the GPT-2 model, across all datasets and methods, even though some of the differences failed to reach statistical significance. All GPT-D models also generated more repetitions, evident as lower TTRs.
Table 4: Mean log lexical frequency (LF) and type-to-token ratio (TTR) of text generated by GPT-2 and GPT-D under the best-performing impairment patterns.

Dataset (Pattern) | LF: GPT-2 | LF: GPT-D | TTR: GPT-2 | TTR: GPT-D
---|---|---|---|---
Cumulative | | | |
ADReSS (0-8) | 9.48 | 9.82* | 72% | 50%
DB (0-4) | 9.49 | 9.83* | 72% | 49%
CCC (0-2) | 9.48 | 9.54 | 72% | 51%
Combination | | | |
ADReSS/DB (0-6,8) | 9.5 | 9.41 | 72% | 55%
CCC (1-3,5,7,9-11) | 9.45 | 9.92** | 73% | 64%
Salience Visualization: Figure 3 shows the magnitude of the contribution of each token in the prompt used to initiate text generation to each model’s prediction of the same next token, ’the’. The contribution of each token is shown as a percentage that can be interpreted as the amount of weight the model derives from it. We observe in Figure 3 that impairing GPT-2’s attention heads redistributes the model’s contributions across the words in the prompt when the next word is predicted. For the GPT-2 model, the tokens ’boy’, ’climbed’, and ’cookies’ contributed most when predicting ’the’. For the GPT-D model, however, those tokens did not clearly stand out as substantially contributing to the prediction in either example. Furthermore, tokens corresponding to function words (e.g., ’on’, ’a’ and ’from’) contributed little to the predictions generated by the GPT-2 model, but contributed more to the predictions generated by the GPT-D model. As evident in the examples in Figure 3, the salience of the words in the prompt is much more diffuse when the GPT-D model makes the prediction; i.e., the model is uncertain about what it should consider important. In contrast, for the GPT-2 model the key elements of the “Cookie Theft” scenario (’cookie’, ’three-legged stool’, ’boy’) stand out as highly salient. These observations, although informal and qualitative, indicate that impairing the self-attention mechanism in GPT-2 results in “behavior” resembling that observed in all stages of AD dementia, where impaired selective attention in turn reduces one’s ability to encode new information in episodic memory (see Perry et al. (2000) for a comprehensive review).
5 Discussion
Our key findings are as follows. First, we show that the paired perplexity approach using the ratio between the GPT-2 and GPT-D model perplexities approaches SOTA performance on ADReSS, leveraging GPT-2’s extensive pre-training without requiring a comparably large data set from dementia patients. Second, this approach generalizes from “Cookie Theft” picture description data to casual conversation, in contrast to BERT/DistilBERT fine-tuning. Finally, artificial impairment of GPT-2’s self-attention induces linguistic anomalies observed in dementia.
The best-performing cumulative pattern for the ADReSS training set resulted in an accuracy of 0.85 on the test set, exceeding the best BERT results reported on this test set (ACC = 0.833; Balagopalan et al., 2020). However, our approach contrasts with approaches that train or fine-tune language models on a specific dataset and test on held-out portions of the same set. While our approach does require some labeled data with which to determine the best-performing layers to impair, our results demonstrate generalization to other datasets and populations, as well as to a different type of discourse: spontaneous conversations. Across all of these sets, GPT-D is reliably less perplexed than GPT-2 by language exhibiting dementia-related linguistic anomalies. This facilitates broader application of the paired perplexity approach than was previously possible, and suggests that our approach is more sensitive to task-agnostic dementia-related linguistic anomalies than BERT/DistilBERT fine-tuning.
In contrast to impairing embeddings or individual attention layers, the maximum discriminating effect was achieved by impairing multiple attention layers (either combinatorially or cumulatively), which is consistent with prior observations that Transformer layers encode different syntactic and semantic linguistic features across multiple lower and middle layers (Jo and Myaeng, 2020; Jawahar et al., 2019; Lin et al., 2019). Thus, impairing a single layer may not be enough to achieve the full effect. Since both syntactic and semantic context is encoded in the Transformer decoder layers, we expected to find different patterns of artificial impairment to be most effective on the very different types of discourse represented by the DB and CCC datasets; however, we were surprised to find that, in contrast to impairing embeddings or feed-forward network components, only impairing the self-attention layers had the desired effect on the results.
The results presented in Table 4 also align with previously published findings that both neural networks trained on language produced by participants with dementia, and the lexical-retrieval processes of patients affected by this condition, are sensitive to lexical frequency effects (Cohen and Pakhomov, 2020; Pekkala et al., 2013). Our results suggest that impairing the self-attention mechanism in a Transformer artificial neural network may induce similar sensitivity to lexical frequency. By impairing the attention heads in GPT-2, we observe significant differences in the lexical frequency and TTR characteristics of the text generated by GPT-2 and GPT-D, with the change in TTR indicating that GPT-D has a greater tendency to produce repeated words when generating text, just as participants with dementia are more prone to repeat words in picture description tasks (Hier et al., 1985).
In other previous work on the DB and ADReSS datasets, the authors attempted to predict individual MMSE scores in addition to discriminating between cases and controls (Yancheva et al., 2015; Luz et al., 2020). We could not perform a comparable analysis in the current study because we focused on using the paired perplexity measure as a single threshold to distinguish between cases and controls. While predicting MMSE is not the main focus of our study, we did find negative correlations between the paired perplexity measures and MMSE scores, providing additional evidence that artificially impairing the attention mechanism of the GPT-2 model simulates cognitive effects of dementia detectable in language.
Our findings are also consistent with previous work indicating that Transformer models are able to predict neural responses during language comprehension and generalize well across various datasets and brain imaging modalities (Schrimpf et al., 2021). Thus, our work is another step toward a better understanding of the relationship between the inner workings of generative artificial neural language models and the cognitive processes underlying human language production. Impairing how contextual information is stored in the self-attention mechanism in silico creates deficits similar to those observed in dementia. The next important step is perhaps to investigate how the encoding of contextual information is impaired in vivo in AD dementia.
The encouraging results on the CCC dataset point to the possibility of developing a tool for analysing patients’ daily spontaneous conversations in a task-agnostic fashion. Easy-to-interpret language-based instruments for detecting anomalies potentially consistent with dementia, generalizable across tasks and domains, would be most useful in clinical situations where the patient or a family member raises a concern about unexplained changes in cognition. A simple-to-administer (or self-administered) language-based instrument for objective confirmatory testing (either at a single point in time or over a period of time) would help a clinician working in an overburdened and time-constrained clinical environment (e.g., primary care) to validate or refute those cognitive concerns with added confidence. It is critical, however, that the instrument used for confirmatory testing makes as few assumptions as possible regarding the person’s linguistic background or communicative style, or the type of discourse used for analysis (i.e., picture description vs. conversation).
The work presented here has several limitations. The sizes of the datasets are small compared to those typically encountered in open-domain NLP tasks. In this paper, we did not focus on mild cognitive impairment, but acknowledge that it is an important and active area of research that has shown promise in detecting early signs of dementia (Roark et al., 2011; Satt et al., 2014; Calzà et al., 2021). Also, all datasets are in American English, which could limit the applicability of our models to dementia-related differences in other forms of English, and would certainly limit their applicability to other languages. In addition, behavioral characteristics including language anomalies can arise as a result of deficits in multiple brain mechanisms and, while they can contribute to a diagnosis of a neurodegenerative condition as a screening tool, they cannot be used in isolation to establish a definitive diagnosis. While GPT-D’s output resembles language behaviors commonly observed in dementia patients, GPT-2 and GPT-D should not be considered accurate and comprehensive representations of human language and cognition, or models that capture features specific to various forms of neurodegeneration. Lastly, we also note that the pre-trained LM is heavily gender-biased, a problem that we hope ongoing efforts to improve the fairness of AI (e.g., Sheng et al., 2020) will address over time.
6 Conclusion
We developed a novel approach to automated detection of linguistic anomalies in AD, involving deliberately degrading a pre-trained Transformer, with SOTA performance on the ADReSS test set, and generalization to language from conversational interviews. This, and the detection of dementia-related linguistic characteristics in text generated by GPT-D, suggests that our method is sensitive to task-agnostic linguistic anomalies in dementia, broadening the scope of application of methods for automated detection of dementia beyond language from standardized cognitive tasks.
Acknowledgement
This research was supported by grants from the National Institute on Aging (AG069792) and an Administrative Supplement (LM011563-S1) from the National Library of Medicine.
Responsible NLP Research
We followed the Responsible NLP Research checklist and ACL code of ethics for this work.
References
- Almor et al. (1999) Amit Almor, Daniel Kempler, Maryellen C MacDonald, Elaine S Andersen, and Lorraine K Tyler. 1999. Why do alzheimer patients have difficulty with pronouns? working memory, semantics, and reference in comprehension and production in alzheimer’s disease. Brain and language, 67(3):202–227.
- Balagopalan et al. (2020) Aparna Balagopalan, Benjamin Eyre, Frank Rudzicz, and Jekaterina Novikova. 2020. To bert or not to bert: Comparing speech and language-based approaches for alzheimer’s disease detection. Proc. Interspeech 2020, pages 2167–2171.
- Bastings and Filippova (2020) Jasmijn Bastings and Katja Filippova. 2020. The elephant in the interpretability room: Why use attention as explanation when we have saliency methods? In Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, pages 149–155, Online. Association for Computational Linguistics.
- Becker et al. (1994) JT Becker, F Boller, OL Lopez, J Saxton, and KL McGonigle. 1994. The natural history of alzheimer’s disease. description of study cohort and accuracy of diagnosis. Archives of neurology, 51(6):585–594.
- Bird et al. (2000) Helen Bird, Matthew A Lambon Ralph, Karalyn Patterson, and John R Hodges. 2000. The rise and fall of frequency and imageability: Noun and verb production in semantic dementia. Brain and language, 73(1):17–49.
- Boise et al. (1999) Linda Boise, Richard Camicioli, David L. Morgan, Julia H. Rose, and Leslie Congleton. 1999. Diagnosing Dementia: Perspectives of Primary Care Physicians. The Gerontologist, 39(4):457–464.
- Bond et al. (2005) J. Bond, C. Stave, A. Sganga, O. Vincenzino, B. O’connell, and R. L. Stanley. 2005. Inequalities in dementia care across europe: key findings of the facing dementia survey. International Journal of Clinical Practice, 59(s146):8–14.
- Brysbaert and New (2009) Marc Brysbaert and Boris New. 2009. Moving beyond kučera and francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for american english. Behavior research methods, 41(4):977–990.
- Calzà et al. (2021) Laura Calzà, Gloria Gagliardi, Rema Rossini Favretti, and Fabio Tamburini. 2021. Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia. Computer Speech & Language, 65:101113.
- Cohen and Pakhomov (2020) Trevor Cohen and Serguei Pakhomov. 2020. A tale of two perplexities: Sensitivity of neural language models to lexical retrieval deficits in dementia of the Alzheimer’s type. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1946–1957, Online. Association for Computational Linguistics.
- Denil et al. (2014) Misha Denil, Alban Demiraj, and N. D. Freitas. 2014. Extraction of salient sentences from labelled documents. ArXiv, abs/1412.6815.
- Devlin et al. (2019) Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.
- Fraser et al. (2016) Kathleen C Fraser, Jed A Meltzer, and Frank Rudzicz. 2016. Linguistic features identify alzheimer’s disease in narrative speech. Journal of Alzheimer’s Disease, 49(2):407–422.
- Fritsch et al. (2019) Julian Fritsch, Sebastian Wankerl, and Elmar Nöth. 2019. Automatic diagnosis of alzheimer’s disease using neural network language models. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 5841–5845. IEEE.
- Goodglass and Kaplan (1983) Harold Goodglass and Edith Kaplan. 1983. Boston diagnostic aphasia examination booklet. Lea & Febiger.
- Graham et al. (2020) Sarah A Graham, Ellen E Lee, Dilip V Jeste, Ryan Van Patten, Elizabeth W Twamley, Camille Nebeker, Yasunori Yamada, Ho-Cheol Kim, and Colin A Depp. 2020. Artificial intelligence approaches to predicting and detecting cognitive decline in older adults: A conceptual review. Psychiatry research, 284:112732.
- Guo et al. (2021) Yue Guo, Changye Li, Carol Roan, Serguei Pakhomov, and Trevor Cohen. 2021. Crossing the “cookie theft” corpus chasm: Applying what bert learns from outside data to the adress challenge dementia detection task. Frontiers in Computer Science, 3:26.
- Hier et al. (1985) Daniel B Hier, Karen Hagenlocker, and Andrea Gellin Shindler. 1985. Language disintegration in dementia: Effects of etiology and severity. Brain and language, 25(1):117–133.
- Jawahar et al. (2019) Ganesh Jawahar, Benoît Sagot, and Djamé Seddah. 2019. What does BERT learn about the structure of language? In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3651–3657, Florence, Italy. Association for Computational Linguistics.
- Jo and Myaeng (2020) Jae-young Jo and Sung-Hyon Myaeng. 2020. Roles and utilization of attention heads in transformer-based neural language models. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 3404–3417, Online. Association for Computational Linguistics.
- Lang et al. (2017) Linda Lang, Angela Clifford, Li Wei, Dongmei Zhang, Daryl Leung, Glenda Augustine, Isaac M Danat, Weiju Zhou, John R Copeland, Kaarin J Anstey, and Ruoling Chen. 2017. Prevalence and determinants of undetected dementia in the community: a systematic literature review and a meta-analysis. BMJ Open, 7(2).
- Lin et al. (2019) Yongjie Lin, Yi Chern Tan, and Robert Frank. 2019. Open sesame: Getting inside BERT’s linguistic knowledge. In Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 241–253, Florence, Italy. Association for Computational Linguistics.
- Luz et al. (2020) Saturnino Luz, Fasih Haider, Sofia de la Fuente, Davida Fromm, and Brian MacWhinney. 2020. Alzheimer’s dementia recognition through spontaneous speech: The ADReSS Challenge. In Proceedings of INTERSPEECH 2020, Shanghai, China.
- Lyu (2018) Gang Lyu. 2018. A review of alzheimer’s disease classification using neuropsychological data and machine learning. In 2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), pages 1–5. IEEE.
- Organization et al. (2017) World Health Organization et al. 2017. Global action plan on the public health response to dementia 2017–2025.
- Orimaye et al. (2017) Sylvester O Orimaye, Jojo SM Wong, Karen J Golden, Chee P Wong, and Ireneous N Soyiri. 2017. Predicting probable alzheimer’s disease using linguistic deficits and biomarkers. BMC bioinformatics, 18(1):34.
- Patterson (2018) Christina Patterson. 2018. World alzheimer report 2018: The state of the art of dementia research: new frontiers.
- Pedregosa et al. (2011) F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830.
- Pekkala et al. (2013) Seija Pekkala, Debra Wiener, Jayandra J. Himali, Alexa S. Beiser, Loraine K. Obler, Yulin Liu, Ann McKee, Sanford Auerbach, Sudha Seshadri, Philip A. Wolf, and Rhoda Au. 2013. Lexical retrieval in discourse: An early indicator of alzheimer’s dementia. Clinical Linguistics & Phonetics, 27(12):905–921. PMID: 23985011.
- Perry et al. (2000) Richard J Perry, Peter Watson, and John R Hodges. 2000. The nature and staging of attention dysfunction in early (minimal and mild) alzheimer’s disease: relationship to episodic and semantic memory impairment. Neuropsychologia, 38(3):252–271.
- Petti et al. (2020) Ulla Petti, Simon Baker, and Anna Korhonen. 2020. A systematic literature review of automatic alzheimer’s disease detection from speech and language. Journal of the American Medical Informatics Association, 27(11):1784–1797.
- Pope and Davis (2011) Charlene Pope and Boyd H Davis. 2011. Finding a balance: The carolinas conversation collection. Corpus Linguistics & Linguistic Theory, 7(1).
- Prince et al. (2016) Martin Prince, Adelina Comas-Herrera, Martin Knapp, Maëlenn Guerchet, and Maria Karagiannidou. 2016. World alzheimer report 2016: improving healthcare for people living with dementia: coverage, quality and costs now and in the future.
- Radford et al. (2019) Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language models are unsupervised multitask learners. OpenAI blog, 1(8):9.
- Roark et al. (2011) Brian Roark, Margaret Mitchell, John-Paul Hosom, Kristy Hollingshead, and Jeffrey Kaye. 2011. Spoken language derived measures for detecting mild cognitive impairment. IEEE Transactions on Audio, Speech, and Language Processing, 19(7):2081–2090.
- Roshanzamir et al. (2021) Alireza Roshanzamir, Hamid K. Aghajan, and Mahdieh Soleymani Baghshah. 2021. Transformer-based deep neural network language models for alzheimer’s disease risk assessment from targeted speech. BMC Medical Informatics and Decision Making, 21.
- Sanh et al. (2019) Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
- Satt et al. (2014) Aharon Satt, Ron Hoory, Alexandra König, Pauline Aalten, and Philippe H Robert. 2014. Speech-based automatic and robust detection of very early dementia. In Fifteenth Annual Conference of the International Speech Communication Association.
- Schrimpf et al. (2021) Martin Schrimpf, Idan Asher Blank, Greta Tuckute, Carina Kauf, Eghbal A Hosseini, Nancy Kanwisher, Joshua B Tenenbaum, and Evelina Fedorenko. 2021. The neural architecture of language: Integrative modeling converges on predictive processing. Proceedings of the National Academy of Sciences, 118(45).
- Serrano and Smith (2019) Sofia Serrano and Noah A. Smith. 2019. Is attention interpretable? In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2931–2951, Florence, Italy. Association for Computational Linguistics.
- Sheng et al. (2020) Emily Sheng, Kai-Wei Chang, Prem Natarajan, and Nanyun Peng. 2020. Towards Controllable Biases in Language Generation. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 3239–3254, Online. Association for Computational Linguistics.
- Stokes et al. (2015) Laura Stokes, Helen Combes, and Graham Stokes. 2015. The dementia diagnosis: a literature review of information, understanding, and attributions. Psychogeriatrics, 15(3):218–225.
- Vaswani et al. (2017) Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
- Welleck et al. (2019) Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, and Jason Weston. 2019. Neural text generation with unlikelihood training. arXiv preprint arXiv:1908.04319.
- Wiseman and Rush (2016) Sam Wiseman and Alexander M. Rush. 2016. Sequence-to-sequence learning as beam-search optimization. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 1296–1306, Austin, Texas. Association for Computational Linguistics.
- Yancheva et al. (2015) Maria Yancheva, Kathleen C. Fraser, and Frank Rudzicz. 2015. Using linguistic features longitudinally to predict clinical scores for alzheimer’s disease and related dementias. In SLPAT@Interspeech.
Appendix
[Figure 4: The “Cookie Theft” picture stimulus from the Boston Diagnostic Aphasia Examination.]
Table 5: Examples of text generated by GPT-2 and GPT-D in response to prompt sentences from the synthetic “Cookie Theft” narrative (Bird et al., 2000).

Bird Transcript Sentence | GPT-2 Model Output | GPT-D Model Output
---|---|---|
There are two children and their mother in the kitchen. | One of them is a little girl, but she’s not very good at math or science so I don’t really know what to do with her. | This is not a child, but they have no one parent’s daughter or son of sister to father? brother family home with parents who can’t let me know that I am sorry for my mommy baby daddy if you’re just love them so much like it was good old time when we were talking about how many times |
The little boy has climbed up, on a three legged stool to get some cookies from the jar in the cupboard. | I’m so glad you’re here, he says as we sit down at his kitchen table and watch him play with one of our favorite treats: chocolate chip cookie doughnuts! Oh my gosh! I say excitedly before turning back around for another bite that will make me feel good about what’s been happening this whole time… | I don’t know how do you like it’s what is so good for kids love me and they’re all i’m not sure no one day of year after years ago when people say ’I’ve been there was never ever again were always will be back then next time we got into my mom said she told her mother would tell them about their favorite thing that came out at nighttime before school started saying You can go home now or maybe even if your dad says he’ll give him |