Text Psychometrics: Assessing Psychological Constructs in Text Using Natural Language Processing

Key findings

This work proposes 'Text Psychometrics' — applying questionnaire-style validity and reliability evaluation to NLP-based construct measurement — and demonstrates it by assessing content validity in crisis-counseling and Reddit corpora and by predicting imminent suicide risk from 49 known risk factors.

Abstract

Source: openalex

Large language models (LLMs) have revolutionized natural language processing (NLP). Yet when used to assess psychological constructs in text, they are generally not evaluated for the types of validity, reliability, and standardization typically expected from traditional questionnaires with rating scales. This study bridges that gap by demonstrating how to evaluate the psychometric properties of text-based models, which we call Text Psychometrics.We first review different NLP methods, compare their ability to address key challenges in psychological research such as explainability, and outline methods for evaluating them on many desirable psychometric properties. We then demonstrate this through two empirical studies. Study 1 classifies thousands of crisis counseling conversations and Reddit posts into different types of mental health issues and introduces a novel method to evaluate text models for content validity —the extent to which a test captures the full range of expressions of a construct. Study 2 examines prospective criterion validity by estimating how 49 known suicide risk factors predict imminent risk in crisis counseling conversations.In sum, NLP studies in psychology often rely on only a few validation metrics; here, we demonstrate the need for broader psychometric evaluation and provide a practical blueprint and future directions for achieving it.

Topics

ml-nlp-knowledge
mental-health-psychiatry

Text Psychometrics: Assessing Psychological Constructs in Text Using Natural Language Processing

Key findings

Abstract

Topics

Lab authors