Gerontechnology Journal

Who Is Represented in ChatGPT Usability Studies? A Scoping Review on Age Diversity
A. J. Kim, M. Choi.
Full text PDF

( Download count: 115)

Abstract

PURPOSE: Large language models (LLMs) have transformed how we search for and obtain information in our daily lives. Since late 2022, ChatGPT, the most widely used conversational Al agent, has provided information and advice in a natural way, much like conversing with a person. But would older adults, who are not Al natives, also find ChatGPT highly usable? Currently, there is limited understanding of ChatGPT's usability from an age diversity perspective. This paper aims to synthesize the latest evidence from usability evaluation studies on ChatGPT while providing implications for digital inclusion across diverse age groups. METHOD: Following the PRISMA-ScR guidelines, we searched three major databases (Web of Science, Scopus, and PubMed) using the keywords "usability" and "ChatGPT." After removing duplicates, 301 academic papers published since 2023 were identified. After screening using Covidence software, 24 studies met the inclusion criteria and were included in the analysis. RESULTS AND DISCUSSION: Among these 24 studies, only two included participants aged 65 or older, the most commonly used institutional criterion for defining older adults. Including those two, five studies included participants aged 60 or older; however, 15 studies excluded this age group entirely, while four studies did not report participants' ages. Regarding usability assessment, the most common measurement method was the System Usability Scale (SUS), used in 11 studies [1, 2]. Based on studies using the SUS or the Chatbot Usability Questionnaire (CUQ), which shares the same interpretation criteria as the SUS [3], the pooled mean scores were 87.32 (±11.77) for studies including participants aged 60 or older (N = 94) and 80.40 (±15.42) for studies with only participants younger than 60 (N = 66). The standardized mean difference between the two groups was moderate (Hedges' g = 0.52, 95% CI [0.19, 0.85]), indicating significantly higher usability scores in the former group. Compared to the SUS benchmark of 68 points, ChatGPT can be considered to have high usability for both groups. However, very few usability evaluation studies of ChatGPT include older adults, and even in those studies, the proportion of participants aged 60 or older is low. Therefore, it is difficult to conclude that ChatGPT is an easy-to-use technology for diverse age groups, including older adults. Future research should empirically assess the usability of ChatGPT specifically within the aging population, ensuring diverse backgrounds are represented to eliminate selection bias and promote digital equity.

Keywords: usability, artificial intelligence, human-Al interaction, ChatGPT, chatbot, digital equity

A. J. Kim, M. Choi. (2026). Who Is Represented in ChatGPT Usability Studies? A Scoping Review on Age Diversity. Gerontechnology, 25(s),1-1
https://doi.org/10.4017/gt.2026.25.2.1673.3