The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification

Published: July 4, 2018
Category: Bibliography
Authors: Ashwini Davison MD, Bruce Leff MD, Cynthia M. Boyd MPH MD, Hadi Kharrazi MD PhD, Joe Kimura MPH MD, Jonathan P. Weiner; DrPH, Laura J. Anzaldi BS, Leilani Hernandez MPH
Countries: USA
Language: English
Types: Population Health
Settings: Health Plan



To examine the value of unstructured electronic health record (EHR) data (free‐text notes) in identifying a set of geriatric syndromes.


Retrospective analysis of unstructured EHR notes using a natural language processing (NLP) algorithm.


Large multispecialty group.


Older adults (N=18,341; average age 75.9, 58.9% female).


We compared the number of geriatric syndrome cases identified using structured claims and structured and unstructured EHR data. We also calculated these rates using a population‐level claims database as a reference and identified comparable epidemiological rates in peer‐reviewed literature as a benchmark.


Using insurance claims data resulted in a geriatric syndrome prevalence ranging from 0.03% for lack of social support to 8.3% for walking difficulty. Using structured EHR data resulted in similar prevalence rates, ranging from 0.03% for malnutrition to 7.85% for walking difficulty. Incorporating unstructured EHR notes, enabled by applying the NLP algorithm, identified considerably higher rates of geriatric syndromes: absence of fecal control (2.1%, 2.3 times as much as structured claims and EHR data combined), decubitus ulcer (1.4%, 1.7 times as much), dementia (6.7%, 1.5 times as much), falls (23.6%, 3.2 times as much), malnutrition (2.5%, 18.0 times as much), lack of social support (29.8%, 455.9 times as much), urinary retention (4.2%, 3.9 times as much), vision impairment (6.2%, 7.4 times as much), weight loss (19.2%, 2.9 as much), and walking difficulty (36.34%, 3.4 as much). The geriatric syndrome rates extracted from structured data were substantially lower than published epidemiological rates, although adding the NLP results considerably closed this gap.


Claims and structured EHR data give an incomplete picture of burden related to geriatric syndromes. Geriatric syndromes are likely to be missed if unstructured data are not analyzed. Pragmatic NLP algorithms can assist with identifying individuals at high risk of experiencing geriatric syndromes and improving coordination of care for older adults.

Please log in/register to access.

Log in/Register

LinkedIn Facebook Twitter

© The Johns Hopkins University, The Johns Hopkins Hospital, and Johns Hopkins Health System.
All rights reserved. Terms of Use Privacy Statement

Back to top