Challenges with quality of race and ethnicity data in observational databases.

Overview

abstract

OBJECTIVE: We sought to assess the quality of race and ethnicity information in observational health databases, including electronic health records (EHRs), and to propose patient self-recording as an improvement strategy. MATERIALS AND METHODS: We assessed completeness of race and ethnicity information in large observational health databases in the United States (Healthcare Cost and Utilization Project and Optum Labs), and at a single healthcare system in New York City serving a racially and ethnically diverse population. We compared race and ethnicity data collected via administrative processes with data recorded directly by respondents via paper surveys (National Health and Nutrition Examination Survey and Hospital Consumer Assessment of Healthcare Providers and Systems). Respondent-recorded data were considered the gold standard for the collection of race and ethnicity information. RESULTS: Among the 160 million patients from the Healthcare Cost and Utilization Project and Optum Labs datasets, race or ethnicity was unknown for 25%. Among the 2.4 million patients in the single New York City healthcare system's EHR, race or ethnicity was unknown for 57%. However, when patients directly recorded their race and ethnicity, 86% provided clinically meaningful information, and 66% of patients reported information that was discrepant with the EHR. DISCUSSION: Race and ethnicity data are critical to support precision medicine initiatives and to determine healthcare disparities; however, the quality of this information in observational databases is concerning. Patient self-recording through the use of patient-facing tools can substantially increase the quality of the information while engaging patients in their health. CONCLUSIONS: Patient self-recording may improve the completeness of race and ethnicity information.

authors

Polubriaginof, Fernanda C G

Ryan, Patrick

Salmasian, Hojjat

Shapiro, Andrea Wells

Perotte, Adler

Safford, Monika M
Hripcsak, George
Smith, Shaun
Tatonetti, Nicholas P
Vawdrey, David K

publication date

August 1, 2019

published in

Journal of the American Medical Informatics Association : JAMIA Journal

Research

keywords

Databases, Factual
Ethnicity
Racial Groups

Identity

PubMed Central ID

PMC6696496

Scopus Document Identifier

85071354053

Digital Object Identifier (DOI)

10.1093/jamia/ocz113

PubMed ID

31365089

Additional Document Info

has global citation frequency

74

volume

26

issue

8-9

VIVO Weill Cornell Medical College

Challenges with quality of race and ethnicity data in observational databases. Academic Article

Overview

abstract

authors

publication date

published in

Research

keywords

Identity

PubMed Central ID

Scopus Document Identifier

Digital Object Identifier (DOI)

PubMed ID

Additional Document Info

has global citation frequency

volume

issue