The Complete Inpatient Record Using Comprehensive Electronic Data (CIRCE) project: A team-based approach to clinically validated, research-ready electronic health record data
Learning Health Systems January 15, 2025
Research Areas
PAIR Center Research Team
Topics
Overview
INTRODUCTION: The rapid adoption of electronic health record (EHR) systems has resulted in extensive archives of data relevant to clinical research, hospital operations, and the development of learning health systems. However, EHR data are not frequently available, cleaned, standardized, validated, and ready for use by stakeholders. We describe an in-progress effort to overcome these challenges with cooperative, systematic data extraction and validation.
METHODS: A multi-disciplinary team of investigators collaborated to create the Complete Inpatient Record Using Comprehensive Electronic Data (CIRCE) Project dataset, which captures EHR data from six hospitals within the University of Pennsylvania Health System. Analysts and clinical researchers jointly iteratively reviewed SQL queries and their output to validate desired data elements. Data from patients aged ≥18 years with at least one encounter at an acute care hospital or hospice occurring since 7/1/2017 were included. The CIRCE Project includes three layers: (1) raw data comprised of direct SQL query output, (2) cleaned data with errors removed, and (3) transformed data with standardized implementations of commonly used case definitions and clinical scores.
RESULTS: Between July 1, 2017 and December 31, 2023, the dataset captured 1,629,920 encounters from 740,035 patients. Most encounters were emergency department only visits (n = 965,834, 59.3%), followed by inpatient admissions without an intensive care unit admission (n = 518,367, 23.7%). The median age was 46.9 years (25th–75th percentiles = 31.1–64.7) at the time of the first encounter. Most patients were female (n = 418,303, 56.5%), a significant proportion were of non-White race (n = 272,018, 36.8%), and 54,625 (7.4%) were of Hispanic/Latino ethnicity.
CONCLUSIONS: The CIRCE Project represents a novel cooperative research model to capture clinically validated EHR data from a large diverse academic health system in the greater Philadelphia region and is designed to facilitate collaboration and data sharing to support learning health system activities. Ultimately, these data will be de-identified and converted to a publicly available resource.
Sponsors
Penn Medicine
Authors
Andrea LC Schneider, Jennifer C Ginestra, Meeta Prasad Kerlin, Michael GS Shashaty, Todd A Miano, Daniel S Herman, Oscar JL Mitchell, Rachel Bennett, Alexander T Moffett, John Chandler, Atul Kalanuria, Zahra Faraji, Nicholas S Bishop, Benjamin Schmid, Angela T Chen, Kathryn H Bowles, Thomas Joseph, Rachel Kohn, Rachel R Kelz, George L Anesi, Monisha Kumar, Ari B Friedman, Emily Vail, Nuala J Meyer, Blanca E Himes, Gary E Weissman